zai

GLM 4.5 AirX

zai/glm-4.5-airx

GLM 4.5 AirX represents the sweet spot between intelligence and efficiency. It inherits the 4.5 knowledge corpus yet fits inside an eight‑billion active parameter envelope through selective activation and gated experts. With advanced quantization and compressive adapters, it runs at near X speed while staying within Air budget constraints. Benchmarks show eighty‑five percent of flagship accuracy on reasoning and coding while costing less than half to operate. It is ideal for conversational search, code review, and medium‑length content generation where quality matters but every inference cycle is billed. Smart scheduling APIs further optimize batch utilization across mixed hardware.

Tools

Function Calling

Context Window

128,000

Max Output Tokens

96,000

ProviderInput Token PriceOutput Token Price
zai$0.50/Million Tokens$2/Million Tokens