
GLM 4.5 Air
Zhipu AI
glm-4.5-airCheap agentic MoE (106B/12B active). Fast with implicit caching.
Best for: Bulk agent, long context, cheap
At a glance
Context window
131K
Input
$0.18/1M
$0.02/1M cached
Output
$1.15/1M
Speed
fast
Quality tier
strong
Per 1M tokens. All-in pricing, no hidden fees.
Capabilities
Tool calling
Connect to external tools and APIs
Reasoning
Extended thinking for complex problems
Prompt caching
Cache repeated prefixes for 90% discount
Streaming
Real-time token-by-token output
Structured outputs
JSON mode and function calling
Quick start
Get API Keycurl https://kymaapi.com/v1/chat/completions \
-H "Authorization: Bearer $KYMA_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "glm-4.5-air",
"messages": [{"role": "user", "content": "Hello"}]
}'Details
CreatorZhipu AI
Model IDglm-4.5-air
Quality tierstrong
Cost tiercheap
Input modalityText
Output modalityText
Prompt cachingSupported
Try GLM 4.5 Air now
$0.50 free credits on signup. No credit card required.