Z.ai: GLM 5 Turbo
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...
Anyone in the Space can @-mention Z.ai: GLM 5 Turbo with the team's shared context — pooled credits, one chat, one memory.
Starter is free forever — 1 Space, 100 credits/month, 1 MCP. No card.
Specifications
- Provider
- z-ai
- Category
- llm
- Context length
- 202,752 tokens
- Max output
- 131,072 tokens
- Modalities
- text
- License
- proprietary
- Released
- 2026-03-15
Pricing
- Input
- $1.20/Mtok
- Output
- $4.00/Mtok
- Model ID
z-ai/glm-5-turbo
Per-token prices show what the model costs upstream. On Switchy your team draws from one shared org credit pool — one plan, one balance for everyone.
Team cost calculator
5 seats · 80 msgs/day
Switchy meters this against your org's shared credit pool — one plan, one balance for everyone.
Providers
| Provider | Context | Input | Output | P50 latency | Throughput | 30d uptime |
|---|---|---|---|---|---|---|
| z-ai | 203k | $1.20/Mtok | $4.00/Mtok | — | — | — |
Performance
Benchmarks
Works well with
Top MCPs
Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.