GLM 5 is a flagship reasoning model from Z.AI excelling at complex reasoning, multilingual programming, and agentic workflows with interleaved thinking, 200K context, and improved tool collaboration.
OpenAI-compatible chat format via Google Vertex AI MaaS. GLM 5 uses a GLM-4-compatible schema; see the documentation URL.
Schema documentationSupported languages
No tools enabled.
| Type | Credits | Units |
|---|---|---|
| Input | 13.30 | Credits per 1k tokens |
| Output | 42.56 | Credits per 1k tokens |
| Cache hit | 1.33 | Credits per 1k tokens |
No variants available for this model.