GLM 4.7 Flash

LLM

Z.AI

GLM 4.7 Flash is a compact open-weight model optimized for lightweight deployment, balancing efficiency with strong coding, agentic task planning, and tool collaboration capabilities in a 30B class.

Context tokens

203,000

Output tokens

16,384

Docs

Model documentation

Schema

OpenAI-compatible chat format.

Schema documentation

Capabilities

Thinking

Multilingual

Supported languages

Supported tools

Seclai Web Tools
Fetch web pages and search the web from within agent prompt calls. Includes seclai_web_fetch for retrieving page content in markdown, HTML, or plain text, and seclai_web_search for finding relevant pages with content snippets.

Pricing

Type	Credits	Units
Input	0.93	Credits per 1k tokens
Output	5.32	Credits per 1k tokens

Variants