GLM 5.2 Fast
Fast version of GLM 5.2 with 120-250 TPS.
ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({ model: 'zai/glm-5.2-fast', prompt: 'Why is the sky blue?'})Playground
Try out GLM 5.2 Fast by Z.ai. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Ask GLM 5.2 Fast anything to try it out.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
Throughput24 hours
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
Latency24 hours
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Uptime24 hours
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by Z.ai
| Model |
|---|