Skip to content
Dashboard

GLM 5.2 Fast

Fast version of GLM 5.2 with 120-250 TPS.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'zai/glm-5.2-fast',
prompt: 'Why is the sky blue?'
})
Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.