Skip to content
Dashboard

Qwen 3.6 Max Preview

Qwen 3.6 Max Preview is Alibaba's early-access release of the Qwen 3.6 Max line, a preview build that focuses on coding-agent execution, front-end development quality, and long-tail knowledge over a context window of 240K tokens.

ReasoningTool UseImplicit CachingFile Input
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen-3.6-max-preview',
prompt: 'Why is the sky blue?'
})

Playground

Try out Qwen 3.6 Max Preview by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

alibaba logo
alibaba logo

Ask Qwen 3.6 Max Preview anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Alibaba
240K
1.6s
73tps
$1.30/M
$7.80/M
Read:
$0.26/M
Write:
$1.63/M
——
+2
04/20/2026
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Alibaba

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
0.7s
327tps
$0.32/M$1.28/M
Read:$0.08/M
Write:$0.5/M
——
+3
alibaba logo
fireworks logo
togetherai logo
06/01/2026
991K
2.4s
55tps
$1.25/M$3.75/M
Read:$0.25/M
Write:$1.56/M
——
+2
alibaba logo
05/21/2026
1M
1.5s
109tps
$0.50/M
$3.00/M
Read:
$0.1/M
Write:
$0.63/M
——
+3
alibaba logo
fireworks logo
togetherai logo
04/02/2026
1M
1.4s
161tps
$0.10/M$0.40/M
Read:$0.0/M
Write:$0.13/M
——
+3
alibaba logo
02/24/2026
33K
$0.01/M——
deepinfra logo
11/14/2025
33K
$0.05/M——
deepinfra logo
06/05/2025

About Qwen 3.6 Max Preview

Qwen 3.6 Max Preview is the preview entry in the Qwen 3.6 Max line, succeeding the previously released Qwen3-Max and complementing the Plus-tier Qwen3.6-Plus. Compared with those models, Qwen 3.6 Max Preview targets a different center of gravity: coding agents that need to plan and act over long horizons, front-end development work that demands accurate component output, and broader long-tail knowledge retention.

The model is served as a closed-weight API release with a context window of 240K tokens and a maximum output of 64K tokens, which gives it room for full repository context, multi-file diffs, and extended agent traces without segmentation. Tagged capabilities include reasoning, tool use, implicit caching, file input, and vision, so it can ingest images and document files alongside text within the same request.

The preview designation signals that Alibaba is iterating on Qwen 3.6 Max Preview ahead of its production release. Teams that integrate against the preview can validate prompt templates, evaluate output formatting for front-end scaffolding, and benchmark coding-agent loops against the new capability profile while AI Gateway handles provider routing and authentication. Because AI Gateway abstracts the underlying endpoint, migrating to the GA model identifier is a single configuration change.

What To Consider When Choosing a Provider

  • Configuration: Preview models can shift in capability or rate-limit behavior before general availability. Validate prompts and tool schemas against the preview, but plan a regression pass before migrating production traffic to the GA identifier.
  • Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Qwen 3.6 Max Preview

Best For

  • Long-Horizon Coding Agents: Pipelines that plan and act across many tool-call turns with a context window of 240K tokens
  • Front-End Development Work: Tasks where the model produces accurate component code, layout, and styling output
  • Pre-GA Evaluation: Validating prompts and tool schemas against a near-final Max-tier model before production rollout
  • Multimodal Coding Workflows: Combining screenshots, design references, and text in a single request via file input and vision
  • Repository And Document Ingestion: Long-context tasks that load full codebases or document plus image inputs in one call

Consider Alternatives When

  • GA Stability Required: Wait for the Qwen 3.6 Max general availability release when production guarantees are needed
  • Smaller Tier Sufficient: Qwen3.6-Plus is a closer fit when the Plus line already meets your quality bar at lower cost
  • Visible Reasoning Traces: Qwen3-Max-Thinking is a stronger match when chain-of-thought transparency drives evaluation
  • Text-Only Pipelines: A dedicated text model offers lower cost per token when vision and file input go unused

Conclusion

Qwen 3.6 Max Preview extends the Qwen Max line with coding-agent and front-end focus, available through AI Gateway ahead of general availability. Routing through AI Gateway means migrating from the preview identifier to the GA model is a configuration change rather than an integration rewrite.