Qwen 3.6 Max Preview

Qwen 3.6 Max Preview is Alibaba's early-access release of the Qwen 3.6 Max line, a preview build that focuses on coding-agent execution, front-end development quality, and long-tail knowledge over a context window of 240K tokens.

ReasoningTool UseImplicit CachingFile Input

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen-3.6-max-preview',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Playground

Try out Qwen 3.6 Max Preview by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Qwen 3.6 Max Preview

Ask Qwen 3.6 Max Preview anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

Alibaba

240K

1.6s

73tps

$1.30/M

$7.80/M

Read:

$0.26/M

Write:

$1.63/M

—

04/20/2026

More models by Alibaba

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

alibaba/qwen3.7-plus

0.7s

327tps

$0.32/M

$1.28/M

Read:$0.08/M

Write:$0.5/M

—

06/01/2026

alibaba/qwen3.7-max

991K

2.4s

55tps

$1.25/M

$3.75/M

Read:$0.25/M

Write:$1.56/M

—

05/21/2026

alibaba/qwen3.6-plus

1.5s

109tps

$0.50/M

$3.00/M

Read:

$0.1/M

Write:

$0.63/M

—

04/02/2026

alibaba/qwen3.5-flash

1.4s

161tps

$0.10/M

$0.40/M

Read:$0.0/M

Write:$0.13/M

—

02/24/2026

alibaba/qwen3-embedding-0.6b

33K

$0.01/M

—

11/14/2025

alibaba/qwen3-embedding-8b

33K

$0.05/M

—

06/05/2025

About Qwen 3.6 Max Preview

Qwen 3.6 Max Preview is the preview entry in the Qwen 3.6 Max line, succeeding the previously released Qwen3-Max and complementing the Plus-tier Qwen3.6-Plus. Compared with those models, Qwen 3.6 Max Preview targets a different center of gravity: coding agents that need to plan and act over long horizons, front-end development work that demands accurate component output, and broader long-tail knowledge retention.

The model is served as a closed-weight API release with a context window of 240K tokens and a maximum output of 64K tokens, which gives it room for full repository context, multi-file diffs, and extended agent traces without segmentation. Tagged capabilities include reasoning, tool use, implicit caching, file input, and vision, so it can ingest images and document files alongside text within the same request.

The preview designation signals that Alibaba is iterating on Qwen 3.6 Max Preview ahead of its production release. Teams that integrate against the preview can validate prompt templates, evaluate output formatting for front-end scaffolding, and benchmark coding-agent loops against the new capability profile while AI Gateway handles provider routing and authentication. Because AI Gateway abstracts the underlying endpoint, migrating to the GA model identifier is a single configuration change.

What To Consider When Choosing a Provider

Configuration: Preview models can shift in capability or rate-limit behavior before general availability. Validate prompts and tool schemas against the preview, but plan a regression pass before migrating production traffic to the GA identifier.
Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Qwen 3.6 Max Preview

Best For

Long-Horizon Coding Agents: Pipelines that plan and act across many tool-call turns with a context window of 240K tokens
Front-End Development Work: Tasks where the model produces accurate component code, layout, and styling output
Pre-GA Evaluation: Validating prompts and tool schemas against a near-final Max-tier model before production rollout
Multimodal Coding Workflows: Combining screenshots, design references, and text in a single request via file input and vision
Repository And Document Ingestion: Long-context tasks that load full codebases or document plus image inputs in one call

Consider Alternatives When

GA Stability Required: Wait for the Qwen 3.6 Max general availability release when production guarantees are needed
Smaller Tier Sufficient: Qwen3.6-Plus is a closer fit when the Plus line already meets your quality bar at lower cost
Visible Reasoning Traces: Qwen3-Max-Thinking is a stronger match when chain-of-thought transparency drives evaluation
Text-Only Pipelines: A dedicated text model offers lower cost per token when vision and file input go unused

Conclusion

Qwen 3.6 Max Preview extends the Qwen Max line with coding-agent and front-end focus, available through AI Gateway ahead of general availability. Routing through AI Gateway means migrating from the preview identifier to the GA model is a configuration change rather than an integration rewrite.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Qwen 3.6 Max Preview

Playground

Providers

More models by Alibaba

About Qwen 3.6 Max Preview

What To Consider When Choosing a Provider

When to Use Qwen 3.6 Max Preview

Best For

Consider Alternatives When

Conclusion