Skip to content
Dashboard

Mistral Large 3

Mistral Large 3 is a large-scale MoE model from Mistral AI, using a sparse mixture-of-experts architecture with 41B active parameters out of 675B total, the company's first MoE release since the Mixtral series.

Tool UseVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'mistral/mistral-large-3',
prompt: 'Why is the sky blue?'
})

Playground

Try out Mistral Large 3 by Mistral AI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

mistral logo
mistral logo

Ask Mistral Large 3 anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Mistral AI
256K
0.5s
53tps
$0.50/M$1.50/M——
12/02/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Mistral AI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
256K
0.3s
89tps
$1.50/M$7.50/M——
+1
mistral logo
04/29/2026
$0.15/M——
mistral logo
05/28/2025
128K
0.3s
190tps
$0.10/M$0.10/M——
mistral logo
10/16/2024
128K
0.3s
83tps
$0.15/M$0.15/M——
mistral logo
10/16/2024
32K
0.3s
196tps
$0.10/M$0.30/M——
mistral logo
09/17/2024
$0.10/M——
mistral logo
12/11/2023

About Mistral Large 3

Announced December 2, 2025, Mistral Large 3 marks Mistral AI's return to the mixture-of-experts (MoE) architecture that defined their earlier Mixtral series, now at a larger scale. With 675B total parameters and 41B active per forward pass, Mistral Large 3 represents a substantial architectural evolution from the dense models that preceded it in the Large lineage.

The sparse MoE design lets Mistral Large 3 maintain inference efficiency comparable to a smaller dense model while drawing on a large total parameter pool for complex tasks. This architecture offers a tradeoff between capability and inference cost.

Through AI Gateway, you can access Mistral Large 3 without separate Mistral AI API credentials. Built-in observability gives you cost and latency visibility across every request.

What To Consider When Choosing a Provider

  • Configuration: Mistral Large 3's return to MoE architecture brings sparse activation, where only part of the total parameters run per token, to Mistral AI's largest general-purpose open release as of the Mistral 3 announcement.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Mistral Large 3

Best For

  • High-capability MoE tasks: Demanding Mistral AI's general-purpose MoE lineup
  • Complex reasoning and analysis: Tasks that benefit from a large total parameter pool
  • Long-form content generation: Long outputs where coherent multi-step logic has to hold across the whole piece
  • Mistral AI ecosystem fit: Applications that rely on its tooling, fine-tuning, or enterprise agreements
  • MoE inference efficiency: Workflows preferred over pure dense-model approaches

Consider Alternatives When

  • Explicit chain-of-thought reasoning: Your task requires reasoning traces (consider Magistral Medium)
  • Primary cost constraint: Mistral Small or a Ministral variant meets accuracy requirements at lower per-token cost than the 675B flagship
  • Vision capabilities: You need multimodal input (consider Pixtral Large)

Conclusion

Mistral Large 3 brings back sparse MoE at a larger scale than Mixtral. For teams that want Mistral AI's largest general-purpose open MoE with 41B active parameters per forward pass, it fills that tier.