Skip to content
Dashboard

Ministral 8B

Ministral 8B brings an interleaved sliding-window attention architecture to edge inference, delivering faster and more memory-efficient processing across its full context window of 128K tokens at $0.15 per million tokens.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'mistral/ministral-8b',
prompt: 'Why is the sky blue?'
})

More models by Mistral AI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
256K
0.4s
66tps
$1.50/M$7.50/M
+1
mistral logo
04/29/2026
256K
0.4s
61tps
$0.50/M$1.50/M
mistral logo
12/02/2025
$0.15/M
mistral logo
05/28/2025
128K
0.3s
151tps
$0.10/M$0.10/M
mistral logo
10/16/2024
32K
0.4s
183tps
$0.10/M$0.30/M
mistral logo
09/17/2024
$0.10/M
mistral logo
12/11/2023