Kimi K2 0905

moonshotai/kimi-k2-0905

Kimi K2 0905 is Moonshot AI's September 2025 K2 checkpoint, a refined release focused on agentic coding workflows. It has a 256K-token context window and is available through AI Gateway via Fireworks.

Tool Use
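index.ts
import { streamText } from 'ai'

// Stream a completion from Kimi K2 0905 through AI Gateway
const result = streamText({
  model: 'moonshotai/kimi-k2-0905',
  prompt: 'Why is the sky blue?'
})

for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}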

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway supports Zero Data Retention for this model on direct gateway requests. BYOK requests are not covered. See the AI Gateway documentation for configuration details.

  • Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
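The credential choice described above can be sketched as a small helper. This is a minimal sketch, not AI Gateway's own logic: the helper name `resolveGatewayCredential`, the environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN`, and the precedence (API key first) are all assumptions made for illustration.

```typescript
// Hypothetical helper: decide which credential to send to AI Gateway.
// The variable names and the "API key first" order are assumptions.
type GatewayCredential =
  | { kind: 'api-key'; token: string }
  | { kind: 'oidc'; token: string }

function resolveGatewayCredential(
  env: Record<string, string | undefined>
): GatewayCredential {
  const apiKey = env['AI_GATEWAY_API_KEY']?.trim()
  if (apiKey) return { kind: 'api-key', token: apiKey }

  const oidc = env['VERCEL_OIDC_TOKEN']?.trim()
  if (oidc) return { kind: 'oidc', token: oidc }

  // Neither credential is present, so fail early with a clear message
  throw new Error('No AI Gateway credential found: set an API key or an OIDC token')
}
```

In a Node.js process this could be called as `resolveGatewayCredential(process.env)`. Either way, no provider-specific key is involved.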

This checkpoint routes across fewer providers than base K2. If you observe elevated latency during high-demand periods, check provider-level status.
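One way to notice elevated latency is to track recent request durations on the client side. The sketch below is an illustration under stated assumptions: the class name `LatencyWindow`, the 95th-percentile measure, and the "twice the baseline" threshold are all hypothetical choices, not something AI Gateway prescribes.

```typescript
// Hypothetical client-side tracker for recent request latencies (milliseconds).
class LatencyWindow {
  private samples: number[] = []
  private readonly size: number

  constructor(size = 50) {
    this.size = size
  }

  // Record one request duration, keeping only the most recent `size` samples
  record(ms: number): void {
    this.samples.push(ms)
    if (this.samples.length > this.size) this.samples.shift()
  }

  // Nearest-rank 95th percentile, or undefined when nothing was recorded
  p95(): number | undefined {
    if (this.samples.length === 0) return undefined
    const sorted = [...this.samples].sort((a, b) => a - b)
    const rank = Math.ceil((95 * sorted.length) / 100)
    return sorted[rank - 1]
  }

  // True when p95 exceeds the baseline by the given factor (assumed default: 2x)
  isElevated(baselineMs: number, factor = 2): boolean {
    const current = this.p95()
    return current !== undefined && current > baselineMs * factor
  }
}
```

When `isElevated` returns true, that is a reasonable moment to look at provider-level status.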

When to Use Kimi K2 0905

Best For

  • Long agentic sessions:

    Accumulated context (tool outputs, file contents, multi-turn history) pushes beyond the base K2 context window

  • September 5, 2025 training refinements:

    Workloads targeting the newer checkpoint's agentic coding improvements

  • Full-codebase review:

    Multi-file code review or generation where the 256K-token context window can hold the full codebase in a single call

  • Drop-in upgrade:

    Existing base K2 integrations seeking a direct upgrade to the newer checkpoint

Consider Alternatives When

  • Chain-of-thought traces:

    Kimi K2 Thinking variants are designed for explicit reasoning output

  • Maximum routing redundancy:

    Base Kimi K2 routes across a wider provider set than this checkpoint

  • Fastest K2 inference:

    Kimi K2 Turbo is the speed-optimized variant

  • Shorter context needs:

    Tasks that don't require the full 256K tokens benefit from base K2's broader failover pool
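The trade-offs in the two lists above can be sketched as a small selection function. This is a sketch under assumptions: only `moonshotai/kimi-k2-0905` appears on this page, so the other model IDs, the `Workload` shape, and the order in which the checks run are hypothetical.

```typescript
// Only 'moonshotai/kimi-k2-0905' is taken from this page; the other IDs are assumed.
const K2_MODELS = {
  checkpoint0905: 'moonshotai/kimi-k2-0905',
  base: 'moonshotai/kimi-k2',
  thinking: 'moonshotai/kimi-k2-thinking',
  turbo: 'moonshotai/kimi-k2-turbo',
} as const

// Hypothetical description of what a workload needs
interface Workload {
  needsReasoningTrace: boolean     // explicit chain-of-thought output
  latencyCritical: boolean         // fastest inference matters most
  needsLongContext: boolean        // context exceeds what base K2 can hold
  wantsAgenticRefinements: boolean // targets the 0905 agentic coding improvements
}

function pickK2Variant(w: Workload): string {
  if (w.needsReasoningTrace) return K2_MODELS.thinking
  if (w.latencyCritical) return K2_MODELS.turbo
  if (w.needsLongContext || w.wantsAgenticRefinements) return K2_MODELS.checkpoint0905
  // Short-context work without special needs: base K2 has the wider provider set
  return K2_MODELS.base
}
```

The priority order is a design choice for this sketch. A team that values the 0905 refinements over raw speed would move that check higher.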

Conclusion

Kimi K2 0905 delivers September 5, 2025 training refinements for agentic coding alongside a context window of 256K tokens that accommodates the long histories of extended coding agent sessions. For teams running base K2 in agentic coding workflows, it's the checkpoint update with the larger context window. Switch by changing the model string to moonshotai/kimi-k2-0905 with no other integration changes.

FAQ

Kimi K2 0905 is optimized for agentic coding. The checkpoint refines multi-step development tasks, tool use in coding workflows, and sustained context across long coding sessions.

Coding agents accumulate context rapidly: file contents, function signatures, test outputs, error logs, and multi-turn reasoning traces all consume tokens. A window of 256K tokens keeps a much larger project scope in context at once, which cuts truncation workarounds.
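To make the accumulation concrete, the sketch below estimates how much of the window a set of context pieces would use. It rests on loud assumptions: "256K" is read as 256,000 tokens, tokens are approximated at four characters each rather than counted with the model's tokenizer, and the names `estimateTokens` and `checkContextBudget` are hypothetical.

```typescript
const CONTEXT_WINDOW_TOKENS = 256000 // assumed reading of "256K"; confirm the exact limit
const CHARS_PER_TOKEN = 4            // rough heuristic, not the model's tokenizer

function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN)
}

interface BudgetReport {
  usedTokens: number
  remainingTokens: number
  fits: boolean
}

// Sum the estimated tokens of every context piece (files, logs, history)
// and compare against the window, leaving room for the model's reply.
function checkContextBudget(parts: string[], reservedForOutput = 8000): BudgetReport {
  const usedTokens = parts.reduce((sum, part) => sum + estimateTokens(part), 0)
  const remainingTokens = CONTEXT_WINDOW_TOKENS - reservedForOutput - usedTokens
  return { usedTokens, remainingTokens, fits: remainingTokens >= 0 }
}
```

An agent loop could call `checkContextBudget` before each request and only fall back to truncation or summarization when `fits` is false.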

To switch from base K2, update the model string in your API call to moonshotai/kimi-k2-0905. Authentication, tool-calling format, and the rest of the integration stay the same.
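As an illustration of how small that change is, the sketch below keeps the model string in one place so the swap touches a single constant. The `GatewayRequest` shape, the `buildRequest` helper, and the base K2 ID `moonshotai/kimi-k2` are assumptions for this example. Only `moonshotai/kimi-k2-0905` comes from this page.

```typescript
// Hypothetical request shape: only the fields used on this page
interface GatewayRequest {
  model: string
  prompt: string
}

const BASE_K2 = 'moonshotai/kimi-k2'      // assumed ID for base K2
const K2_0905 = 'moonshotai/kimi-k2-0905' // ID given on this page

// The model defaults to the 0905 checkpoint; nothing else in the request changes
function buildRequest(prompt: string, model: string = K2_0905): GatewayRequest {
  return { model, prompt }
}

const oldRequest = buildRequest('Refactor utils.ts', BASE_K2)
const newRequest = buildRequest('Refactor utils.ts')
```

Here `oldRequest` and `newRequest` differ only in their `model` field.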

AI Gateway routes Kimi K2 0905 through Fireworks. Where more than one provider serves the model, failover between them is automatic.

Kimi K2 0905 is open-weight, in the same lineage as other open-weight K2-family models. Check Moonshot AI's Hugging Face repository for license terms specific to this checkpoint.

Kimi K2 0905 supports tool calling through the standard function-calling interface, which matches the agentic coding focus of the 0905 training refinements.
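A tool-calling round trip can be sketched without any SDK. The example assumes an OpenAI-style function tool definition, which this page does not spell out. The `read_file` tool, the in-memory `workspace`, and the `runToolCall` dispatcher are hypothetical stand-ins for what a coding agent might expose.

```typescript
// Assumed OpenAI-style function tool definition; the wire format is not given on this page
interface ToolDefinition {
  type: 'function'
  function: {
    name: string
    description: string
    parameters: Record<string, unknown> // JSON Schema for the arguments
  }
}

// Hypothetical tool a coding agent might offer to the model
const readFileTool: ToolDefinition = {
  type: 'function',
  function: {
    name: 'read_file',
    description: 'Return the contents of a file in the workspace',
    parameters: {
      type: 'object',
      properties: { path: { type: 'string' } },
      required: ['path'],
    },
  },
}

// In-memory stand-in for a real workspace
const workspace: Record<string, string> = { 'src/index.ts': 'export const x = 1' }

// Run a tool call requested by the model; arguments arrive as a JSON string
function runToolCall(toolName: string, rawArgs: string): string {
  if (toolName !== readFileTool.function.name) {
    return `error: unknown tool ${toolName}`
  }
  const args = JSON.parse(rawArgs) as { path?: unknown }
  if (typeof args.path !== 'string') return 'error: "path" must be a string'
  return workspace[args.path] ?? `error: no such file ${args.path}`
}
```

The string returned by `runToolCall` would be sent back to the model as the tool result. In a long agentic session these results are a large part of the accumulated context.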

If context length isn't a constraint, the base Kimi K2 routes across a wider provider set and may give more availability headroom for high-uptime production use.