Kimi K2 Thinking

Chat Completions

kimi-k2-thinking

MoonShotAI|Created Nov 6, 2025|262.1k context

Chat Completions

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model, extending the K2 series into agentic, long-horizon reasoning. Built on a trillion-parameter Mixture-of-Experts (MoE) architecture, it activates 32 billion parameters per forward pass and supports a 256k-token context window. Optimized for persistent step-by-step thought and dynamic tool use, it enables complex reasoning workflows and stable multi-agent behavior across 200–300 tool calls, setting new open-source records on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench. With MuonClip optimization and large-scale MoE architecture, it delivers strong reasoning depth and high inference efficiency for demanding agentic and analytical tasks.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Pricing-50%

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.25

Cached Input Tokens (1M)

$0.07

Output Tokens (1M)

$1.25

Capabilities

Input Modalities

Text

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Frequency PenaltyMax Completion TokensPresence PenaltyResponse FormatStopTool ChoiceToolsTop PReasoning EffortTemperatureLogit BiasLogprobs