Kimi K2 Thinking

kimi-k2-thinking
byMoonShotAI|Created Nov 6, 2025
Chat Completions

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model, extending the K2 series into agentic, long-horizon reasoning. Built on a trillion-parameter Mixture-of-Experts (MoE) architecture, it activates 32 billion parameters per forward pass and supports a 256k-token context window. Optimized for persistent step-by-step thought and dynamic tool use, it enables complex reasoning workflows and stable multi-agent behavior across 200–300 tool calls, setting new open-source records on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench. With MuonClip optimization and large-scale MoE architecture, it delivers strong reasoning depth and high inference efficiency for demanding agentic and analytical tasks.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.30

Cached Input Tokens (1M)

$0.07

Output Tokens (1M)

$1.25

Capabilities

Input Modalities

Text

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Frequency Penalty
Max Completion Tokens
Presence Penalty
Response Format
Stop
Temperature
Tool Choice
Tools
Top P