Grok 4.1 Fast Reasoning

grok-4.1-fast-reasoning
byxAI|Created Nov 20, 2025
Chat Completions

Grok 4.1 Fast Reasoning is xAI's most capable tool-calling model, engineered for production-grade agentic applications with a 2M token context window. Achieving state-of-the-art results on Berkeley Function Calling v4 and leading agentic search benchmarks like Research-Eval Reka (63.9) and FRAMES (87.6), it excels at multi-turn conversations, long-horizon planning, and autonomous task execution. Built through RL training in real-world simulated environments, Grok 4.1 Fast Reasoning delivers exceptional performance on complex enterprise scenarios like customer support and finance while cutting hallucination rates in half compared to its predecessor.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.20

Cached Input Tokens (1M)

$0.05

Output Tokens (1M)

$0.50

Capabilities

Input Modalities

Text
Image

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Max Completion Tokens
Parallel Tool Calls
Response Format
Temperature
Tool Choice
Tools
Top P
Web Search Options