Grok 4.1 Fast Non-Reasoning
Chat Completions
grok-4.1-fast-non-reasoning
Chat Completions
Grok 4.1 Fast Non-Reasoning is xAI's high-speed variant optimized for instant responses and straightforward queries, featuring a 2M token context window. Designed for production workflows requiring rapid inference without deep reasoning overhead, it maintains strong tool-calling capabilities and multi-turn consistency while delivering faster response times. Ideal for real-time applications, customer-facing chatbots, and scenarios where speed is critical, Grok 4.1 Fast Non-Reasoning balances performance with cost-effectiveness for efficient, production-ready agent deployments.
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Input Tokens (1M)
$0.20
Cached Input Tokens (1M)
$0.05
Output Tokens (1M)
$0.50
Capabilities
Input Modalities
Text
Image
Output Modalities
Text
Supported Parameters
Available parameters for API requests
Max Completion Tokens
Parallel Tool Calls
Response Format
Temperature
Tool Choice
Tools
Top P
Web Search Options