Grok 4 Fast Non-Reasoning
Chat Completionsgrok-4-fast-non-reasoning
Chat Completions
A blazing fast variant for instant, cost-effective answers without reasoning traces. Built on the same Grok 4 Fast backbone for unified quality and efficiency, it excels at search, summarization, Q&A, and lightweight agent use. Delivers low latency, reduced token cost, and supports the 2M token context for long inputs. Perfect for rapid and scalable information workflows.
Throughput
Time-To-First-Token (TTFT)
Not enough TTFT data