Grok 4 Fast Non-Reasoning

Chat Completions

grok-4-fast-non-reasoning

xAI|Created Sep 20, 2025|2M context

Chat Completions

A blazing fast variant for instant, cost-effective answers without reasoning traces. Built on the same Grok 4 Fast backbone for unified quality and efficiency, it excels at search, summarization, Q&A, and lightweight agent use. Delivers low latency, reduced token cost, and supports the 2M token context for long inputs. Perfect for rapid and scalable information workflows.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Throughput

Not enough throughput data

Time-To-First-Token (TTFT)