A blazing fast variant for instant, cost-effective answers without reasoning traces. Built on the same Grok 4 Fast backbone for unified quality and efficiency, it excels at search, summarization, Q&A, and lightweight agent use. Delivers low latency, reduced token cost, and supports the 2M token context for long inputs. Perfect for rapid and scalable information workflows.
This page collects the public integration surface for the model: supported endpoints, available request parameters, and example calls through the NagaAI API.
Chat Completions