This page collects the public integration surface for the model: supported endpoints, available request parameters, and example calls through the NagaAI API.
Chat CompletionsThe fastest and most cost-effective model in the GPT-4.1 series, designed for tasks demanding low latency such as classification and autocompletion. Maintains a 1 million token context window and delivers exceptional performance at a small size, outperforming even some larger models on key benchmarks.