A mid-sized GPT-4.1 model delivering performance competitive with GPT-4o at substantially lower latency and cost. Retains a 1 million token context window and demonstrates strong coding ability and vision understanding, making it suitable for interactive applications with tight performance constraints.
This page collects the public integration surface for the model: supported endpoints, available request parameters, and example calls through the NagaAI API.
Chat Completions