Review GPT-4.1 Nano on key metrics including price, context length, throughput, and model features.
AuthorOpenAI
Context Length1.0M
Supports Tools
The fastest and most cost-effective model in the GPT-4.1 series, designed for tasks demanding low latency such as classification and autocompletion. Maintains a 1 million token context window and delivers exceptional performance at a small size, outperforming even some larger models on key benchmarks.