GPT-4.1 Nano

Chat Completions

gpt-4.1-nano-2025-04-14

OpenAI|Created May 25, 2025|1.0M context

Chat Completions

The fastest and most cost-effective model in the GPT-4.1 series, designed for tasks demanding low latency such as classification and autocompletion. Maintains a 1 million token context window and delivers exceptional performance at a small size, outperforming even some larger models on key benchmarks.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Pricing-50%

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.05

Output Tokens (1M)

$0.20

Capabilities

Input Modalities

TextImageFile

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Frequency PenaltyFunction CallFunctionsLogit BiasMax Completion TokensParallel Tool CallsPredictionPresence PenaltyResponse FormatStopTemperatureTool ChoiceToolsTop PWeb Search Options