GPT-4.1 Nano

Review GPT-4.1 Nano on key metrics including price, context length, throughput, and model features.

AuthorOpenAI

Context Length1.0M

Supports Tools

The fastest and most cost-effective model in the GPT-4.1 series, designed for tasks demanding low latency such as classification and autocompletion. Maintains a 1 million token context window and delivers exceptional performance at a small size, outperforming even some larger models on key benchmarks.

Activity

Last 14 days

Prompt

48K

Completion

19K

Total

67K

Startup

OpenAI

Latency (p50)0.84s

Throughput (p50)77.3 tok/s

Pricing

Input$0.05/M tokens

Output$0.20/M tokens

Features

Input Modalitiestext, image, file

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model