GPT 5.1 Chat

Chat Completions

gpt-5.1-chat

OpenAI|Created Nov 13, 2025|128k context

Chat Completions

GPT-5.1 Chat (also known as Instant) is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Pricing-50%

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.63

Cached Input Tokens (1M)

$0.06

Output Tokens (1M)

$5.00

Capabilities

Input Modalities

TextImageFile

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Frequency PenaltyFunction CallFunctionsLogit BiasMax Completion TokensParallel Tool CallsPredictionPresence PenaltyReasoning EffortResponse FormatStopTool ChoiceToolsWeb Search Options