GPT-5.4 vs GPT-5.3-Codex

Compare GPT-5.4 and GPT-5.3-Codex on key metrics, including price, context length, throughput, and other model features.

Author: OpenAI

Context Length: 1.1M

Supports Tools

GPT-5.4 is OpenAI’s newest frontier model, merging the Codex and GPT families into a single unified system. It offers a 1M+ token context window (922K input, 128K output) and supports both text and image inputs, enabling high-context reasoning, coding, and multimodal analysis in one workflow. The model brings stronger performance in coding, document understanding, tool usage, and instruction following. It is built to be a solid default for both general tasks and software engineering: it can produce production-ready code, synthesize information from multiple sources, and handle complex multi-step workflows with fewer iterations and better token efficiency.
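The input and output limits quoted above can be turned into a simple pre-flight check. The sketch below is illustrative only: the constants assume the rounded "922K" and "128K" figures mean exactly 922,000 and 128,000 tokens, which the listing does not confirm, and `fits_context` is a hypothetical helper name.

```python
# Assumed limits: the listing rounds to "922K" and "128K",
# so the exact token counts may differ.
MAX_INPUT_TOKENS = 922_000
MAX_OUTPUT_TOKENS = 128_000


def fits_context(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if both counts fall within the assumed GPT-5.4 limits."""
    return (
        0 <= input_tokens <= MAX_INPUT_TOKENS
        and 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


print(fits_context(500_000, 8_000))   # within both assumed limits
print(fits_context(950_000, 8_000))   # input exceeds the assumed 922K limit
```

Token counts would come from whatever tokenizer the caller uses; the check itself only compares numbers.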

Activity (last 14 days)

Prompt: 36M

Completion: 922K

Total: 37M

Startup

OpenAI

Latency (p50): 0.70s

Throughput (p50): 117.0 tok/s

Pricing

Input: $1.25/M tokens

Output: $7.50/M tokens

Cached input: $0.13/M tokens

Features

Input Modalities: text, image, file

Output Modalities: text

Supported Endpoints: Chat Completions

Vision

Supports Tools


Author: OpenAI

Context Length: 400K

Supports Tools

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model. It pairs the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It delivers state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, highlighting better multi-language coding, terminal fluency, and real-world computer-use skills. The model is tuned for long-running, tool-driven workflows and supports interactive steering during execution, making it well-suited for complex development work, debugging, deployment, and iterative product cycles. Outside of coding, GPT-5.3-Codex also performs well on structured knowledge-work benchmarks such as GDPval, enabling tasks like drafting documents, analyzing spreadsheets, creating slides, and conducting operational research across domains. It is trained with increased cybersecurity awareness, including the ability to identify vulnerabilities, and is deployed with extra safeguards for higher-risk scenarios. Relative to earlier Codex models, it is more token-efficient and about 25% faster, aimed at end-to-end professional workflows that combine reasoning, execution, and computer interaction.

Activity (last 14 days)

Prompt: 31M

Completion: 397K

Total: 32M

Startup

OpenAI

Latency (p50): 0.76s

Throughput (p50): 49.6 tok/s
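The p50 latency and throughput figures listed for the two models can be combined into a rough response-time estimate. This is a minimal sketch, assuming that latency is the delay before the first token and that generation then proceeds at the listed throughput; neither assumption is stated on the page, and `estimated_seconds` is a hypothetical helper.

```python
# p50 figures copied from the two listings; real requests will vary.
MODELS = {
    "GPT-5.4": {"latency_s": 0.70, "throughput_tok_s": 117.0},
    "GPT-5.3-Codex": {"latency_s": 0.76, "throughput_tok_s": 49.6},
}


def estimated_seconds(model: str, output_tokens: int) -> float:
    """Rough estimate: latency plus output_tokens divided by throughput."""
    stats = MODELS[model]
    return stats["latency_s"] + output_tokens / stats["throughput_tok_s"]


for name in MODELS:
    seconds = estimated_seconds(name, 1000)
    print(f"{name}: about {seconds:.1f}s for 1,000 output tokens")
```

Under these assumptions, a 1,000-token completion takes roughly 9.2s on GPT-5.4 and roughly 20.9s on GPT-5.3-Codex, so the throughput gap matters far more than the 0.06s latency gap for long outputs.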

Pricing

Input: $0.88/M tokens

Output: $7.00/M tokens

Cached input: $0.09/M tokens
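The per-million-token prices listed for both models translate into per-request costs with simple arithmetic. The sketch below assumes that cached input tokens are billed at the cached rate in place of the regular input rate; the page does not spell out the billing rule, and `request_cost` is a hypothetical helper.

```python
# USD per million tokens, copied from the two listings.
PRICES = {
    "GPT-5.4": {"input": 1.25, "output": 7.50, "cached_input": 0.13},
    "GPT-5.3-Codex": {"input": 0.88, "output": 7.00, "cached_input": 0.09},
}


def request_cost(model: str, input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0) -> float:
    """Estimated USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from
    cache and is billed at the cached rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * price["input"]
        + cached_tokens * price["cached_input"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000


for name in PRICES:
    cost = request_cost(name, input_tokens=100_000, output_tokens=10_000)
    print(f"{name}: ${cost:.3f}")
```

For a request with 100,000 input tokens and 10,000 output tokens and no caching, this works out to about $0.200 on GPT-5.4 and about $0.158 on GPT-5.3-Codex.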

Features

Input Modalities: text, image

Output Modalities: text

Supported Endpoints: Chat Completions

Vision

Supports Tools
