GPT-5.3-Codex vs GPT OSS 20B

Compare GPT-5.3-Codex and GPT OSS 20B on key metrics including price, context length, throughput, and other model features.

AuthorOpenAI

Context Length400k

Supports Tools

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model. It pairs the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It delivers state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, highlighting better multi-language coding, terminal fluency, and real-world computer-use skills. The model is tuned for long-running, tool-driven workflows and supports interactive steering during execution, making it well-suited for complex development work, debugging, deployment, and iterative product cycles. Outside of coding, GPT-5.3-Codex also performs well on structured knowledge-work benchmarks such as GDPval, enabling tasks like drafting documents, analyzing spreadsheets, creating slides, and conducting operational research across domains. It is trained with increased cybersecurity awareness, including the ability to identify vulnerabilities, and is deployed with extra safeguards for higher-risk scenarios. Relative to earlier Codex models, it is more token-efficient and about 25% faster, aimed at end-to-end professional workflows that combine reasoning, execution, and computer interaction.

Activity

Last 14 days

Prompt

240M

Completion

Total

241M

Startup

OpenAI

Latency (p50)0.53s

Throughput (p50)47.6 tok/s

Pricing

Input$0.88/M tokens

Output$7.00/M tokens

Cached input$0.09/M tokens

Features

Input Modalitiestext, image

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model

AuthorOpenAI

Context Length131.1k

Supports Tools

OpenAI’s 21B-parameter open-weight Mixture-of-Experts (MoE) model, released under the Apache 2.0 license. Features 3.6B active parameters per forward pass, optimized for low-latency inference and deployability on consumer or single-GPU hardware. Trained in OpenAI’s Harmony response format, it supports reasoning level configuration, fine-tuning, and agentic capabilities such as function calling and structured outputs.

Activity

Last 14 days

Prompt

47M

Completion

Total

55M

Startup

OpenAI

Latency (p50)0.14s

Throughput (p50)556.5 tok/s

Pricing

Input$0.02/M tokens

Output$0.10/M tokens

Cached input-

Features

Input Modalitiestext, file

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model