OpenAI

Token usage over time

Browse models from OpenAI

51 models

Text-Embedding-3-Small is OpenAI’s efficient, compact embedding model, designed to convert text into numerical representations for semantic tasks such as search, clustering, and recommendations. It offers improved performance and cost-effectiveness compared to previous models, with low latency and storage requirements.

byOpenAI
$0.0067/1M tokens

GPT-5 Mini

674M Tokens

A compact variant of GPT-5, designed for efficient handling of lighter-weight reasoning and conversational tasks. GPT-5 Mini retains the instruction-following and safety features of its larger counterpart, but with reduced latency and cost. It is the direct successor to OpenAI’s o4-mini model, making it ideal for scalable, cost-sensitive deployments.

byOpenAI
$0.13/1M input tokens$1.00/1M output tokens

The continually updated version of OpenAI ChatGPT 4o, always pointing to the current GPT-4o model used by ChatGPT. Incorporates additional RLHF and may differ from the API version. Intended for research and evaluation, not recommended for production as it may be redirected or removed in the future.

byOpenAI
Free

GPT OSS 120B

492M Tokens

An open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI, designed for high-reasoning, agentic, and general-purpose production use cases. Activates 5.1B parameters per forward pass and is optimized for single H100 GPU deployment with native MXFP4 quantization. Supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

byOpenAI
$0.07/1M input tokens$0.30/1M output tokens

Text-Embedding-3-Large is OpenAI’s most capable embedding model, supporting both English and non-English text tasks. It produces high-dimensional embeddings (up to 3072 dimensions) for advanced semantic similarity, search, and clustering, and allows flexible trade-offs between performance and resource usage.

byOpenAI
$0.04/1M tokens

GPT-5 Nano

130M Tokens

The smallest and fastest member of the GPT-5 family, optimized for developer tools, rapid user interactions, and ultra-low latency environments. While it offers limited reasoning depth compared to larger models, GPT-5-Nano preserves essential instruction-following and safety mechanisms. It is the successor to GPT-4.1-nano and is best suited for real-time, cost-sensitive, or embedded applications.

byOpenAI
$0.02/1M input tokens$0.20/1M output tokens

GPT-5 Mini (Free)

129M Tokens

A compact variant of GPT-5, designed for efficient handling of lighter-weight reasoning and conversational tasks. GPT-5 Mini retains the instruction-following and safety features of its larger counterpart, but with reduced latency and cost. It is the direct successor to OpenAI’s o4-mini model, making it ideal for scalable, cost-sensitive deployments.

byOpenAI
Free

GPT 5.1 Codex

23M Tokens

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It's designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1, Codex is more steerable, closely follows developer instructions, and produces cleaner, higher-quality code. Codex integrates into developer environments like the CLI, IDE extensions, GitHub, and cloud tasks. It adapts its reasoning dynamically—providing quick answers for small tasks and sustaining long, multi-hour runs for large projects. The model is trained for structured code reviews, identifying critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs like images or screenshots for UI development and integrates tools for search, dependency installation, and environment setup. Codex is specifically intended for agentic coding applications.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

GPT 5.1

34.1M Tokens

GPT-5.1 is the newest top-tier model in the GPT-5 series, featuring enhanced general reasoning, better instruction following, and a more natural conversational tone compared to GPT-5. With adaptive reasoning, it dynamically adjusts its computational effort—responding swiftly to simple queries and diving deeper into complex tasks. Explanations are now clearer and use less jargon, making challenging topics easier to grasp. Designed for a wide range of tasks, GPT-5.1 consistently improves performance in math, coding, and structured analysis, offering more cohesive long-form responses and more reliable tool usage. Its conversation style is warmer and more intuitive, yet still precise. GPT-5.1 stands as the main, fully capable successor to GPT-5.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

A mid-sized GPT-4.1 model delivering performance competitive with GPT-4o at substantially lower latency and cost. Retains a 1 million token context window and demonstrates strong coding ability and vision understanding, making it suitable for interactive applications with tight performance constraints.

byOpenAI
Free

GPT-4o (“o” for “omni”) is OpenAI’s latest multimodal model, supporting both text and image inputs with text outputs. Delivers improved performance in non-English languages and visual understanding, while being faster and more cost-effective than previous models.

byOpenAI
Free

GPT-4o (Free)

112M Tokens

The November 2024 release of GPT-4o, featuring enhanced creative writing, more natural and engaging responses, and improved file handling. Maintains the intelligence of GPT-4 Turbo while being twice as fast and 50% more cost-effective, with better support for non-English languages and visual tasks.

byOpenAI
Free

GPT-4o Mini

223M Tokens

OpenAI’s most advanced small model, GPT-4o mini, supports both text and image inputs with text outputs. It is highly cost-effective, achieving SOTA intelligence and outperforming larger models on key benchmarks, making it ideal for scalable, interactive applications.

byOpenAI
$0.07/1M input tokens$0.30/1M output tokens

GPT-4.1 Mini

133M Tokens

A mid-sized GPT-4.1 model delivering performance competitive with GPT-4o at substantially lower latency and cost. Retains a 1 million token context window and demonstrates strong coding ability and vision understanding, making it suitable for interactive applications with tight performance constraints.

byOpenAI
$0.20/1M input tokens$0.80/1M output tokens

GPT-5

354M Tokens

OpenAI’s most advanced large language model, engineered for high-stakes applications requiring step-by-step reasoning, precise instruction following, and robust code generation. GPT-5 introduces major improvements in factual accuracy, user intent understanding, and hallucination reduction. It supports advanced prompt routing, user-specified intent (such as "think hard about this"), and is optimized for complex workflows in coding, writing, and health-related domains.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

GPT-4o

97.8M Tokens

The November 2024 release of GPT-4o, featuring enhanced creative writing, more natural and engaging responses, and improved file handling. Maintains the intelligence of GPT-4 Turbo while being twice as fast and 50% more cost-effective, with better support for non-English languages and visual tasks.

byOpenAI
$1.25/1M input tokens$5.00/1M output tokens

GPT-4 (Free)

6.39M Tokens

GPT-4.1, a flagship model for advanced instruction following, software engineering, and long-context reasoning. Supports a 1 million token context window and is tuned for precise code diffs, agent reliability, and high recall in large document contexts.

byOpenAI
Free

GPT-4o Mini (Free)

101M Tokens

OpenAI’s most advanced small model, GPT-4o mini, supports both text and image inputs with text outputs. It is highly cost-effective, achieving SOTA intelligence and outperforming larger models on key benchmarks, making it ideal for scalable, interactive applications.

byOpenAI
Free

GPT-5 Chat is tailored for advanced, natural, and context-aware conversations in enterprise environments. It leverages the latest advancements in OpenAI’s conversational AI, supporting multimodal and dynamic dialogue with enhanced context retention and user intent understanding.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

The August 2024 version of GPT-4o, offering improved structured output capabilities, including support for JSON schema in responses. Maintains high intelligence and efficiency, with enhanced non-English and visual performance.

byOpenAI
Free

GPT-4.1

137M Tokens

A flagship large language model from OpenAI, optimized for advanced instruction following, real-world software engineering, and long-context reasoning. Supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 in coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding. Tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

byOpenAI
$1.00/1M input tokens$4.00/1M output tokens

DALL-E 3 (Free)

2.29M Tokens

DALL-E 3 is OpenAI’s third-generation text-to-image model, offering enhanced detail, accuracy, and the ability to understand complex prompts. It excels at generating realistic and creative images, handling intricate details like text and human anatomy, and supports various aspect ratios for flexible output.

byOpenAI
Free

GPT 5.1 Chat

5.12M Tokens

GPT-5.1 Chat (also known as Instant) is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

O4 Mini

37.5M Tokens

A compact reasoning model in OpenAI’s o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. Supports tool use and demonstrates competitive reasoning and coding performance across benchmarks, outperforming its predecessor o3-mini and approaching o3 in some domains. Well-suited for high-throughput scenarios where latency or cost is critical.

byOpenAI
$0.55/1M input tokens$2.20/1M output tokens

GPT 5.1 Codex Mini

169K Tokens

GPT-5.1-Codex-Mini is a more compact and faster variant of GPT-5.1-Codex.

byOpenAI
$0.13/1M input tokens$1.00/1M output tokens

GPT-4o Mini TTS

1.05M Tokens

A text-to-speech model built on GPT-4o mini, a fast and powerful language model. Use it to convert text into natural-sounding spoken audio.

byOpenAI
~$0.0076/1k chars

GPT OSS 20B

516K Tokens

OpenAI’s 21B-parameter open-weight Mixture-of-Experts (MoE) model, released under the Apache 2.0 license. Features 3.6B active parameters per forward pass, optimized for low-latency inference and deployability on consumer or single-GPU hardware. Trained in OpenAI’s Harmony response format, it supports reasoning level configuration, fine-tuning, and agentic capabilities such as function calling and structured outputs.

byOpenAI
$0.02/1M input tokens$0.10/1M output tokens

Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translation tasks.

byOpenAI
Free

O3

24.5M Tokens

A well-rounded, powerful model from OpenAI, setting new standards in math, science, coding, and visual reasoning. Excels at technical writing and instruction-following, and is designed for multi-step problem solving across text, code, and images. BYOK is required for access.

byOpenAI
$1.00/1M input tokens$4.00/1M output tokens

GPT Image 1

2.99M Tokens

OpenAI’s new state-of-the-art image generation model. This is a natively multimodal language model that accepts both text and image inputs and produces image outputs. It powers image generation in ChatGPT, offering exceptional prompt adherence, a high level of detail, and quality.

byOpenAI
~$0.04/image

GPT-5 Codex

36.8M Tokens

GPT-5-Codex is a specialized version of GPT-5 tailored for software engineering and coding tasks. It is suitable for both interactive development sessions and the independent execution of complex engineering projects. The model is capable of building projects from scratch, developing new features, debugging, performing large-scale refactoring, and conducting code reviews. Compared to the standard GPT-5, Codex offers greater steerability, follows developer instructions more closely, and delivers cleaner, higher-quality code.

byOpenAI
$0.63/1M input tokens$5.00/1M output tokens

DALL-E 3

134K Tokens

DALL-E 3 is OpenAI’s third-generation text-to-image model, offering enhanced detail, accuracy, and the ability to understand complex prompts. It excels at generating realistic and creative images, handling intricate details like text and human anatomy, and supports various aspect ratios for flexible output.

byOpenAI
~$0.04/image

GPT-4o Transcribe

373K Tokens

A speech-to-text model using GPT-4o for transcribing audio. It offers improved word error rate, better language recognition, and higher accuracy compared to the original Whisper models. Use it for more precise transcripts.

byOpenAI
~$0.0038/minute

A text-to-speech model built on GPT-4o mini, a fast and powerful language model. Use it to convert text into natural-sounding spoken audio.

byOpenAI
Free

Omni Moderation

24.6K Tokens

Omni-Moderation is OpenAI’s newest multimodal content moderation model, available through the Moderation API. It is designed to identify potentially harmful content in both text and images, offering improved accuracy and granular control, especially in non-English languages.

byOpenAI
Free

Whisper Large v3

1.34M Tokens

Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translation tasks.

byOpenAI
~$0.0008/minute

Whisper large-v3-turbo is a finetuned version of a pruned Whisper large-v3. In other words, it's the exact same model, except that the number of decoding layers have reduced from 32 to 4. As a result, the model is way faster, at the expense of a minor quality degradation.

byOpenAI
~$0.0001/minute

GPT-4 1106 Preview

1.34M Tokens

The April 2023 release of GPT-4 Turbo, supporting vision, JSON mode, and function calling. Trained on data up to April 2023, optimized for advanced multimodal tasks.

byOpenAI
$5.00/1M input tokens$15.00/1M output tokens

O1

72.5K Tokens

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. Trained with large-scale reinforcement learning for chain-of-thought reasoning, it is optimized for math, science, programming, and other STEM tasks, consistently achieving PhD-level accuracy on industry benchmarks.

byOpenAI
$7.50/1M input tokens$30.00/1M output tokens

GPT-4

888K Tokens

GPT-4.1, a flagship model for advanced instruction following, software engineering, and long-context reasoning. Supports a 1 million token context window and is tuned for precise code diffs, agent reliability, and high recall in large document contexts.

byOpenAI
$15.00/1M input tokens$30.00/1M output tokens

GPT-4o 2024-05-13

133K Tokens

GPT-4o (“o” for “omni”) is OpenAI’s latest multimodal model, supporting both text and image inputs with text outputs. Delivers improved performance in non-English languages and visual understanding, while being faster and more cost-effective than previous models.

byOpenAI
$2.50/1M input tokens$7.50/1M output tokens

GPT-4 Turbo

20.6K Tokens

The latest GPT-4 Turbo model with vision capabilities, supporting JSON mode and function calling. Trained on data up to December 2023, it is optimized for high-throughput, multimodal applications.

byOpenAI
$5.00/1M input tokens$15.00/1M output tokens

GPT-4 Turbo Preview

7.76K Tokens

Preview release of GPT-4, featuring improved instruction following, JSON mode, reproducible outputs, and parallel function calling. Trained on data up to December 2023. Heavily rate-limited while in preview.

byOpenAI
$5.00/1M input tokens$15.00/1M output tokens

O1 Mini

54.8K Tokens

Experimental mini version of OpenAI’s o1 model, optimized for STEM tasks with efficient performance. Not recommended for production use and may be heavily rate-limited.

byOpenAI
$1.50/1M input tokens$6.00/1M output tokens

GPT-4o 2024-08-06

118K Tokens

The August 2024 version of GPT-4o, offering improved structured output capabilities, including support for JSON schema in responses. Maintains high intelligence and efficiency, with enhanced non-English and visual performance.

byOpenAI
$1.25/1M input tokens$5.00/1M output tokens

Codex Mini

864K Tokens

A fine-tuned version of o4-mini, specifically optimized for use in Codex CLI. Recommended for code-related tasks, with improved performance in code generation and completion.

byOpenAI
$0.75/1M input tokens$3.00/1M output tokens

Specialized GPT-4o variant trained for web search understanding and execution within chat completions, enabling advanced search query comprehension.

byOpenAI
$1.25/1M input tokens$5.00/1M output tokens

ChatGPT-4o Latest

9.83M Tokens

The continually updated version of OpenAI ChatGPT 4o, always pointing to the current GPT-4o model used by ChatGPT. Incorporates additional RLHF and may differ from the API version. Intended for research and evaluation, not recommended for production as it may be redirected or removed in the future.

byOpenAI
$2.50/1M input tokens$7.50/1M output tokens

GPT-4.1 Nano

11.8M Tokens

The fastest and most cost-effective model in the GPT-4.1 series, designed for tasks demanding low latency such as classification and autocompletion. Maintains a 1 million token context window and delivers exceptional performance at a small size, outperforming even some larger models on key benchmarks.

byOpenAI
$0.05/1M input tokens$0.20/1M output tokens

O3 Mini

6.9M Tokens

A cost-efficient language model from OpenAI, optimized for STEM reasoning tasks, especially in science, mathematics, and coding. Supports the `reasoning_effort` parameter for adjustable thinking time and features significant improvements over its predecessor, with better performance on complex questions and lower latency and cost.

byOpenAI
$0.55/1M input tokens$2.20/1M output tokens

Text-Embedding-Ada-002 is a widely used text embedding model from OpenAI, converting text into semantic vectors for tasks like search, clustering, recommendations, and classification. It is known for strong performance and efficiency, making it a standard choice for embedding applications.

byOpenAI
$0.03/1M tokens