DeepSeek Chat v3.1 vs Deepseek v3.2

Compare DeepSeek Chat v3.1 and Deepseek v3.2 on key metrics including price, context length, throughput, and other model features.

AuthorDeepseek

Context Length32.8k

Supports Tools

DeepSeek-V3.1 is a 671B-parameter hybrid reasoning model (37B active), supporting both "thinking" and "non-thinking" modes via prompt templates. It extends DeepSeek-V3 with two-phase long-context training (up to 128K tokens) and uses FP8 microscaling for efficient inference. The model excels in tool use, code generation, and reasoning, with performance comparable to DeepSeek-R1 but with faster responses. It supports structured tool calling, code agents, and search agents, making it ideal for research and agentic workflows. Successor to DeepSeek V3-0324, it delivers strong performance across diverse tasks.

Activity

Last 14 days

Prompt

46M

Completion

Total

49M

Startup

Deepseek

Latency (p50)0.46s

Throughput (p50)32.0 tok/s

Pricing

Input$0.10/M tokens

Output$0.40/M tokens

Cached input-

Features

Input Modalitiestext

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model

AuthorDeepseek

Context Length163.8k

Supports Tools

DeepSeek-V3.2 is a large language model optimized for high computational efficiency and strong tool-use reasoning. It features DeepSeek Sparse Attention (DSA), a mechanism that lowers training and inference costs while maintaining quality in long-context tasks. A scalable reinforcement learning post-training framework further enhances reasoning, achieving performance comparable to GPT-5 and earning top results on the 2025 IMO and IOI. V3.2 also leverages large-scale agentic task synthesis to improve reasoning in practical tool-use scenarios, boosting its generalization and compliance in interactive environments.

Activity

Last 14 days

Prompt

118M

Completion

Total

122M

Startup

Deepseek

Latency (p50)1.18s

Throughput (p50)19.7 tok/s

Pricing

Input$0.14/M tokens

Output$0.21/M tokens

Cached input$0.01/M tokens

Features

Input Modalitiestext

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model