Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. Supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. Demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects.
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Input Tokens (1M)
$0.05
Output Tokens (1M)
$0.15
Capabilities
Input Modalities
Text
Output Modalities
Text
Supported Parameters
Available parameters for API requests
Logprobs
Max Completion Tokens
Reasoning Effort
Response Format
Stop
Temperature
Tool Choice
Tools
Top P