Qwen3 235B A22B Thinking 2507
Chat Completionsqwen3-235b-a22b-thinking-2507
Chat Completions
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. Activates 22B of its 235B parameters per forward pass and natively supports up to 262,144 tokens of context. This "thinking-only" variant enhances structured logical reasoning, mathematics, science, and long-form generation, and is instruction-tuned for step-by-step reasoning, tool use, agentic workflows, and multilingual tasks.
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Input Tokens (1M)
$0.07
Output Tokens (1M)
$0.30
Capabilities
Input Modalities
Text
Output Modalities
Text
Supported Parameters
Available parameters for API requests
Frequency Penalty
Logit Bias
Logprobs
Max Completion Tokens
Presence Penalty
Reasoning Effort
Response Format
Stop
Temperature
Tool Choice
Tools
Top P