Qwen3 235B A22B 2507

qwen3-235b-a22b-2507
by qwen|Created Jul 22, 2025

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. Optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. Supports a native 262K context length and delivers significant gains in knowledge coverage, long-context reasoning, and coding benchmarks.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.07

Output Tokens (1M)

$0.42

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Throughput

Time-To-First-Token (TTFT)