Qwen3 14B

qwen3-14b
byQwen|Created Jun 2, 2025
Chat Completions

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. Supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. Fine-tuned for instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.04

Output Tokens (1M)

$0.12

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Uptime

Reliability over the last 7 days

Throughput

Time-To-First-Token (TTFT)

Code Example

Example code for using this model through our API with Python (OpenAI SDK) or cURL. Replace placeholders with your API key and model ID.

Basic request example. Ensure API key permissions. For more details, see our documentation.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="qwen3-14b",
    messages=[
        {{"role": "user", "content": "What's 2+2?"}}
    ],
    temperature=0.2,
)
print(resp.choices[0].message.content)