Claude Haiku 4.5

Chat Completions

claude-haiku-4.5-20251001

Anthropic|Created Oct 15, 2025|200k context

Chat Completions

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, offering near-frontier intelligence with much lower cost and latency than larger Claude models. It matches Claude Sonnet 4’s performance in reasoning, coding, and computer-use tasks, making it ideal for real-time and large-scale applications. Haiku 4.5 introduces controllable reasoning depth, supports summarized or interleaved thought outputs, and enables tool-assisted workflows across coding, bash, web search, and computer-use tools. With over 73% on SWE-bench Verified, it stands among the top coding models while maintaining fast responsiveness for sub-agents, parallel execution, and scaled deployment.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Pricing-50%

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.50

Output Tokens (1M)

$2.50

Capabilities

Input Modalities

TextImageFile

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Frequency PenaltyLogit BiasMax Completion TokensPresence PenaltyReasoning EffortResponse FormatStopTemperatureTool ChoiceToolsTop P

Usage Analytics

Token usage of this model on our platform

Throughput

Time-To-First-Token (TTFT)

Code Example

Example code for using this model through our API with Python (OpenAI SDK) or cURL. Replace placeholders with your API key and model ID.

Basic request example. Ensure API key permissions. For more details, see our documentation.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="claude-haiku-4.5-20251001",
    messages=[
        {"role": "user", "content": "What's 2+2?"}
    ],
    temperature=0.2,
)
print(resp.choices[0].message.content)