Phi-4

phi-4
byMicrosoft|Created May 25, 2025
Chat Completions

Phi-4 is a 14B-parameter model from Microsoft Research, designed for complex reasoning tasks and efficient operation in low-memory or rapid-response scenarios. Trained on a mix of high-quality synthetic and curated data, it is optimized for English language inputs and demonstrates strong instruction following and safety standards. For more details, see the [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905).

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.04

Output Tokens (1M)

$0.07

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Uptime

Reliability over the last 7 days

Throughput

Time-To-First-Token (TTFT)

Code Example

Example code for using this model through our API with Python (OpenAI SDK) or cURL. Replace placeholders with your API key and model ID.

Basic request example. Ensure API key permissions. For more details, see our documentation.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="phi-4",
    messages=[
        {{"role": "user", "content": "What's 2+2?"}}
    ],
    temperature=0.2,
)
print(resp.choices[0].message.content)