Deepseek V4 Flash

Chat Completions

deepseek-v4-flash

Deepseek|Created Apr 24, 2026|1.0M context

Chat Completions

DeepSeek V4 Flash is an efficiency-focused Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B active parameters, supporting a 1M-token context window. It is built for fast inference and high-throughput workloads while preserving strong reasoning and coding capabilities. The model features hybrid attention for efficient long-context processing and offers configurable reasoning modes. It is a strong fit for use cases such as coding assistants, chat applications, and agent workflows where responsiveness and cost efficiency matter.

Overview Specifications Activity Performance Uptime Examples API Reference

Compare

Endpoints and request shape

This page collects the public integration surface for the model: supported endpoints, available request parameters, and example calls through the NagaAI API.

Chat Completions

Code Example

Example code for using this model through our API with Python (OpenAI SDK) or cURL. Replace placeholders with your API key and model ID.

Basic request example. Ensure API key permissions. For more details, see our documentation.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[
        {"role": "user", "content": "What's 2+2?"}
    ],
    temperature=0.2,
)
print(resp.choices[0].message.content)

Supported Parameters

Available parameters for API requests

Frequency PenaltyMax Completion TokensPresence PenaltyResponse FormatReasoning EffortStopTemperatureTool ChoiceToolsTop P