Whisper Large v3 Turbo

Transcriptions+1 More

whisper-large-v3-turbo

OpenAI|Created Sep 15, 2025

Transcriptions+1 More

Whisper large-v3-turbo is a finetuned version of a pruned Whisper large-v3. In other words, it's the exact same model, except that the number of decoding layers have reduced from 32 to 4. As a result, the model is way faster, at the expense of a minor quality degradation.

Overview Specifications Activity Performance Uptime Examples

Pricing-50%

Pay-as-you-go rates for this model. More details can be found here.

Transcription (1 minute)

$0.0001

Capabilities

Input Modalities

Audio

Output Modalities

Text

Supported Parameters

Available parameters for API requests

Language

Usage Analytics

Token usage of this model on our platform

Throughput

Not enough throughput data

Time-To-First-Token (TTFT)

Not enough TTFT data

Code Example

Example code for using this model through our API with Python (OpenAI SDK) or cURL. Replace placeholders with your API key and model ID.

Basic request example. Ensure API key permissions. For more details, see our documentation.

from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_API_KEY")

with open("audio.mp3", "rb") as f:
    transcription = client.audio.transcriptions.create(
        model="whisper-large-v3-turbo",
        file=f,
        prompt=None,
        language=None,
    )
print(transcription.text)