whisper-large-v3
by openaiWhisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translation tasks.
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Transcription (1 minute)
$0.0030
Capabilities
Input Modalities
Audio
Output Modalities
Text
Rate Limits
Requests per minute (RPM) and per day (RPD) by tier. More about tiers here
Tier | RPM | RPD |
---|---|---|
Free | 2 | 25 |
Tier 1 | 20 | — |
Tier 2 | 30 | — |
Tier 3 | 50 | — |
Tier 4 | 80 | — |
Usage Analytics
Token usage across the last 30 active days