whisper-large-v3

by openai

Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translation tasks.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Transcription (1 minute)

$0.0030

Capabilities

Input Modalities

Audio

Output Modalities

Text

Rate Limits

Requests per minute (RPM) and per day (RPD) by tier. More about tiers here

TierRPMRPD
Free225
Tier 120
Tier 230
Tier 350
Tier 480

Usage Analytics

Token usage across the last 30 active days

whisper-large-v3 — Model | NagaAI