whisper-large-v3

by OpenAI

Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translat...

Pricing

Pay-as-you-go rates for this model.

Transcription (1 minute)

$0.0030
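
As a rough illustration of the rate above, the following sketch estimates the cost of transcribing a clip of a given length. It assumes billing scales linearly with audio duration, with no minimum charge and no rounding up to whole minutes; the page does not confirm either, and the function name `estimate_transcription_cost` is hypothetical.

```python
from decimal import Decimal

# USD per minute of transcribed audio, taken from the rate listed above.
PRICE_PER_MINUTE = Decimal("0.0030")


def estimate_transcription_cost(audio_seconds: float) -> Decimal:
    """Estimate the transcription cost in USD for a clip of the given length.

    Assumption: cost is strictly proportional to duration (no minimum
    charge, no per-request rounding). Actual billing rules may differ.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = Decimal(str(audio_seconds)) / Decimal(60)
    # Round to four decimal places, the precision of the listed rate.
    return (minutes * PRICE_PER_MINUTE).quantize(Decimal("0.0001"))


if __name__ == "__main__":
    # One hour of audio is 60 minutes at $0.0030 per minute.
    print(estimate_transcription_cost(3600))
```

Under these assumptions, one hour of audio comes to about $0.18.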

Capabilities

Input Modalities

Audio

Output Modalities

Text

Rate Limits

Requests per minute (RPM) and requests per day (RPD) by tier.

Tier      RPM    RPD
Free        2     25
Tier 1     20   2000
Tier 2     30   3000
Tier 3     50   5000
Tier 4     80   8000
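
A client can stay within these limits by tracking its own request timestamps. The sketch below is one minimal way to do that, not the provider's mechanism. It reads the table above as Free = 2 RPM / 25 RPD, Tier 1 = 20 / 2000, and so on, and it assumes rolling 60-second and 24-hour windows; the page does not say whether limits are enforced over rolling or fixed windows. The `RateLimiter` class and `TIER_LIMITS` mapping are hypothetical names.

```python
import time
from collections import deque

# (RPM, RPD) per tier, as read from the rate-limit table above.
TIER_LIMITS = {
    "Free": (2, 25),
    "Tier 1": (20, 2000),
    "Tier 2": (30, 3000),
    "Tier 3": (50, 5000),
    "Tier 4": (80, 8000),
}


class RateLimiter:
    """Client-side guard using rolling per-minute and per-day windows.

    Assumption: rolling windows. The provider may use fixed windows or a
    daily reset time instead, so treat this as a conservative sketch.
    """

    def __init__(self, tier: str, clock=time.monotonic):
        self.rpm, self.rpd = TIER_LIMITS[tier]
        self.clock = clock
        self.minute_window = deque()  # timestamps within the last 60 s
        self.day_window = deque()     # timestamps within the last 24 h

    def _prune(self, now: float) -> None:
        # Drop timestamps that have aged out of each window.
        while self.minute_window and now - self.minute_window[0] >= 60:
            self.minute_window.popleft()
        while self.day_window and now - self.day_window[0] >= 86400:
            self.day_window.popleft()

    def try_acquire(self) -> bool:
        """Record a request and return True if both limits allow it."""
        now = self.clock()
        self._prune(now)
        if len(self.minute_window) >= self.rpm:
            return False
        if len(self.day_window) >= self.rpd:
            return False
        self.minute_window.append(now)
        self.day_window.append(now)
        return True
```

A caller would check `try_acquire()` before each transcription request and wait or queue the job when it returns `False`. The `clock` parameter exists only so the limiter can be exercised with a fake clock.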
