Whisper Large v3
whisper-large-v3
by openai | Created May 25, 2025
Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 5 million hours of labeled data, it demonstrates strong generalization across datasets and domains, excelling in zero-shot transcription and translation tasks.
Pricing
Pay-as-you-go rates for this model.
Transcription: $0.0008 per minute of audio
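As a rough illustration of what the per-minute rate means for a budget, the sketch below estimates the cost of transcribing audio of a given length. The function name `estimate_transcription_cost` is hypothetical, and the sketch assumes billing scales linearly with audio duration; the page does not say whether usage is rounded up to whole minutes or subject to a minimum charge, so treat the result as an approximation.

```python
from decimal import Decimal

# Rate listed on this page: $0.0008 per minute of transcribed audio.
PRICE_PER_MINUTE_USD = Decimal("0.0008")


def estimate_transcription_cost(duration_seconds: float) -> Decimal:
    """Estimate the transcription cost in USD for audio of the given length.

    Assumption (not stated on the page): cost is pro-rated linearly by
    duration, with no rounding to whole minutes and no minimum charge.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Convert via str() so float inputs do not carry binary rounding noise.
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return minutes * PRICE_PER_MINUTE_USD


if __name__ == "__main__":
    for seconds in (60, 90, 3600):
        print(f"{seconds} s of audio: about ${estimate_transcription_cost(seconds)}")
```

Under that linear assumption, one hour of audio comes to about $0.048. `Decimal` is used instead of floats so that very small per-minute amounts add up without binary rounding error.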
Capabilities
Input Modalities: Audio
Output Modalities: Text