OpenAI - Whisper

Model Information

vlarge-v2

General-purpose speech recognition model. Based on the open-source Whisper large-v2 model, offering faster performance than the open-source version.

openai.whisper-1

Use this ID when making API calls to reference this model

OpenAI

general

premium

March 1, 2023

afarhyazbebsbgcazhhrcsdanlenetfifrgldeelhehihuisiditjaknkkkolvltmkmsmrminenofaplptrorusrskslesswsvtltathtrukurvicy

Automatic Language Detection: Yes

afarhyazbebsbgcazhhrcsdanlenetfifrgldeelhehihuisiditjaknkkkolvltmkmsmrminenofaplptrorusrskslesswsvtltathtrukurvicy

Performance & Cost

$0.36000/hour

$0.00010/second

25.00 MB

Features

Supported capabilities and functionalities

Punctuation

Diarization

Streaming

Speaker Labels

Word Timestamps

Confidence Scores

Custom Vocabulary

Profanity Filtering

Noise Reduction

Voice Activity Detection

SRT Support

VTT Support

Technical Specifications

Input/output formats and technical details

SRT VTT

M4AMP3WebMMP4MPGAWAVMPEG