Scribe is a speech-to-text model built for accuracy and handling real-world audio. It supports 99 languages and features word-level timestamps, speaker diarization, and audio-event tagging.
elevenlabs.scribe-v1
Use this ID when making API calls to reference this model
ElevenLabs
general
premium
February 26, 2025
$0.40000/hour
$0.00011/second
2h 0m
953.67 MB
Supported capabilities and functionalities
Input/output formats and technical details