FalAI - ElevenLabs Speech to Text

Generate text from speech using ElevenLabs advanced speech-to-text model. Supports 99 languages with state-of-the-art accuracy.

Provider

FalAI

Model Type

general

Accuracy Tier

premium

Supported Languages

No language information available
Performance & Cost

Cost

$1.86486/hour

$0.00052/second

Features

Supported capabilities and functionalities

Core Features

Punctuation
Diarization
Streaming
Speaker Labels
Word Timestamps
Confidence Scores
Language Detection
Custom Vocabulary
Profanity Filtering
Noise Reduction
Technical Specifications

Input/output formats and technical details

Supported Output Formats

textjson

Supported Audio Encodings

mp3oggwavm4aaac