Google Cloud - Standard

Standard speech recognition model by Google

Model ID

google.standard

Use this ID when making API calls to reference this model

Provider

google

Supported Languages

af-ZAsq-ALam-ETar-DZar-BHar-EGar-IQar-ILar-JOar-KWar-LBar-MRar-MAar-OMar-QAar-SAar-PSar-SYar-TNar-AEar-YEhy-AMaz-AZeu-ESbn-BDbn-INbs-BAbg-BGmy-MMca-EScmn-Hans-CNcmn-Hans-HKcmn-Hant-TWyue-Hant-HKhr-HRcs-CZda-DKnl-BEnl-NLen-AUen-CAen-GHen-HKen-INen-IEen-KEen-NZen-NGen-PKen-PHen-SGen-ZAen-TZen-GBen-USet-EEfil-PHfi-FIfr-BEfr-CAfr-FRfr-CHgl-ESka-GEde-ATde-DEde-CHel-GRgu-INiw-ILhi-INhu-HUis-ISid-IDit-ITit-CHja-JPjv-IDkn-INkk-KZkm-KHrw-RWko-KRlo-LAlv-LVlt-LTmk-MKms-MYml-INmr-INmn-MNne-NPno-NOfa-IRpl-PLpt-BRpt-PTpa-Guru-INro-ROru-RUsr-RSsi-LKsk-SKsl-SIst-ZAes-ARes-BOes-CLes-COes-CRes-DOes-ECes-SVes-GTes-HNes-MXes-NIes-PAes-PYes-PEes-PRes-ESes-USes-UYes-VEsu-IDsw-KEsw-TZss-Latn-ZAsv-SEta-INta-MYta-SGta-LKte-INth-THts-ZAtn-Latn-ZAtr-TRuk-UAur-INur-PKuz-UZve-ZAvi-VNxh-ZAzu-ZA
Automatic Language Detection: Yes
Performance & Cost

Cost

$0.96000/hour

$0.00027/second

Features

Supported capabilities and functionalities

Core Features

Punctuation
Diarization
Streaming
Speaker Labels
Word Timestamps
Confidence Scores
Custom Vocabulary
Profanity Filtering
Noise Reduction
Voice Activity Detection

Subtitle Formats

SRT Support
VTT Support
Technical Specifications

Input/output formats and technical details

Subtitle Format Support

SRT VTT

Supported Audio Encodings

FLACLINEAR16MULAWAMRAMR_WBMP3