How speech-to-text works
Using a simple <Gather> command, the Speech Recognition API captures your speech in real-time, transcribes it, and returns text.
Real-time transcription
Add automatic speech recognition (ASR) the easy way.
No training required
Transcribe a wide range of industry-specific words and phrases out of the box, without any pre-training.
Streaming results
Build responsive voice applications that act on partial recognition results as your customer speaks.
Multiple languages
Recognizes 119 languages and dialects (and more coming soon) to support your global user base.
Use cases
Give customers the choice to use their natural language to navigate menus and collect information.
The Twilio difference
Experience a 99.95% uptime SLA made possible with automated failover and zero maintenance windows.
Extend the same app you write once to new markets with configurable features for localization and compliance.
Use the same platform you know for voice, SMS, video, chat, two-factor authentication, and more.
Get to market faster with pay-as-you-go pricing, free support, and the freedom to scale up or down without contracts.
* Only phone_call model is available for premium