Ahoya

Glossary

What is speech-to-text?

Speech-to-text (STT), also called speech recognition, converts spoken audio into written text. It's the first step in a voice AI system: turning what a caller says into text a language model can understand.

How it's used in phone AI

On a call, speech-to-text transcribes the caller's words in real time so the system can interpret intent. Accuracy with accents, background noise, and industry terms directly affects how well the AI understands and responds.

Related terms

Ahoya is an AI receptionist that answers every call 24/7.

Start free