Glossary
What is text-to-speech?
Text-to-speech (TTS) converts written text into spoken audio with a synthetic voice. In a voice AI system, it's the final step: turning the assistant's chosen response into natural-sounding speech the caller hears.
Why voice quality matters
Modern neural text-to-speech sounds close to human, with natural pacing and intonation. A clear, warm voice makes callers comfortable continuing the conversation, while robotic speech makes them hang up β so voice choice is part of the customer experience.
See also
Related terms
Ahoya is an AI receptionist that answers every call 24/7.
Start free