Glossary

What is text-to-speech?

Text-to-speech (TTS) converts written text into spoken audio with a synthetic voice. In a voice AI system, it's the final step: turning the assistant's chosen response into natural-sounding speech the caller hears.

Why voice quality matters

Modern neural text-to-speech sounds close to human, with natural pacing and intonation. A clear, warm voice makes callers comfortable continuing the conversation, while robotic speech makes them hang up — so voice choice is part of the customer experience.

Related terms

speech-to-text →
voice AI agent →
AI receptionist →

Ahoya is an AI receptionist that answers every call 24/7.

Start free

What is text-to-speech?

Why voice quality matters

See also

Related terms