Text-to-Speech (TTS)

Text-to-Speech (TTS) has evolved from robotic voices to neural voice profiles with pauses, emphasis, and sometimes real-speaker cloning. APIs make it easy to read responses from LLM out loud.

Always verify rights and consent for cloned voices. Tools: ElevenLabs, Descript, HeyGen. Counterpart: STT.

Key characteristics

Converts written text into synthetic speech with selectable voice, tone, and sometimes emotional style.
Is important for accessibility, voiceovers, voice assistants, and automated content publishing.
Requires consent checks, quality testing, and language support for reliable commercial use.