Text-to-Speech (TTS)
Text-to-Speech (TTS) has evolved from robotic voices to neural voice profiles with pauses, emphasis, and sometimes real-speaker cloning. APIs make it easy to read responses from LLM out loud.
Always verify rights and consent for cloned voices. Tools: ElevenLabs, Descript, HeyGen. Counterpart: STT.
Key characteristics
- Converts written text into synthetic speech with selectable voice, tone, and sometimes emotional style.
- Is important for accessibility, voiceovers, voice assistants, and automated content publishing.
- Requires consent checks, quality testing, and language support for reliable commercial use.