grok-tts

Available Models

grok-tts
$$$$
$15/1M

Provider Overview

Grok TTS is an expressive, human-like text-to-speech model. It features five built-in voices and supports over 20 languages with automatic language detection. Users and developers can also add inline tags to inject expressive cues such as laughter, whispering, or natural pauses into the narration.
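
As a rough illustration of the inline-tag idea, the sketch below builds a narration script with tags placed next to the text they affect. The bracketed tag syntax, the tag names, and the helper `with_tag` are all hypothetical placeholders, not the provider's documented format; check the provider's own reference for the real tags before using anything like this.

```python
# Hypothetical tag markers: the real syntax and names may differ.
HYPOTHETICAL_TAGS = {
    "laugh": "[laugh]",
    "whisper": "[whisper]",
    "pause": "[pause]",
}


def with_tag(text: str, tag: str, position: str = "before") -> str:
    """Attach one inline tag before or after a piece of narration text."""
    if tag not in HYPOTHETICAL_TAGS:
        raise ValueError(f"unknown tag: {tag!r}")
    marker = HYPOTHETICAL_TAGS[tag]
    if position == "before":
        return f"{marker} {text}"
    if position == "after":
        return f"{text} {marker}"
    raise ValueError(f"position must be 'before' or 'after', got {position!r}")


# Assemble a short script from plain and tagged sentences.
script = " ".join([
    "Welcome back.",
    with_tag("I have a secret to tell you.", "whisper"),
    with_tag("Just kidding!", "laugh", position="after"),
])
print(script)
```

Keeping the markers in one table means that only `HYPOTHETICAL_TAGS` needs to change once the actual tag format is known; the script-building code stays the same.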

Highly Expressive, Multilingual, Speech tags support
5 Total Voices
3 Male
2 Female
Showing 2 voices
Name
Ara (F, US)
Eve (F, US)