State-of-the-art text-to-speech model for 600+ languages, supporting:
Built with OmniVoice by Xiaomi Next-gen Kaldi team.
Recommended: 3–10 seconds audio.
Keep as Auto to auto-detect the language.