OmniVoice Demo

State-of-the-art text-to-speech model for 600+ languages, supporting:

  • Voice Clone — Clone any voice from a reference audio
  • Voice Design — Create custom voices with speaker attributes

Built with OmniVoice by Xiaomi Next-gen Kaldi team.

Recommended: 3–10 seconds audio.

Language (optional) / 语种 (可选)

Keep as Auto to auto-detect the language.