Generate natural voiceovers and cloned voices.
OmniVoice covers fast text narration and reference-based voice cloning with simple speed and voice-style controls.
What OmniVoice is good for
Composable voice traits
Use validated descriptors such as accent, age, gender, pitch, and whisper style.
Speed control
Adjust delivery pace from slow narration to faster social-ready voiceovers.
Clone from audio
Use a clean sample to produce speech in the reference voice.
Turn text into a natural spoken track.
Describe voice traits, set the pacing, and generate audio from plain text.
Built for production voice workflows.
Use these pages as a focused audio workspace for scripts, product media, education content, and repeatable brand narration.
Product videos
Generate clear narration for feature launches, onboarding clips, release notes, and short demo videos.
Creator voiceovers
Produce consistent narration for reels, explainers, tutorials, and channel updates without recording every take.
Localized narration
Prepare alternate voice directions for region-specific campaigns, ads, and help center content.
Reference voice reuse
Use voice cloning when a project needs the same speaking identity across multiple scripts.
A simple path from script to usable audio.
The generation panel handles task creation, status polling, preview, and download. The supporting content helps you prepare better inputs before submitting.
Choose the mode
Use Text to Speech for new narration, or Voice Clone when a reference speaker should guide the output.
Prepare the input
Paste the final script, add voice descriptors, and upload clean reference audio for cloning tasks.
Generate and review
Submit the task, preview the finished audio, then download the result for editing or publishing.
Mode comparison
| Mode | Best for | Inputs | Output |
|---|---|---|---|
| Text to Speech | Fast narration and voiceover drafts | Text, voice description, speed | Generated speech audio |
| Voice Clone | Keeping a known voice across multiple scripts | Text, reference audio, optional transcript, speed | Cloned speech audio |
Input guide
Voice description
Combine age, accent, gender, pitch, and delivery traits with commas, such as female, young adult, moderate pitch.
Reference audio
Use a clean clip with one speaker, low background noise, and a speaking style close to the desired result.
Script length
Short paragraphs are easier to review and regenerate. Split long scripts into scene-level sections.
Copy-ready prompts for stronger first results.
Use these examples as starting points, then adapt them to your project, audience, and delivery style.
female, young adult, american accent, moderate pitch, calm studio narration
Create a polished 20-second product update with a steady pace and confident delivery.
Paste the transcript of the uploaded reference clip when available to improve pronunciation alignment.
Common questions before generating.
A short reference for choosing the right mode and preparing inputs before sending a task to the backend.
When should I use OmniVoice instead of Qwen3 TTS?
Use OmniVoice when you want simple voice trait controls, speed adjustment, or a fast reference-clone workflow.
What kind of audio works best for cloning?
Use a clear single-speaker clip with minimal background sound. A matching transcript can help the backend model understand the reference more accurately.
Can I change the generated speaking speed?
Yes. OmniVoice modes include speed control, so you can generate slower narration or faster delivery for short-form content.