Product Thumbnail

MiniMax Audio

Level Up Your Audio with Realistic AI Voices

Artificial Intelligence
Tech
Audio

MiniMax Audio just leveled up with the new Speech-02 model! Get ultra-realistic Al voices (30+ langs, 99% similarity). Read Files/URLs & handle long text (200k chars). API available at: [email protected].

Top comment

Hi everyone! MiniMax Audio just leveled up with their new Speech-02 voice models! They're pushing for ultra-realistic text-to-speech that sounds incredibly natural and expressive across many languages. What's new with Speech-02 & the platform: 🗣️ Realistic & Expressive: Aims for "native flair" in 30+ languages, with studio-grade clarity and zero rhythm glitches. Think flawless multi-language transitions and even deep, cinematic bass tones. 📎 Read Anything: Instantly turn any file or URL into lifelike audio. 📜 Long-Text Mode: Effortlessly create audiobooks/podcasts with up to 200,000 characters in a single go. 🎤 Unlimited Voice Cloning: Still offers unlimited voice cloning to personalize your audio. ⚡ Sub-Second Streaming: Delivers audio quickly for real-time applications. 💰 Same Price: All these improvements apparently come at the same affordable price point. This combination of high-quality, expressive multilingual voices and practical features like file reading and long-text support are really powerful for creators.

Comment highlights

MiniMax Audio just leveled up with the new Speech-02 model


🚀 ALL New Speech-02 Series Enhanced authentic multi-language coverage, 99% vocal similarity, zero glitches in rhythm, and studio-grade clarity - all at the same affordable price.


📎 Read Anything Upload files or URLs and enjoy listening to any content in your preferred voices, anytime, anywhere.


📜 Long-Text Mode Effortlessly create audiobooks or podcasts without truncation; supports up to 200,000 characters of asynchronous speech synthesis in a single input.


📂 Enhanced History Management Review, delete, or organize your speech synthesis history and settings with ease.


🔍 New Discovery Hub Discover all features and learn what's new in one convenient place.


⚡Try the new MiniMax Audio Now: https://www.minimax.io/audio

⚡API access coming soon—visit https://minimax.io/platform/ or contact [email protected] for early access!

MiniMax Audio just leveled up with the new Speech-02 model


🚀 ALL New Speech-02 Series Enhanced authentic multi-language coverage, 99% vocal similarity, zero glitches in rhythm, and studio-grade clarity - all at the same affordable price.


📎 Read Anything Upload files or URLs and enjoy listening to any content in your preferred voices, anytime, anywhere.


📜 Long-Text Mode Effortlessly create audiobooks or podcasts without truncation; supports up to 200,000 characters of asynchronous speech synthesis in a single input.


📂 Enhanced History Management Review, delete, or organize your speech synthesis history and settings with ease.


🔍 New Discovery Hub Discover all features and learn what's new in one convenient place.


⚡Try the new MiniMax Audio Now: https://www.minimax.io/audio

⚡API access coming soon—visit https://minimax.io/platform/ or contact [email protected] for early access!

The progress in natural and expressive speech with Speech-02 is seriously impressive. MiniMax Audio is setting a new bar for multilingual voice quality.

These are pretty good. I think on par with 11labs. But as with 11labs, very skewed towards audio book narration. I guess thats where the demand is at the moment. I wonder if its possible to create voices that are suitable for productvideos, commercials etc. Or even reviews. Similar to openAI's Alloy voice.