Microsoft MAI-Voice-2
Expressive TTS with voice cloning in 15 languages
Productivity
Developer Tools
Artificial Intelligence

Upvotes: 106
Comments: 5
Featured on: June 5th, 2026
Hunted by: Habib Ferdous

[Dashboard charts, no data loaded: product upvotes vs the next 3; product comments vs the next 3; product upvote speed vs the next 3; product upvotes and comments]

Product vs the next 3

Microsoft's most expressive TTS model yet: voice cloning from short samples, fine-grained emotional control, and consistent voice identity across 15 languages. Now live in Azure AI Foundry at $22 per million characters, with integrations rolling out in VS Code, Dynamics 365 Contact Center, and Teams. For builders shipping voice agents who need production-grade prosody without the OpenAI Realtime API price tag.
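To put the per-character price in concrete terms, here is a minimal cost-estimation sketch. It assumes the $22 per million characters rate quoted in the listing and simple linear billing. The function name and the example workload are hypothetical, and it does not call any Azure API.

```python
from decimal import Decimal

# Rate quoted in the listing (assumption: flat, linear billing per character).
USD_PER_MILLION_CHARS = Decimal("22")


def estimate_tts_cost(num_chars: int, rate: Decimal = USD_PER_MILLION_CHARS) -> Decimal:
    """Return the estimated USD cost of synthesizing num_chars characters."""
    if num_chars < 0:
        raise ValueError("num_chars must be non-negative")
    cost = Decimal(num_chars) * rate / Decimal(1_000_000)
    # Round to a hundredth of a cent so small jobs do not display as zero.
    return cost.quantize(Decimal("0.0001"))


# Hypothetical workload: 2,000 calls per month, about 1,500 spoken characters each.
monthly_chars = 2_000 * 1_500
print(estimate_tts_cost(monthly_chars))  # 3,000,000 characters -> 66.0000
```

Swapping in another provider's per-million rate as the `rate` argument gives a like-for-like comparison for the same workload.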

Top comment

Product of the Day: 13th

I build voice agents for service businesses — mostly healthcare and home services — and the #1 unsolved problem in this space is prosody. The "is this a robot?" moment usually happens in the first 8 seconds of a call. MAI-Voice-2 is the first TTS I've A/B tested where my pilot users couldn't tell. The $22/M chars pricing lands below ElevenLabs and matches gpt-realtime's TTS layer. If you're shipping voice and wedded to OpenAI Realtime, worth running the side-by-side. Curious if Microsoft is planning sub-200ms first-token latency via WebRTC streaming next.

About Microsoft MAI-Voice-2 on Product Hunt

Microsoft MAI-Voice-2 launched on Product Hunt on June 5th, 2026 and earned 106 upvotes and 5 comments, placing #13 on the daily leaderboard.

On the analytics side, Microsoft MAI-Voice-2 competes within Productivity, Developer Tools and Artificial Intelligence — topics that collectively have 1.6M followers on Product Hunt. The dashboard above tracks how Microsoft MAI-Voice-2 performed against the three products that launched closest to it on the same day.

Who hunted Microsoft MAI-Voice-2?

Microsoft MAI-Voice-2 was hunted by Habib Ferdous. A “hunter” on Product Hunt is the community member who submits a product to the platform — uploading the images, the link, and tagging the makers behind it. Hunters typically write the first comment explaining why a product is worth attention, and their followers are notified the moment they post. Around 79% of featured launches on Product Hunt are self-hunted by their makers, but a well-known hunter still acts as a signal of quality to the rest of the community. See the full all-time top hunters leaderboard to discover who is shaping the Product Hunt ecosystem.

For a complete overview of Microsoft MAI-Voice-2 including community comment highlights and product details, visit the product overview.
