MARS5 TTS
Open-source, insanely prosodic text-to-speech model
Software Engineering
Artificial Intelligence
GitHub
Data Science

Featured onJune 14th, 2024

Unicorn Platform

Create a Website for Your Project Fast • Sponsored

Create your website ⧉

Product upvotes vs the next 3

Waiting for data. Loading

Product comments vs the next 3

Waiting for data. Loading

Product upvote speed vs the next 3

Waiting for data. Loading

Product upvotes and comments

Waiting for data. Loading

Product vs the next 3

MARS5 TTS

Open-source, insanely prosodic text-to-speech model

MARS5 an opensource TTS model to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime & more. Join our Discord https://discord.gg/4GVdQ28cZC today!

Top comment

Upvotes650

▲ 650View on ProductHunt ⧉

Comments153

153 commentsSee comments on PH ⧉

Product of the Day1st

CAMB.AI introduces MARS5, a fully open-source (commercially usable) TTS with break-through prosody and realism available on our Github: https://www.github.com/camb-ai/m... Why is it different? MARS5 is able to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime and more; hard prosody that most closed-source and open-source TTS models struggle with today. We're excited for you to try, build on and use MARS5 for research and creative applications. Let us know any feedback on our Discord! Highlights: Training data: Trained on over 150K+ hours of data. Params: 1.2 Bn (750/450) Multilingual: Open-sourcing in English to begin with, but can access it in 140+ languages on camb.ai Diversity in prosody: can handle very hard prosodic elements like commentary, shouting, anime etc.

MARS5 TTSOpen-source, insanely prosodic text-to-speech modelSoftware EngineeringArtificial IntelligenceGitHubData Science

Product upvotes and comments

Product vs the next 3

Top comment

MARS5 TTS
Open-source, insanely prosodic text-to-speech model
Software Engineering
Artificial Intelligence
GitHub
Data Science