text to speech - chunhualiao/public-docs GitHub Wiki

https://x.com/deedydas/status/1914714739432939999

4h We just solved text-to-speech AI.

This model can simulate perfect emotion, screaming and show genuine alarm. — clearly beats 11 labs and Sesame — it’s only 1.6B params — streams realtime on 1 GPU — made by a 1.5 person team in Korea!!

It's called Dia by Nari Labs.