
They've really nailed the back and forth of the two speakers!

It would be interesting to know if it's multimodal voice, or just clever prompting and recombining...

I added single-voice podcasts to Magpai after seeing how useful this was. It allows for a bit more customisation of the podcast too: https://www.youtube.com/watch?v=OEsh9MlbA6s

I've got a daily podcast of Hacker News being generated here too: https://www.magpai.app/share/n7R91q



It's almost certainly Google SoundStorm from last year, a traditional TTS trained on dialogs: https://x.com/jonathanfly/status/1675987073893904386



