Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> It's unfortunate that I can't export audio clips locally; otherwise I would immediately look into using this for generating my Finnish flashcard decks from the same material [2].

elevenlabs has an API which seemed quite reasonable when I looked into it. A bit of python should get you what you want pretty quickly.



Oh! I'll look into that, thanks.


I did exactly this for my finnish anki flashcards. you can see the implementation here: https://github.com/w3p706/anki-gen-fin/blob/main/ankigenfin/...

If you are looking to convert very short texts or words into speach, I had best result with eleven_multilingual_v2 with the following text for tts "Hän sanoo rauhallisesti ja hitaasti: <break time=\"1.0s\" /> '${text}'" An then i use a postprocessing to split at the silence.

This was nessesary as you cannot set the language explicitly and it is detected from the input.

With eleven_turbo_v2_5 you can set the language, but the results are not as good.


This is a cool repo. Interesting approach using uralicNLP for morphology, that's not one I've seen before. This repo's README.md is excellent and thorough too - I'll probably come back to this in March and give it a spin for myself, just to see what you're up to in a little more detail.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: