Thanks. What we've shared here is a demo tool to show our new speech model that can clone a voice with few seconds of audio. You can try that with English or non-English recordings, but the generated voice can only speak English at the moment. If you are looking for high-fidelity cloning, you can sign up and try it in our app here - https://play.ht/voice-cloning/
High-fidelity cloning requires at least 20 mins of good quality audio. The more the better.
High-fidelity cloning requires at least 20 mins of good quality audio. The more the better.