getting a list of all speakers and the P#? #206
Replies: 5 comments 3 replies
-
This is a great question, you did not miss anything and it's not at all obvious unless you get into the docs for Coqui TTS. In fact I think it's not even that clear how speakers and voice models are connected. First, there are basically two models available (though technically you can use any model that works with Coqui TTS, I only ever test/validate with the two I find to be the best). Those models are VITS and XTTSv2. epub2tts defaults to the model You can find all available speaker IDs for VITS from the command line with I added the following script to the utils directory that will generate samples for vits speakers:
To use the speakers that have actual names (these were Coqui Studio voices), you need to be using XTTS, and the command would look like You can run the script Hope this helps, let me know if you still have questions or are running into any problems. |
Beta Was this translation helpful? Give feedback.
-
Hello, and thanks for this wonderful project! I hope someone will fork and continue the development of TTS. I have just started to use it and I love it. One newbie question: how does the command epub2tts mybook.txt --engine xtts --speaker "Damien Black" --cover cover-image.jpg --sayparts should be modified for handling text in Italian and in French? Thanks! |
Beta Was this translation helpful? Give feedback.
-
You can just add Be aware though that non-english languages do not always turn out great. There have been a few bugs opened here regarding issues with other languages (like #153 for instance). There is some more work to be done on this, especially around limiting the length of sentence that gets sent for TTS. |
Beta Was this translation helpful? Give feedback.
-
I have not tried an installation on Debian, but ideally if the installation happens in a virtual environment, there shouldn't be any conflicts. I am planning to update the installation instructions to use pipenv which should be a huge improvement in terms of ease of installation. I also realize the README doesn't really say how to update other than in the windows instructions, I can't believe I missed that! The way to update is activate your virtual environment, then |
Beta Was this translation helpful? Give feedback.
-
I hope someone might find this useful, it appears the voices for the P# voices (Vits Encoding Trained on Vctk Dataset) can be found at this site. At least they sound the same to me :-). |
Beta Was this translation helpful? Give feedback.
-
Is there a way to list out all the possible speaker P#'s and maybe hear what they sound like? I tried the utility that created a bunch of sample wavs with names but when I try to use the speaker name it never works. when using the P# does work every time but wanted to get a list of what is possible. I looked around but did not find any docs on this or I most likely overlooked it. TIA
Beta Was this translation helpful? Give feedback.
All reactions