You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Looking around, I've discovered the StyleTTS 2 model.
It seems to be of a much higher quality than other Voice Cloning TTS models, so I think it would be nice to support.
From the statistics mentioned in it's page, it seems to be a little more optimised than YourTTS, which is already supported by SpeechNote.
Alike to what SpeechNote does, it seems to work well when generated on a sentence-by-sentence basis. However, the quality degrades on smaller segments of text.
Looking around, I've discovered the StyleTTS 2 model.
It seems to be of a much higher quality than other Voice Cloning TTS models, so I think it would be nice to support.
From the statistics mentioned in it's page, it seems to be a little more optimised than YourTTS, which is already supported by SpeechNote.
Alike to what SpeechNote does, it seems to work well when generated on a sentence-by-sentence basis. However, the quality degrades on smaller segments of text.
There is a sweet spot to maximize quality.
You can find the source here.
Thanks for the awesome program!
The text was updated successfully, but these errors were encountered: