The texts for these datasets are from Texts for the Ukrainian Text-to-Speech dataset
Join Ukrainian community - https://t.me/speech_synthesis_uk
Donate using Monobank - https://send.monobank.ua/jar/3Saxixsdua
- Quality: high
- Duration: 10h37m
- Audio formats: WAV, OPUS
- Frequency: 48000 Hz, 22050 Hz, 16000 Hz
Listen to DEMO (choose "lada" in the Voice field)
- Quality: high
- Duration: 8h
- Audio formats: WAV, OPUS
- Frequency: 48000 Hz, 22050 Hz, 16000 Hz
- Quality: high
- Duration: 2h40m
- Audio formats: OPUS
- Frequency: 48000 Hz
- Quality: high
- Duration: 8h10m
- Audio formats: WAV, OPUS
- Frequency: 48000 Hz, 22050 Hz, 16000 Hz
Listen to DEMO (choose "mykyta" in the Voice field)
- Quality: high
- Duration: 6h
- Audio formats: OPUS
- Frequency: 48000 Hz
- Align Text to Audio and Trim Silence: https://github.com/proger/uk
- NVIDIA's Flowtron: https://github.com/egorsmkv/ukrainian-flowtron-tts
- HF demos:
- Lada: Ukrainian High-Quality Female Text-to-Speech Dataset: https://zenodo.org/record/7396774
- Google Colabs (RADTTS model):
- Lada is in Piper - https://github.com/rhasspy/piper - A fast, local neural text to speech system
- Tetiana in Balacoon - https://balacoon.com/blog/uk_release/