This repository has been archived by the owner on Nov 23, 2023. It is now read-only.

paniedziela / ukrainian-tts-datasets Public archive

forked from fducom/ukrainian-tts-datasets

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

Apache-2.0 license

Repository contents (62 commits):
- kateryna
- lada
- mykyta
- oleksa
- tetiana
- tools
- .gitattributes
- .gitignore
- LICENSE
- README.md

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

The texts for these datasets are taken from "Texts for the Ukrainian Text-to-Speech dataset".

Join the Ukrainian community - https://t.me/speech_synthesis_uk

Donate using Monobank - https://send.monobank.ua/jar/3Saxixsdua

Voices

Female

Lada

Quality: high
Duration: 10h37m
Audio formats: WAV, OPUS
Frequency: 48000 Hz, 22050 Hz, 16000 Hz

Listen to DEMO (choose "lada" in the Voice field)

Tetiana

Quality: high
Duration: 8h
Audio formats: WAV, OPUS
Frequency: 48000 Hz, 22050 Hz, 16000 Hz

Kateryna

Quality: high
Duration: 2h40m
Audio formats: OPUS
Frequency: 48000 Hz

Male

Mykyta

Quality: high
Duration: 8h10m
Audio formats: WAV, OPUS
Frequency: 48000 Hz, 22050 Hz, 16000 Hz

Listen to DEMO (choose "mykyta" in the Voice field)

Oleksa

Quality: high
Duration: 6h
Audio formats: OPUS
Frequency: 48000 Hz
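
The voice metadata above can be summarised programmatically. The following is a minimal sketch, not part of the repository's tooling: the `VOICES` dictionary, `duration_minutes`, and `voices_with_rate` are hypothetical names introduced here for illustration, and the values are simply transcribed from the list above.

```python
import re

# Voice metadata transcribed from the list above (hypothetical structure,
# not a file shipped with the datasets).
VOICES = {
    "lada": {"gender": "female", "duration": "10h37m",
             "formats": ["WAV", "OPUS"], "rates_hz": [48000, 22050, 16000]},
    "tetiana": {"gender": "female", "duration": "8h",
                "formats": ["WAV", "OPUS"], "rates_hz": [48000, 22050, 16000]},
    "kateryna": {"gender": "female", "duration": "2h40m",
                 "formats": ["OPUS"], "rates_hz": [48000]},
    "mykyta": {"gender": "male", "duration": "8h10m",
               "formats": ["WAV", "OPUS"], "rates_hz": [48000, 22050, 16000]},
    "oleksa": {"gender": "male", "duration": "6h",
               "formats": ["OPUS"], "rates_hz": [48000]},
}


def duration_minutes(text):
    """Convert a duration string such as '10h37m' or '8h' to minutes."""
    match = re.fullmatch(r"(?:(\d+)h)?(?:(\d+)m)?", text)
    if not text or match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    hours, minutes = (int(group) if group else 0 for group in match.groups())
    return hours * 60 + minutes


def voices_with_rate(rate_hz):
    """Return the sorted names of voices listed with the given sample rate."""
    return sorted(name for name, info in VOICES.items()
                  if rate_hz in info["rates_hz"])


total_minutes = sum(duration_minutes(info["duration"])
                    for info in VOICES.values())
total_hours, remainder = divmod(total_minutes, 60)
print(f"Total listed audio: {total_hours}h{remainder}m")
print("Voices listed at 16000 Hz:", voices_with_rate(16000))
```

The durations are the approximate figures listed for each voice, so the total is an estimate rather than an exact measurement of the audio files.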

Appearances on the web

Align Text to Audio and Trim Silence: https://github.com/proger/uk
NVIDIA's Flowtron: https://github.com/egorsmkv/ukrainian-flowtron-tts
Hugging Face demos:
- https://huggingface.co/spaces/robinhad/ukrainian-tts
- https://huggingface.co/spaces/theodotus/ukrainian-voices
Lada: Ukrainian High-Quality Female Text-to-Speech Dataset: https://zenodo.org/record/7396774
Google Colabs (RADTTS model):
- https://colab.research.google.com/drive/13aa0o9fQknDcJtpLrGXhxWPvZpeUggCy?usp=sharing
- https://colab.research.google.com/drive/1pgiBlMm4tk0atKrszStOSy6XaTDnc3v4?usp=sharing
Lada is in Piper - https://github.com/rhasspy/piper - A fast, local neural text to speech system
Tetiana is in Balacoon - https://balacoon.com/blog/uk_release/
- Demo: https://huggingface.co/spaces/balacoon/tts
