Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make the training reproducible #22

Open
rodfersou opened this issue Nov 5, 2024 · 1 comment
Open

Make the training reproducible #22

rodfersou opened this issue Nov 5, 2024 · 1 comment

Comments

@rodfersou
Copy link

Great work so far!

We are trying to reproduce the training with more dialects:

  • As a first step, we plan to change the voice of the wave files based on the original corpus set of waves. (I'm running Coqui to convert the voice to another style of voice)
  • Later on, we plan to use our own set of wave files with the new dialect

The issues I found so far in the first step:

  • There is no documentation about running the scripts/extract_f0.py part to create the data/pitch_dict.pt file; it would be good to add this to the docs. I learned it by reading some code.
  • After this part, when I first run the train.py file with my custom configuration, I get an error that the ArabDataset class is not being called correctly; it is passing a cache parameter that doesn't exist in the actual implementation
    image
  • I removed the parameter to try to brute force the results, but it led to another error:
Traceback (most recent call last):
  File "tts-arabic-pytorch/train.py", line 205, in <module>
    main()
  File "tts-arabic-pytorch/train.py", line 157, in main
    train_loader = DataLoader(train_dataset,
  File "python3.9/site-packages/torch/utils/data/dataloader.py", line 376, in __init__
    sampler = RandomSampler(dataset, generator=generator)  # type: ignore[arg-type]
  File "python3.9/site-packages/torch/utils/data/sampler.py", line 164, in __init__
    raise ValueError(
ValueError: num_samples should be a positive integer value, but got num_samples=0

Am I in the right path? Can you please double-check if the tools in the master branch are the last version used to train the data?

Also, I would like to know if I need to use Python 3.9 for this process; the script's dependencies did not work with newer versions.

@rodfersou rodfersou changed the title Make the training reproductible Make the training reproducible Nov 5, 2024
@nipponjo
Copy link
Owner

nipponjo commented Nov 5, 2024

Hi,
I have wanted to add some updates for a while. If you haven't already, you can take a look at the exp branch and the training.md file there. I have added some support for training without the phonemizer there and started to write some instructions.
The train.py in the root folder doesn't work (I should remove it), the working training scripts are in the /scripts folder. I haven't had much time to update the master branch and it could take some days, but I will mostly add the changes from the exp branch. If anything is unclear about the instructions, just let me know, and I will see if I can add something there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants