-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training #30
Comments
VGGSound-only training: modify this file https://github.com/hkchengrex/MMAudio/blob/main/mmaudio/data/data_setup.py <8s inference: Yes. The demo script already supports this. As in longer duration evaluation, using a duration that significantly differs from the training duration might introduce artifacts. |
Thanks a lot for your response. What do these two lines mean? Are they used during the training? https://github.com/hkchengrex/MMAudio/blob/34bf089fdd2e457cd5ef33be96c0e1c8a0412476/config/data/base.yaml#L31C1-L32C22 |
In addition, I met this issue during the training: Do you have any ideas about how to resolve it? /usr/bin/ld: cannot find -lcuda: No such file or directorycollect2: error: ld returned 1 exit status/usr/bin/ld: cannot find -lcuda: No such file or directorycollect2: error: ld returned 1 exit status[2025-01-12 06:52:22][r3][ERROR] - Error occurred at iteration 0![2025-01-12 06:52:22][r3][CRITICAL] - backend='inductor' raised: |
Thanks. Those two lines are for the evaluation caches. I have updated the readme to reflect this. For the error: can you show the full stack trace? |
Hi, thank you very much. I solved this issue. I have another question about the training script. Does it support gradient accumulation for saving GPU memory? |
And also, for the 44k case, why the number of samples is 353280 rather than 352800?
|
|
Hi, could I learn about how to enable the training with the VGGSound only and without text-audio pairs? In addition, whether it supports v2a generation less than 8s during inference? Many thanks
The text was updated successfully, but these errors were encountered: