Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions of image size and the embdding. #1

Open
spiralanch opened this issue Feb 26, 2024 · 6 comments
Open

Some questions of image size and the embdding. #1

spiralanch opened this issue Feb 26, 2024 · 6 comments

Comments

@spiralanch
Copy link

Thanks for the amazing work, the result is awesome on https://lsfhuihuiff.github.io/MusicTI/.
Still have some problems that not well understand below, looking forward to your reply, thanks!

  1. How i get the mel-spectrograms image? Can you offer the script, please? Is the same picture size between different length of audio, for example, 5 seconds audio and 60 seconds audio have the same image size?
  2. Whether to offer the pretrained embdding for faster testing the result? “--embedding_path /path/to/logs/trained_model/checkpoints/”
@lsfhuihuiff
Copy link
Owner

1.As mentioned in README, you can refer to Riffusion for the conversion between spectrograms and audio. Spectrograms of different durations correspond to spectrograms of different lengths, while the width remains consistent.
2.Each style audio corresponds to one model, making it difficult to upload the models. I will attempt to upload 1-2 models.

@spiralanch
Copy link
Author

spiralanch commented Feb 27, 2024

Thanks your reply.
The model files on Google drive need permission, request access all ready. It would be better if permissions could be opened.

1.As mentioned in README, you can refer to Riffusion for the conversion between spectrograms and audio. Spectrograms of different durations correspond to spectrograms of different lengths, while the width remains consistent. 2.Each style audio corresponds to one model, making it difficult to upload the models. I will attempt to upload 1-2 models.

@lsfhuihuiff
Copy link
Owner

They have been made public.

@spiralanch
Copy link
Author

They have been made public.

The model files are only 5 kB per item, errror happened when loading the *.pt, please recheck if the files is right.

@lsfhuihuiff
Copy link
Owner

I apologize for any errors that may have occurred during the upload. I have already corrected them.

@farahhuifanyang
Copy link

I apologize for any errors that may have occurred during the upload. I have already corrected them.

Hey, guy! See if u could fix this anyway.
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants