Good recovery from real mel-spectrogram to audio, but problems with autoencoder output. #23

Open
ericwudayi opened this issue May 5, 2020 · 3 comments

@ericwudayi

The MelGAN vocoder converts real mel-spectrograms to waveforms well, but it has a problem converting a mel-spectrogram that was output by a model. This is the mel-spectrogram extracted from audio:
image

And this one is my autoencoder's output:
image

MelGAN cannot generate a good waveform from the second one, but the first one works well. (This mel-spectrogram is not in the training set of my MelGAN vocoder.) Is this an overfitting problem or something else?
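One thing that may be worth ruling out before blaming overfitting is a plain distribution mismatch between the two inputs, for example a different value range or an over-smoothed output. The following is only a minimal sketch of such a check, assuming both mel-spectrograms are available as 2-D arrays of log-mel values with shape (n_mels, n_frames). The helper names `mel_stats` and `per_bin_mean_gap` are hypothetical and not part of the MelGAN code, and the numbers are toy data, not values from this issue.

```python
from statistics import fmean, pstdev


def mel_stats(mel):
    """Summarize a mel-spectrogram given as rows of floats (n_mels x n_frames)."""
    values = [float(v) for row in mel for v in row]
    return {
        "min": min(values),
        "max": max(values),
        "mean": fmean(values),
        "std": pstdev(values),  # population standard deviation over all bins and frames
    }


def per_bin_mean_gap(real_mel, generated_mel):
    """Absolute difference of the time-averaged value in each mel bin."""
    if len(real_mel) != len(generated_mel):
        raise ValueError("both spectrograms need the same number of mel bins")
    return [abs(fmean(r) - fmean(g)) for r, g in zip(real_mel, generated_mel)]


# Toy stand-ins for the two spectrograms (assumption: log-mel values).
real = [[-4.0, -2.0, -1.0], [-6.0, -5.0, -3.0]]
generated = [[-3.0, -2.5, -2.0], [-5.0, -4.8, -4.5]]

print("real:     ", mel_stats(real))
print("generated:", mel_stats(generated))
print("per-bin gap:", per_bin_mean_gap(real, generated))
```

If the generated spectrogram shows a clearly narrower range or lower standard deviation than the real one, the vocoder is being fed inputs unlike what it saw in training, which is a different situation from overfitting to particular utterances.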

@superhg2012

Have you solved this problem? I ran into the same issue.

@ericwudayi
Author

No, but maybe we can try to solve it. Would it make sense to feed the mel-spectrograms generated by a pretrained autoencoder to MelGAN as training data?
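As a rough illustration of that idea, the sketch below pairs each autoencoder output with the original waveform it came from, so the vocoder could be trained or fine-tuned on the inputs it will see at inference time. Everything here is an assumption for illustration: `build_finetune_pairs`, `toy_extract_mel`, and `toy_autoencoder` are hypothetical names, not MelGAN's API, and the toy functions only stand in for a real feature extractor and a real pretrained model. The pairing only makes sense if the autoencoder output stays frame-aligned with the audio, so misaligned items are skipped.

```python
from typing import Callable, Iterable, List, Sequence, Tuple

Mel = List[List[float]]   # n_mels x n_frames
Wave = Sequence[float]


def build_finetune_pairs(
    waveforms: Iterable[Wave],
    extract_mel: Callable[[Wave], Mel],
    autoencoder: Callable[[Mel], Mel],
    hop_length: int,
) -> List[Tuple[Mel, Wave]]:
    """Pair each autoencoder-reconstructed mel with its source waveform."""
    pairs = []
    for wav in waveforms:
        real_mel = extract_mel(wav)
        pred_mel = autoencoder(real_mel)
        n_frames = len(pred_mel[0])
        if n_frames != len(real_mel[0]):
            continue  # skip items where the model changed the frame count
        # Trim the audio so it covers exactly the frames in the mel.
        pairs.append((pred_mel, wav[: n_frames * hop_length]))
    return pairs


def toy_extract_mel(wav: Wave, hop: int = 4) -> Mel:
    """Toy feature: one 'mel bin' holding the mean absolute amplitude per frame."""
    frames = [wav[i:i + hop] for i in range(0, len(wav) - hop + 1, hop)]
    return [[sum(abs(x) for x in f) / hop for f in frames]]


def toy_autoencoder(mel: Mel) -> Mel:
    """Toy model: slightly attenuates every value, keeps the shape."""
    return [[0.9 * v for v in row] for row in mel]


wavs = [[0.1 * (i % 5) for i in range(18)]]
pairs = build_finetune_pairs(wavs, toy_extract_mel, toy_autoencoder, hop_length=4)
print(len(pairs), len(pairs[0][0][0]), len(pairs[0][1]))
```

Whether training on such pairs actually helps depends on how close the predicted mel-spectrograms are to the real ones, so this is a starting point for an experiment rather than a known fix.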

@superhg2012

I tried training MelGAN on mel-spectrograms generated by Tacotron, but the results were bad. I went back to ground-truth mel-spectrograms extracted from audio,
