Good recover from real melspectrogram to audio, but problem in autoencoder. #23

ericwudayi · 2020-05-05T09:34:38Z

Melgan vocoder can convert the real melspectrogram to waveform well, but I find that there is a problem when melgan vocoder convert the "model output melspectrogram". The melspectrogram extracted from audio is this.

and this one is my autoencoder output:

I find that melgan cannot generate good waveform from second one, but the first one is good. (This melspectrogram is not in the training set of my melgan vocoder) Is this the overfitting problem or something else?

superhg2012 · 2020-05-18T02:34:07Z

have you solved this problem? I met same issue

ericwudayi · 2020-05-18T14:09:56Z

No, but maybe we can try to solve it. Does it make sense that we feed the melspectrogram output generated by pretrained autoencoder as training data of MelGAN?

superhg2012 · 2020-05-19T02:42:49Z

I tried mel-spectrogram generated by Tacotron to train melgan, bad result. I turn back to ground_truth mel-spectrogram extracted from audio,

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Good recover from real melspectrogram to audio, but problem in autoencoder. #23

Good recover from real melspectrogram to audio, but problem in autoencoder. #23

ericwudayi commented May 5, 2020

superhg2012 commented May 18, 2020

ericwudayi commented May 18, 2020

superhg2012 commented May 19, 2020

Good recover from real melspectrogram to audio, but problem in autoencoder. #23

Good recover from real melspectrogram to audio, but problem in autoencoder. #23

Comments

ericwudayi commented May 5, 2020

superhg2012 commented May 18, 2020

ericwudayi commented May 18, 2020

superhg2012 commented May 19, 2020