
Some question of KLD #3

Open · CXX1113 opened this issue Apr 7, 2019 · 3 comments

Comments


CXX1113 commented Apr 7, 2019

KLD = -0.5 / n_nodes * torch.mean(torch.sum(1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2), 1))

Since torch.mean already averages over the nodes, either / n_nodes should be removed, or torch.mean should be replaced with torch.sum.
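For context, a minimal sketch of the KL term this line computes, assuming logvar stores log(sigma) rather than log(sigma^2), which is why the code uses 2 * logvar and logvar.exp().pow(2) (the function name here is illustrative, not the repo's):

```python
import torch

def kld_term(mu, logvar):
    # KL( N(mu, sigma^2) || N(0, I) ) with logvar = log(sigma),
    # so 2 * logvar = log(sigma^2) and logvar.exp().pow(2) = sigma^2.
    per_node = -0.5 * torch.sum(
        1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2), dim=1
    )
    # torch.mean already averages over the n_nodes rows,
    # so no additional division by n_nodes is needed.
    return torch.mean(per_node)
```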

AllenWu18 commented

Then should it be
KLD = -0.5 * torch.mean(torch.sum(1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2), 1))
or
KLD = -0.5 / n_nodes * torch.sum(torch.sum(1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2), 1))?
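Assuming mu and logvar have shape (n_nodes, latent_dim), the two forms above are numerically equivalent, since averaging over the node dimension is the same as summing and dividing by n_nodes. A small sanity-check sketch (shapes are illustrative):

```python
import torch

torch.manual_seed(0)
n_nodes, latent_dim = 5, 3
mu = torch.randn(n_nodes, latent_dim)
logvar = torch.randn(n_nodes, latent_dim)

inner = 1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2)

kld_mean = -0.5 * torch.mean(torch.sum(inner, 1))          # first form
kld_sum = -0.5 / n_nodes * torch.sum(torch.sum(inner, 1))  # second form

# The per-node sums are identical, so the two reductions agree up to float error.
assert torch.allclose(kld_mean, kld_sum)
```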


YH-UtMSB commented Aug 26, 2019

@alanlisten @AllenWu18 You are both right about the KLD. The author has clarified that 1 / n_nodes serves as a rescaling parameter (like β in β-VAE) to weaken the regularization from the KLD term. See this issue: Loss function in optimizer.py #20.
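In other words, keeping the extra 1 / n_nodes is an intentional down-weighting of the KL term rather than a bug. A hedged sketch of what such a weighted loss could look like, with the weight made explicit (vgae_loss, recon_logits, adj_labels, and beta are illustrative names, not the repo's code, which also applies norm and pos_weight to the reconstruction term):

```python
import torch
import torch.nn.functional as F

def vgae_loss(recon_logits, adj_labels, mu, logvar, n_nodes, beta=None):
    # Reconstruction term: BCE over the predicted adjacency matrix.
    recon = F.binary_cross_entropy_with_logits(recon_logits, adj_labels)

    # KL term averaged over nodes, as in the forms discussed above.
    kld = -0.5 * torch.mean(
        torch.sum(1 + 2 * logvar - mu.pow(2) - logvar.exp().pow(2), 1)
    )

    # beta-VAE-style rescaling; the repo's extra 1 / n_nodes plays the role of beta.
    if beta is None:
        beta = 1.0 / n_nodes
    return recon + beta * kld
```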


Dzhilin commented Jul 19, 2021

Could you please tell me what "norm" means in the loss function? Looking forward to your reply!
