
is KLD calculation correct? #3

Open · victor-shepardson opened this issue Sep 30, 2017 · 5 comments

@victor-shepardson

AGE/src/losses.py, lines 38 to 41 in 0915760:

```python
t1 = (samples_var.pow(2) + samples_mean.pow(2)) / 2
t2 = -samples_var.log()
KL = (t1 + t2 - 0.5).mean()
```

The KLD appears to use the variance in place of the standard deviation: utils.var() already computes the variance (mean squared distance from the mean), which is then squared again in the KLN01Loss module. Should it be (in the default 'qp' direction):

```python
t1 = samples_var + samples_mean.pow(2)
t2 = -samples_var.log()
KL = (t1 + t2 - 1).mean() / 2
```

?
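For reference, this matches the standard closed-form KL divergence between a diagonal Gaussian $q = \mathcal{N}(\mu, \sigma^2)$ and the unit Gaussian $p = \mathcal{N}(0, 1)$, taken per dimension:

$$\mathrm{KL}(q \,\|\, p) = \tfrac{1}{2}\left(\sigma^2 + \mu^2 - \log \sigma^2 - 1\right)$$

With $\sigma^2 = $ samples_var, this is exactly the corrected (t1 + t2 - 1) / 2 above.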

(Additionally, the paper gives the KLD as a sum over dimensions, but here it is a mean, which changes the meaning of the hyperparameters that weight the reconstruction losses.)
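To make the scaling concrete, a minimal sketch (the latent dimensionality d here is hypothetical, not taken from the repo):

```python
import torch

d = 128                      # hypothetical latent dimensionality
per_dim_kl = torch.rand(d)   # stand-in for the per-dimension KL terms

# sum() and mean() differ by a factor of d, so a loss weight tuned
# against the summed KL must be divided by d to match the mean version.
assert torch.allclose(per_dim_kl.sum(), per_dim_kl.mean() * d)
```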

@DmitryUlyanov (Owner)

Hi, yes, it looks like a mistake. Thanks for spotting it.

I will also change the sum to a mean in the paper, thanks again!

Best,
Dmitry

@DmitryUlyanov (Owner)

I will fix it in several days, when I have time to make sure everything still works.

@victor-shepardson (Author)

No problem! I think the reconstruction losses are similarly given as norms in the paper but computed as means in the code. And the latent-space loss is described as L2 but appears to actually be a cosine distance (see the sketch below).
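For unit-norm vectors the two are affinely related, which may be why the substitution goes unnoticed: minimizing squared L2 distance is equivalent to maximizing cosine similarity. A minimal sketch (shapes and the use of F.normalize are illustrative, not taken from the repo):

```python
import torch
import torch.nn.functional as F

# Unit-norm vectors, as an encoder mapping onto the unit sphere would produce.
x = F.normalize(torch.randn(8, 128), dim=1)
y = F.normalize(torch.randn(8, 128), dim=1)

# For unit vectors: ||x - y||^2 = 2 - 2 * cos(x, y).
sq_l2 = (x - y).pow(2).sum(dim=1)
cos = (x * y).sum(dim=1)
assert torch.allclose(sq_l2, 2 - 2 * cos, atol=1e-5)
```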

@jshanna100

The encoder normalizes all output vectors to norm 1 (i.e., it maps them onto the unit sphere). But in that case, a batch of such vectors can never match the unit Gaussian demanded by the KL loss, even when perfectly distributed over the sphere.

Have I missed something, or shouldn't the loss be computed with the standard deviation of the transformed vectors, rather than 1?
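A quick numerical check of this point (a sketch; none of these names come from the repo): coordinates of vectors drawn uniformly from the unit sphere in R^d have variance 1/d, not 1, so the batch statistics can never match N(0, I).

```python
import torch
import torch.nn.functional as F

d = 128
# Normalizing standard Gaussian samples yields points uniform on the unit sphere.
z = F.normalize(torch.randn(100_000, d), dim=1)

# Each coordinate has mean ~0 and variance ~1/d, far from the unit variance
# that a zero KL against N(0, I) would require.
print(z.var(dim=0).mean())   # ~= 1/d = 0.0078 for d = 128
```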

@DmitryUlyanov (Owner)

The KL divergence will indeed not be zero in the perfect case, but when the KL is minimal, Q is approximately uniform on the sphere, which is what we want.
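One way to see why the minimizer is uniform (a sketch of the argument, with densities taken relative to the sphere's surface measure): the standard normal density depends only on $\|z\|$, so it is constant on the unit sphere, and

$$\mathrm{KL}(Q \,\|\, P) = \mathbb{E}_Q[\log q(z)] - \mathbb{E}_Q[\log p(z)] = -H(Q) + \text{const},$$

which is minimized by the maximum-entropy distribution on the sphere, i.e. the uniform one.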
