Hi, I am studying your approach and your implementation. My question: in your paper you use Equation 12 to compute the BNM loss, and the divisor is the batch size B. But in BNM/DA/BNM/train_image.py L#164 I found that this is done with torch.mean(). If the class number C is smaller than the batch size, the SVD produces an s_tgt of length C instead of B, so the mean divides by C. Wouldn't that be incorrect with respect to the original equation? Why not explicitly divide by the batch size?
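For reference, here is a minimal sketch of what I mean (the shapes and variable names are my own illustration, not copied from the repo):

```python
import torch
import torch.nn.functional as F

B, C = 36, 31                     # batch size larger than class number, e.g. Office-31
logits = torch.randn(B, C)
softmax_tgt = F.softmax(logits, dim=1)

# SVD of a B x C matrix yields min(B, C) singular values.
s_tgt = torch.linalg.svdvals(softmax_tgt)   # shape: (min(B, C),) = (31,)

loss_mean  = -torch.mean(s_tgt)             # divides by min(B, C) = C here
loss_paper = -torch.sum(s_tgt) / B          # Equation 12: divide by batch size B
```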
I admit I was a little careless about the normalization.
In the equation, the loss is divided by the batch size B.
In the code, torch.mean() divides by min(B, C) instead. Since L_bnm is combined with a hyperparameter \lambda, this only changes the effective value of \lambda.
In other experiments, I found that dividing by \sqrt{B * C} can work better, and the performance can be further improved by tuning \lambda.
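A minimal sketch of the three normalizations (the helper name, its signature, and the default \lambda are illustrative, not the repo's API):

```python
import torch
import torch.nn.functional as F

def bnm_loss(logits: torch.Tensor, norm: str = "min", lam: float = 1.0) -> torch.Tensor:
    """Sketch of the BNM loss with different normalizations of the nuclear norm.

    norm = "min"  -> divide by min(B, C), i.e. torch.mean over singular values (current code)
    norm = "B"    -> divide by the batch size B, as in Equation 12 of the paper
    norm = "sqrt" -> divide by sqrt(B * C), which I found to work well in practice
    """
    G = F.softmax(logits, dim=1)            # B x C prediction matrix
    s = torch.linalg.svdvals(G)             # min(B, C) singular values
    B, C = G.shape
    if norm == "min":
        denom = min(B, C)
    elif norm == "B":
        denom = B
    else:
        denom = (B * C) ** 0.5
    # Changing the denominator only rescales the loss, so the difference
    # can be absorbed into the hyperparameter lambda.
    return -lam * s.sum() / denom
```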