This repository has been archived by the owner on Jan 1, 2025. It is now read-only.

Why do off-the-shelf MAE models not need proportional attention in ToMeAttention.forward()? #49

Open
Lopa07 opened this issue Nov 22, 2024 · 0 comments

Comments


Lopa07 commented Nov 22, 2024

For off-the-shelf MAE models, self._tome_info['prop_attn'] should be False, as mentioned here. Why is proportional attention not needed in ToMeAttention.forward() for off-the-shelf MAE models?
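
For context, here is a minimal sketch of what the prop_attn flag toggles, assuming the proportional-attention formulation described in the ToMe paper (softmax(QK^T / sqrt(d) + log s), where s is the number of original patches each merged token represents). The function name and tensor shapes below are illustrative, not the repository's exact code:

```python
import torch

def attention_with_optional_prop_attn(q, k, v, size=None, prop_attn=True):
    """
    Scaled dot-product attention with optional proportional attention.

    q, k, v: (batch, heads, tokens, head_dim)
    size:    (batch, tokens, 1) number of patches merged into each token, or None
    """
    scale = q.shape[-1] ** -0.5
    attn = (q @ k.transpose(-2, -1)) * scale          # (B, H, N, N)

    if prop_attn and size is not None:
        # Bias each key's attention logit by log(size), so a token that
        # stands in for s merged patches counts s times in the softmax.
        attn = attn + size.log()[:, None, None, :, 0]

    attn = attn.softmax(dim=-1)
    return attn @ v
```

With prop_attn=False, the size-based bias is simply skipped and the merged tokens are attended to as if each represented a single patch; the question above is why that is the recommended setting for off-the-shelf MAE checkpoints.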
