Intuition of feature-wise sorting? #4
Another question: why does restoring the order in the decoder eliminate the need for the assignment-based loss? In this way, the decoder would output the elements in the same arbitrary order as the input elements; however, the order of the ground truth is fixed, I think.

I am also a little confused by Figure 1 (the colors and the dashed box). I think a concrete example, including input coordinate numbers and a simple network transformation, would be more demonstrative.
The point you quoted is somewhat separate from the bottleneck problem. In general, you can't entirely eliminate the bottleneck when going from a set of vectors to a single vector. By making the pooling operation learnable, the idea in FSPool is that we can reduce the bottleneck by learning what information is relevant and being able to throw out information we don't care about. With that sentence, I'm referring to the following: some people might argue that because each feature is sorted independently, relationships between features within each element are lost. My argument is that an MLP before the pooling can learn to decorrelate the feature dimensions, so that this isn't a problem.
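As a rough illustration (a simplified sketch for fixed-size sets, not the repository's `FSPool` class, which also handles variable set sizes via piecewise-linear weights), feature-wise sort pooling can be written as: an MLP per element, an independent sort of each feature channel across the set, and a learned weight per (feature, sorted position):

```python
# Minimal sketch of feature-wise sort pooling for fixed-size sets.
import torch
import torch.nn as nn

class SimpleFSPool(nn.Module):
    def __init__(self, in_dim, hidden_dim, set_size):
        super().__init__()
        # MLP applied per element; it can learn to decorrelate the feature dimensions
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim),
        )
        # one learnable weight per feature channel and sorted position
        self.weight = nn.Parameter(torch.randn(hidden_dim, set_size))

    def forward(self, x):
        # x: (batch, set_size, in_dim)
        h = self.mlp(x)                                   # (batch, set_size, hidden_dim)
        h = h.transpose(1, 2)                             # (batch, hidden_dim, set_size)
        sorted_h, perm = h.sort(dim=2, descending=True)   # sort each feature independently
        pooled = (sorted_h * self.weight).sum(dim=2)      # (batch, hidden_dim)
        return pooled, perm                               # perm can be reused for unpooling

pool = SimpleFSPool(in_dim=2, hidden_dim=16, set_size=5)
x = torch.randn(3, 5, 2)
out, perm = pool(x)
# permuting the set elements leaves the pooled vector unchanged
out_shuffled, _ = pool(x[:, torch.randperm(5)])
print(torch.allclose(out, out_shuffled))  # True
```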
Correct, the output is in the same arbitrary order as the input elements. I don't know what you mean by the order of the ground truth being fixed. The point is that regardless of what this ordering is, because the "first" element in the output set corresponds to the "first" element in the input set when you use FSUnpool, we can just use a normal pairwise mean squared error as the loss. There is no need for assignment-based losses anymore, since we essentially have a sequence regression problem now. For a concrete example, have a look at the video for ICLR 2020: https://iclr.cc/virtual_2020/poster_HJgBA2VYwH.html
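To make the "no assignment needed" point concrete, here is a rough sketch (assumed names, not the repository's API): the decoder output is scattered back into the input's arbitrary order using the permutation recorded during pooling, so element i of the output lines up with element i of the input and plain MSE applies.

```python
import torch
import torch.nn.functional as F

def unpool_with_perm(decoded_sorted, perm):
    # decoded_sorted: (batch, channels, set_size), in per-feature sorted order
    # perm: sort indices recorded in the encoder, same shape
    restored = torch.empty_like(decoded_sorted)
    restored.scatter_(2, perm, decoded_sorted)  # undo the per-feature sort
    return restored

# toy usage: pretend the decoder reproduced the sorted encoder features exactly
x = torch.randn(3, 16, 5)                       # (batch, channels, set_size)
sorted_x, perm = x.sort(dim=2, descending=True)
decoded = sorted_x                              # stand-in for a decoder output
restored = unpool_with_perm(decoded, perm)
loss = F.mse_loss(restored, x)                  # ordinary pairwise MSE, no matching step
print(loss.item())                              # 0.0 in this toy case
```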
Hi, could you please shed some light on the feature-wise sorting?
Though this operation is permutation-invariant, I'm still having trouble understanding it.
In the paper it says "A transformation (such as with an MLP) prior to the pooling can ensure that the features being sorted are mostly independent so that little information is lost by treating the features independently."
Why does this operation help with the significant bottleneck that arises when compressing a set of any size down to a single feature vector?
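(For intuition, a concrete toy example of feature-wise sorting, assumed here for illustration only: each feature column is sorted independently across the set elements, so the result does not depend on the order in which the elements are listed.)

```python
import torch

set_a = torch.tensor([[3.0, 0.1],
                      [1.0, 0.9],
                      [2.0, 0.5]])           # 3 elements, 2 features
set_b = set_a[[2, 0, 1]]                      # same set, different element order

sorted_a, _ = set_a.sort(dim=0)               # sort each feature column separately
sorted_b, _ = set_b.sort(dim=0)
print(sorted_a)
# tensor([[1.0000, 0.1000],
#         [2.0000, 0.5000],
#         [3.0000, 0.9000]])
print(torch.equal(sorted_a, sorted_b))        # True: invariant to element order
```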