Skip to content

Am I misunderstanding how masked training works? #347

Pinned Answered by mx
RandomKaos999 asked this question in Q&A
Discussion options

You must be logged in to vote

Masked training doesn't work as you think it does. There are two issues here, the first being likely less important than the second for your use case.

(1) Exact masks in pixel space aren't exact in latent space. Remember both the image and the mask have to get transformed through the VAE.

(2) When you mask an area and set 0 unmasked weight, you aren't saying "only train what's inside of this mask", you are telling the model "ONLY what appears in this masked section matters". That's a subtle but extremely important difference. It means that the model can do ANYTHING outside of that section and it will not penalize the model for being wrong. So if it learned that in other images there are p…

Replies: 1 comment

Comment options

mx
Jun 18, 2024
Collaborator

You must be logged in to vote
0 replies
Answer selected by mx
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants