Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about FeatUp #6

Open
zcc00210 opened this issue Nov 29, 2024 · 3 comments
Open

Question about FeatUp #6

zcc00210 opened this issue Nov 29, 2024 · 3 comments

Comments

@zcc00210
Copy link

zcc00210 commented Nov 29, 2024

Hello,
Thanks for your amazing work! I have a few questions about using Featup:

  1. How do you use FeatUp to enhance the pixelated features of DINOv2? Is it for a vector n * H * W (n is the dimension of features) or an image? If it is convenient, could you please mark the relevant code?
  2. When FeatUp is used, is ckpt without hierarchical specifications being used? Are there any differences between the two ckpt types in enhancing the features of DINO v2?
    Looking forward to your reply!
@yhyang-myron
Copy link
Member

Hi, thanks for your interest in our work!

  1. We use FeatUp in our pre-training stage. We distill the feature from FeatUp-DINOv2 to the 3D backbone via a MSE Loss.
  2. What does a ckpt without hierarchical specifications refer to?

@zcc00210
Copy link
Author

Hi, thanks for your interest in our work!

  1. We use FeatUp in our pre-training stage. We distill the feature from FeatUp-DINOv2 to the 3D backbone via a MSE Loss.
  2. What does a ckpt without hierarchical specifications refer to?

Sorry for my wrong expression of the second question, I want to ask, do you use checkpoint with No LayerNorm (as shown below)?
image
May I ask that does the pre-training stage refer to the process of building PTv3-object. And the corresponding code is not released?
As I want to use FeatUp in the pixel-by-pixel feature map at reduced resolution, but I don't know much about its network structure. So I want to learn from your work and see how FeatUp applies to feature vectors. If you can provide the relevant code, I would be grateful!!

@yhyang-myron
Copy link
Member

yhyang-myron commented Dec 1, 2024

Hi, the FeatUp we used is with additional LayerNorm:
torch.hub.load("mhamilton723/FeatUp", 'dinov2')

And the code is like this:

self.Encoder_2d = torch.hub.load("mhamilton723/FeatUp", 'dinov2').cuda().eval()
img_feat = self.Encoder_2d(imgs)
img_feat = F.interpolate(img_feat, size=(512, 512))
img_feat = rearrange(img_feat, '(b s) c h w -> b s c h w', b=B)
loss = F.mse_loss(point_feat, img_feat)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants