Question for backbone #10

Open
chaos1992 opened this issue Jun 6, 2023 · 1 comment

Comments

@chaos1992

How can I use CLIP ViT as the backbone? Which layer of CLIP ViT should be used as the 'feature_layer'?

@tgxs002
Owner

tgxs002 commented Jul 18, 2023

Thank you for your interest in our work! The model is designed for CLIP variants that use a ResNet backbone, and running it with a vision transformer would require substantial changes. If you want to use CLIP ViT as the backbone, my guess is that you would need to use the output features of the last layer.
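
One of the changes a ViT backbone would likely need is turning the last layer's token sequence into a spatial grid, since a ResNet feature layer yields a 2D feature map while a ViT yields a flat list of tokens. The following is only a rough sketch of that bookkeeping in plain Python, not code from this repository. The helper names `vit_grid_shape` and `tokens_to_feature_map` are hypothetical, and it assumes the tokens are ordered as one class token followed by patch tokens in row-major order.

```python
from typing import List, Sequence, Tuple


def vit_grid_shape(image_size: int, patch_size: int) -> Tuple[int, int]:
    """Return the (rows, cols) patch grid for a square input image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // patch_size
    return side, side


def tokens_to_feature_map(
    tokens: Sequence[Sequence[float]], image_size: int, patch_size: int
) -> List[List[Sequence[float]]]:
    """Drop the leading class token and arrange patch tokens as a 2D grid.

    Assumption: tokens = [class token, patch tokens in row-major order].
    """
    rows, cols = vit_grid_shape(image_size, patch_size)
    patch_tokens = list(tokens[1:])  # skip the class token
    if len(patch_tokens) != rows * cols:
        raise ValueError(
            f"expected {rows * cols} patch tokens, got {len(patch_tokens)}"
        )
    # Slice the flat sequence into rows of `cols` tokens each.
    return [patch_tokens[r * cols:(r + 1) * cols] for r in range(rows)]


# Illustrative sizes for a ViT-B/16 at 224x224: 1 class token plus
# 14 * 14 patch tokens, each of width 768. The values are dummy zeros.
dummy_tokens = [[0.0] * 768 for _ in range(1 + 14 * 14)]
feature_map = tokens_to_feature_map(dummy_tokens, image_size=224, patch_size=16)
print(len(feature_map), len(feature_map[0]), len(feature_map[0][0]))
```

With a real model, the dummy list would be replaced by the last transformer layer's output for one image. Whether the rest of the pipeline accepts a grid of this resolution and channel width is a separate question that this sketch does not address.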
