-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Beit model and pretrained weights #245
base: main
Are you sure you want to change the base?
Conversation
1、当oneflow版本为0.8.1.dev20220802+cu112时,会报错:
修改oneflow版本为0.8.1+cu112.git.506cb3f1即可正常运行。
2、对beit_base_patch16_224、beit_base_patch16_384、beit_large_patch16_224、beit_large_patch16_384、beit_large_patch16_512进行测试,皆可正常运行。 对beit_base_patch16_224_in22k、beit_large_patch16_224_in22k进行测试,采用预训练时可正常运行。在不采用预训练直接调用模型时,无法直接设置num_classes参数。
报错信息:
若通过像采用预训练那样修改最后一层head层的输出,可正常运行。
3、还有一点我比较好奇的是,in22k结尾的网络最后一层head的输出为何是21841这么大,即使是预训练之后。(按理说通过imagenet预训练之后最后的输出都是1000类) |
你是用eager模式跑的吧,因为add op还没支持non-contiguous输入,所以non-contiguous输入的inplace add是不支持的,以前的版本能支持,但计算出来的结果肯定是错的。这个问题正在统一解决中。 |
Add Beit model and pretrained weights