How were the two models being created? #7

Open
dogma346 opened this issue Mar 29, 2022 · 1 comment

Comments

@dogma346

Hi, Sidak. My name is Terai, and I am an undergraduate student studying informatics engineering.

I have read your paper, "Model Fusion via Optimal Transport", published at NeurIPS 2020. After reading it, I tried to reproduce the simulation results using the code publicly available on GitHub, and I have two simple questions: How were the two models used in Figure 2 of the paper created? How can I use your code to retrain the models myself?

I know you are busy, but I would greatly appreciate it if you could help me. Thanks.

@XueY-L

XueY-L commented Apr 23, 2023

Hi, Terai. I have also tried to reproduce the results. The ResNet model can be trained using the definition in cifar/models/resnet.py; note that this architecture differs from the one in torchvision.models. The training hyperparameters are listed in cifar/hyperparameters, and you can use train_cifar_models.py to train the source models.

I trained ResNet models with BN layers and a bias in the linear layer, whereas the provided models contain only the conv weights and linear weights, so I am not sure which architecture was actually used during training.
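
One way to check this kind of mismatch is to compare the parameter names stored in each checkpoint. The following is a minimal sketch, not code from the repository: the helper names and the example key lists are hypothetical, and it relies only on the standard PyTorch naming convention in which BatchNorm buffers end in `running_mean`, `running_var`, or `num_batches_tracked` and bias parameters end in `bias`.

```python
# Leaf names that PyTorch uses for BatchNorm buffers in a state_dict.
BN_BUFFER_NAMES = ("running_mean", "running_var", "num_batches_tracked")


def summarize_keys(keys):
    """Group state_dict key names into weights, biases, and BN buffers."""
    summary = {"weights": [], "biases": [], "bn_buffers": []}
    for name in keys:
        leaf = name.rsplit(".", 1)[-1]  # text after the last dot
        if leaf in BN_BUFFER_NAMES:
            summary["bn_buffers"].append(name)
        elif leaf == "bias":
            summary["biases"].append(name)
        elif leaf == "weight":
            summary["weights"].append(name)
    return summary


def has_bn_or_bias(keys):
    """Return True if any key is a bias parameter or a BatchNorm buffer."""
    summary = summarize_keys(keys)
    return bool(summary["biases"]) or bool(summary["bn_buffers"])


# Hypothetical key lists, for illustration only.
weights_only_keys = ["conv1.weight", "layer1.0.conv1.weight", "linear.weight"]
bn_and_bias_keys = [
    "conv1.weight",
    "bn1.weight",
    "bn1.bias",
    "bn1.running_mean",
    "bn1.running_var",
    "linear.weight",
    "linear.bias",
]

print(has_bn_or_bias(weights_only_keys))
print(has_bn_or_bias(bn_and_bias_keys))
```

With PyTorch installed, the key names would typically come from something like `torch.load(path, map_location="cpu").keys()`, assuming the file stores a plain state_dict. If the checkpoint wraps the state_dict in another dictionary, the keys have to be taken from the nested entry instead.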
