A problem about training #1

Open
BrettFyt opened this issue Dec 30, 2022 · 5 comments

Hello, I ran the training script for the BERT-BERT architecture from the example, but the simplification results are poor. Is this a bug in the program?

[Attached image: 1672366702490]

Owner

amanbasu commented Jan 8, 2023

For how many epochs did you train your model?

Author

BrettFyt commented Jan 8, 2023

About 10-15 epochs. Strangely, the model does not produce meaningful sentences. I also tried the BERT-GPT2 architecture, which does better than BERT-BERT, but its SARI score is still not very good.

Owner

amanbasu commented Jan 8, 2023

If your SARI is not good, the output sentences won't make much sense.

I got a good SARI in 10-15 epochs. Are you using the WikiLarge dataset or the smaller one included in the repo? I would also suggest inspecting the input sentences and labels coming out of the data generator to verify that they are correct.
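One way to run that kind of check is sketched below. It is not code from this repo: `summarize_pairs` is a hypothetical helper, and it assumes the batches have already been decoded back into plain-text (source, target) pairs with whatever tokenizer the data generator uses. The "target much longer than source" threshold is an arbitrary heuristic for spotting misaligned pairs in a simplification dataset.

```python
from collections import Counter


def summarize_pairs(pairs, max_examples=3):
    """Count common problems in (source, target) text pairs.

    Hypothetical helper for illustration; `pairs` is any iterable of
    already-decoded (source_sentence, target_sentence) strings.
    """
    stats = Counter()
    examples = []  # a few suspicious pairs to inspect by eye
    for source, target in pairs:
        stats["total"] += 1
        src, tgt = source.strip(), target.strip()
        if not src or not tgt:
            stats["empty"] += 1
        elif src == tgt:
            stats["identical"] += 1
        elif len(tgt.split()) > 2 * len(src.split()):
            # Simplified text is rarely twice as long as its source.
            stats["target_much_longer"] += 1
        else:
            stats["ok"] += 1
            continue
        if len(examples) < max_examples:
            examples.append((source, target))
    return dict(stats), examples


if __name__ == "__main__":
    demo = [
        ("The cat sat on the mat .", "The cat sat ."),
        ("", "x"),
        ("a b", "a b"),
    ]
    stats, suspicious = summarize_pairs(demo)
    print(stats)
    print(suspicious)
```

If most pairs land in a category other than `ok`, the problem is more likely in the data pipeline than in the model.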

Author

BrettFyt commented Jan 8, 2023

Thanks for the kind help. I downloaded the code from GitHub and ran it following the instructions in the repository. I only set the number of epochs to 15, and I used the training set in the dataset folder. I also checked datagen.py and utils.py, and the code looks OK to me.

Owner

amanbasu commented Jan 8, 2023

Check your loss: is it still decreasing when you reach 15 epochs, or has it plateaued? If it is still decreasing, you can try training further.
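As a rough way to answer that question from logged values, here is a minimal sketch. `has_plateaued` is a hypothetical helper, not part of this repo, and it assumes you have one positive loss value per epoch in a list. The window size and the 1% threshold are arbitrary defaults to adjust.

```python
def has_plateaued(losses, window=3, min_rel_improvement=0.01):
    """Return True if the mean loss over the last `window` epochs improved
    by less than `min_rel_improvement` (relative) over the window before it.

    Assumes `losses` holds one positive value per epoch, oldest first.
    """
    if len(losses) < 2 * window:
        return False  # not enough history to decide
    previous = sum(losses[-2 * window:-window]) / window
    recent = sum(losses[-window:]) / window
    if previous == 0:
        return True  # loss is already zero, nothing left to improve
    return (previous - recent) / previous < min_rel_improvement


if __name__ == "__main__":
    still_improving = [5.0, 4.0, 3.0, 2.5, 2.0, 1.5]
    flat = [3.0, 2.0, 1.50, 1.50, 1.49, 1.50, 1.50, 1.49]
    print(has_plateaued(still_improving))
    print(has_plateaued(flat))
```

Comparing window averages rather than single epochs keeps one noisy epoch from being mistaken for a plateau.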
