A problem about training #1

BrettFyt · 2022-12-30T02:18:33Z

Hello, I ran the training script for the BERT-BERT architecture from the example, but the simplification results are poor. Is this a bug in the program?

amanbasu · 2023-01-08T02:37:40Z

For how many epochs did you train your model?

BrettFyt · 2023-01-08T07:58:50Z

About 10-15 epochs. Strangely, the model cannot output meaningful sentences. I also tried the BERT-GPT2 architecture, which does better than BERT-BERT, but the SARI score is still not very good.

amanbasu · 2023-01-08T10:01:45Z

If your SARI is not good, the output sentences won't make much sense.

I got a good SARI score in 10-15 epochs. Are you using the WikiLarge dataset or the smaller one included in the repo? I would also suggest inspecting the input sentences and labels coming out of the data generator to verify that they are correct.
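
One rough way to do that inspection is sketched below. It assumes the generator's output can be decoded into plain (source, target) string pairs; `audit_pairs` and its `max_len_ratio` threshold are hypothetical names for illustration, not part of the repo.

```python
def audit_pairs(pairs, max_len_ratio=3.0):
    """Count suspicious (source, target) sentence pairs.

    `pairs` is any iterable of (source, target) strings. This helper is a
    hypothetical illustration and does not use the repo's datagen.py.
    """
    report = {"total": 0, "empty": 0, "identical": 0, "length_mismatch": 0}
    for source, target in pairs:
        report["total"] += 1
        src_tokens = source.split()
        tgt_tokens = target.split()
        # An empty side usually points to a parsing or alignment problem.
        if not src_tokens or not tgt_tokens:
            report["empty"] += 1
            continue
        # Identical pairs give the model no simplification signal.
        if source.strip() == target.strip():
            report["identical"] += 1
        # A very large length ratio can indicate misaligned lines.
        longer = max(len(src_tokens), len(tgt_tokens))
        shorter = min(len(src_tokens), len(tgt_tokens))
        if longer / shorter > max_len_ratio:
            report["length_mismatch"] += 1
    return report


if __name__ == "__main__":
    sample = [
        ("The cat sat on the mat .", "The cat sat ."),
        ("", "A target with no source ."),
        ("Same sentence .", "Same sentence ."),
    ]
    print(audit_pairs(sample))
```

A high share of empty, identical, or badly mismatched pairs would suggest a data problem rather than a training problem.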

BrettFyt · 2023-01-08T10:56:10Z

Thanks for the kind help. I downloaded the code from GitHub and ran it following the instructions there. I only set the number of epochs to 15 and used the training set in the dataset folder. I also checked datagen.py and utils.py in the project, and the code looks OK to me.

amanbasu · 2023-01-08T14:07:50Z

Check your loss: is it still decreasing when you reach 15 epochs, or has it plateaued? You can try training further.
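
A minimal sketch of such a plateau check, assuming you have logged one loss value per epoch into a list. The function name, window size, and threshold are illustrative assumptions, not values from the repo.

```python
def has_plateaued(losses, window=3, min_rel_improvement=0.01):
    """Return True if the mean loss over the last `window` epochs improved by
    less than `min_rel_improvement` (as a fraction) relative to the mean of
    the `window` epochs before that."""
    if len(losses) < 2 * window:
        return False  # not enough history to compare two windows
    previous = sum(losses[-2 * window:-window]) / window
    recent = sum(losses[-window:]) / window
    if previous == 0:
        return True  # loss is already zero, nothing left to improve
    return (previous - recent) / abs(previous) < min_rel_improvement


if __name__ == "__main__":
    # Hypothetical per-epoch losses, for illustration only.
    still_improving = [5.0, 4.0, 3.0, 2.5, 2.0, 1.6]
    flat = [3.0, 1.0, 0.502, 0.501, 0.500, 0.500]
    print(has_plateaued(still_improving))
    print(has_plateaued(flat, window=2))
```

If the check reports no plateau at epoch 15, training longer is a reasonable next step. If it does report one, more epochs alone are unlikely to help.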
