-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About the prediction accuracy after training #9
Comments
@XinGP Generally, if you reduce the batch size, you should also reduce the learning rate (ref: https://arxiv.org/pdf/1812.01187.pdf). The relatively large LR will lead to unstable training. |
|
Have you found a suitable learning rate for batch_size 4 to close to the result which the author gives? @XinGP |
Yes, after reducing all learning rates by half, I can obtain validation set results similar to those in the paper |
Hello author! I reproduced your AV1 training code, adjusted the epoch to 50, and adjusted the batch_size to 4, with no other adjustments made. What is the reason for the significant difference in the validation results obtained after training compared to your validation model?
The text was updated successfully, but these errors were encountered: