Restart a training job without the file trainer.pth #299

Basically, with your original YAML config (assuming you set seeds) you can deterministically reconstruct your training and validation sets. So all you have to do is make a copy of that config and add the initialize_from_state model builder, which starts the new training session from the weights you still have in best_model.pth. (See #235, #243, #205 for examples.)

You can also check your log or wandb run for learning rate drops, and make sure the new session starts at the learning rate the old one had reached when it stopped.

This won't give a perfect restart like trainer.pth would, but should still be enough to get reasonable results, unless I've forgotten something important...
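
As a rough illustration of the steps above, here is a minimal Python sketch that derives a restart config from the original one. It works on a plain dict standing in for the parsed YAML. The helper make_restart_config, the example values, the paths, and the placeholder builder names are all hypothetical. The key names (model_builders, initial_model_state, learning_rate, run_name) are assumptions about the config schema, so check them against the linked discussions and the docs for your version before relying on them.

```python
import copy


def make_restart_config(original, weights_path, last_lr, run_name_suffix="-restart"):
    """Return a modified copy of `original` that starts from saved weights.

    The seeds and dataset settings are left untouched so that the
    train/validation split is reconstructed exactly as in the first run.
    """
    cfg = copy.deepcopy(original)  # never mutate the original config

    # Append the builder that loads weights, unless it is already listed.
    builders = list(cfg.get("model_builders", []))
    if "initialize_from_state" not in builders:
        builders.append("initialize_from_state")
    cfg["model_builders"] = builders

    # ASSUMPTION: key the builder reads the weights path from.
    cfg["initial_model_state"] = weights_path

    # Start at the learning rate the old run had reached (read from log/wandb).
    cfg["learning_rate"] = last_lr

    # Give the new session its own name so it does not overwrite the old one.
    if "run_name" in cfg:
        cfg["run_name"] = cfg["run_name"] + run_name_suffix
    return cfg


# Hypothetical stand-in for the parsed original YAML config.
original = {
    "run_name": "example-run",
    "seed": 123,
    "dataset_seed": 456,
    "learning_rate": 0.005,
    "model_builders": ["EnergyModel", "ForceOutput"],  # placeholder list
}

restart = make_restart_config(
    original,
    weights_path="results/example-run/best_model.pth",  # hypothetical path
    last_lr=0.00125,  # hypothetical value taken from the old log
)
```

The resulting dict can be written back out as YAML and used as the config for the new training session. Only the weights are restored this way, so anything else that trainer.pth would have carried over is not recovered.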

Answer selected by Linux-cpp-lisp