Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue Continuing Training from a Previous Checkpoint #176

Closed
SebastianPaucar opened this issue Dec 13, 2024 · 2 comments
Closed

Issue Continuing Training from a Previous Checkpoint #176

SebastianPaucar opened this issue Dec 13, 2024 · 2 comments

Comments

@SebastianPaucar
Copy link

SebastianPaucar commented Dec 13, 2024

HI,

I would like to continue the training in a new run (not as the second stage of a first training). I tried changing the agent_file in my config.toml to the .chkpt generated in a previous run, but it doesn't work. What can I do?

Thanks in advance

@DAlvGar
Copy link

DAlvGar commented Dec 23, 2024

Hi Sebastian, I am not an expert with the tool but, what you describe is the way to go and it has worked for my work. You need a new config file that keeps the prior and sets the agent to the checkpoint file generated in any previous run. Please provide input files and error messages if you need more help.
Best,
Daniel

@halx
Copy link
Contributor

halx commented Dec 23, 2024

See #177

@halx halx closed this as completed Dec 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants