Creating Various Checkpoint During Training #133
-
Hey! Is there a way to add multiple checkpoints during training? Say I want to train a model on 10,000 steps, but want to keep track of the changes every 1,000 steps without exclusively relying on the test images in the logs (i.e., by running X/Y plot charts for each model), how would I go about doing that? Any specific code I should add to the Runpod training flow? Thanks for this awesome training protocol! |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
Yes there is a way. I've been doing runs where I have checkpoints saved at every 250 or 500 step intervals. go to the directory Open to edit the file at around line 106 you will see a block :
Change it so that it looks something like this :
There are two areas where checkpoints are saved:
Note : you may find that the folder That's it. Make sure you allocate enough drive space for your GPU instance. each checkpoint takes up about 2GB. |
Beta Was this translation helpful? Give feedback.
-
do you think is possible to modify the "Copy and name the checkpoint file" cell for copy all its checkpoints into trained models folder ? |
Beta Was this translation helpful? Give feedback.
Yes there is a way. I've been doing runs where I have checkpoints saved at every 250 or 500 step intervals.
go to the directory
Dreambooth-Stable-Diffusion/configs/stable-diffusion/
Open to edit the file
v1-finetune_unfrozen.yaml
at around line 106 you will see a block :
Change it so that it looks something like this :