Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

add log_training_trajectory option #549

Open
wants to merge 9 commits into
base: opt_instruct
Choose a base branch
from

Conversation

todpole3
Copy link
Contributor

if set, evaluate and save checkpoints more frequently in early stage of training

@todpole3 todpole3 force-pushed the opt_instruct_training_trajectory branch from 5cd7183 to 54b3e87 Compare December 11, 2022 10:50
@@ -418,6 +418,10 @@ def validate_and_save(
and num_updates >= cfg.dataset.validate_after_updates
and was_successful_step
)
or (
cfg.dataset.log_training_trajectory
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need to save? Just validate?

Copy link
Contributor Author

@todpole3 todpole3 Dec 12, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To plot the trajectory for generation tasks as well (ROUGE); eval will be fast

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants