
Adds warning message when deterministic training loss stagnates too quickly in partial BNNs #24

Merged: 6 commits, Nov 19, 2024

Conversation

@sarah-allec sarah-allec (Contributor) commented Nov 5, 2024

Context

Certain deterministic NN hyperparameters may cause overfitting that manifests as the training loss decreasing very rapidly at first and then stagnating early. A warning message to the user with suggested remedies would be helpful. Closes #11

Description

After the first epoch, the change in training loss is monitored; any time the loss drops by more than 25% between epochs (see the figure below for justification of the 25% threshold), a warning message is shown to the user:

UserWarning: The deterministic training loss is decreasing rapidly - learning and accuracy may be improved by increasing the batch size, adjusting MAP sigma, or modifying the learning rate.

[Figure: dnn_loss — training-loss curves justifying the 25% threshold]

Changes in the codebase

  1. Added a function called monitor_dnn_loss in neurobayes/utils/utils.py that prints a warning whenever the loss has decreased by more than 25% at any epoch.
  2. Added a call to monitor_dnn_loss in the training loop (DeterministicNN.train()) in flax_nets/deterministic_nn.py.
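The changes above can be sketched as follows. This is a hypothetical reconstruction based only on the PR description, not the actual code in neurobayes/utils/utils.py; the real signature and threshold handling may differ. The function and threshold names follow the text, and the warning string is the one quoted in the PR:

```python
import warnings

import numpy as np


def monitor_dnn_loss(loss: np.ndarray, threshold: float = 0.25) -> None:
    """Warn if the latest epoch's training loss dropped by more than
    `threshold` (as a fraction) relative to the previous epoch.

    Intentionally returns None; emitting the warning is its only effect.
    """
    if len(loss) < 2:
        return None  # not enough history yet to compute a change
    change = np.diff(loss)[-1]  # loss[-1] - loss[-2]
    if change < 0 and abs(change) / loss[-2] > threshold:
        warnings.warn(
            "The deterministic training loss is decreasing rapidly - "
            "learning and accuracy may be improved by increasing the "
            "batch size, adjusting MAP sigma, or modifying the learning rate.",
            UserWarning,
        )
    return None
```

In the training loop, this would be called once per epoch with the accumulated loss history, e.g. `monitor_dnn_loss(np.array(loss_history))`.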

@ziatdinovmax ziatdinovmax (Owner) left a comment

Looks good overall. A few suggestions:

  • Please clarify the monitor_dnn_loss function's return behavior: either add an explicit return value or document that it intentionally returns None.
  • Please include a length check (if len(loss) > 2:) to avoid a potential IndexError with np.diff for the edge case where loss has fewer than two elements.
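The second point can be illustrated with a minimal sketch of the edge case (the array contents here are made up for illustration; the exact guard used in the merged code may differ):

```python
import numpy as np

# Why a length guard is needed: np.diff on a one-element loss history
# returns an empty array, so indexing its last element would raise an
# IndexError during the very first epoch.
loss = np.array([0.83])  # only one epoch recorded so far
assert np.diff(loss).size == 0  # np.diff(loss)[-1] would raise IndexError

if len(loss) >= 2:  # minimal guard in the spirit of the review comment
    latest_change = np.diff(loss)[-1]  # safe: at least one difference exists
```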

@sarah-allec sarah-allec (Contributor, Author)

I implemented the suggestions - thank you! If everything looks good, I will submit the PR.

@ziatdinovmax ziatdinovmax marked this pull request as ready for review November 19, 2024 17:00
@ziatdinovmax ziatdinovmax (Owner) left a comment

Looks good! There is a seemingly unrelated issue with python-3.10 tests failing, which I will need to figure out later, but this one is ready to be merged.

@ziatdinovmax ziatdinovmax merged commit e735fea into ziatdinovmax:main Nov 19, 2024
2 of 3 checks passed