From a2efb133eaee4ed73830e424bbcc798a3cc23c42 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Max=20L=C3=BCbbering?= <2804731+le1nux@users.noreply.github.com> Date: Tue, 17 Sep 2024 15:03:00 +0200 Subject: [PATCH] Update CHANGELOG_DEV.md --- CHANGELOG_DEV.md | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) diff --git a/CHANGELOG_DEV.md b/CHANGELOG_DEV.md index 7d4fb0cd..36a41c9c 100644 --- a/CHANGELOG_DEV.md +++ b/CHANGELOG_DEV.md @@ -65,3 +65,23 @@ This [PR](https://github.com/Modalities/modalities/pull/236) removes all code re **Breaking changes:** * None +* + +## PR #254 Warmstart infrastructure switch + +This PR mainly addresses the warmstart of model training, e.g., after GPU crashes. + +**General Changes** +* Fixes issue #242 +* Warmstarts with changing infrastructure (e.g.,. different number of GPUs) are now supported. +* Restructures the settings part of the configs to +* Adds various checks for consistency of model training (e.g., target tokens and number of dataset tokens mismatch) +* Refactors all configs to be runnable again +* Adds an interactive jupyter notebook-based Tutorial on how to use Modalities. (merged from PR #239 ) +* Adds a warmstart tutorial +* TrainingReportGenerator that creates a report on the training setup and prints out warnings in case of inconsistencies. +* Activation Checkpointing is now a component +* Added further NumberConversion routines + +**Breaking Changes** +* the settings part of the configs have been completely refactored