
[FIX] Unify API #1023

Open · wants to merge 77 commits into base: main
Conversation

@elephaint (Contributor) commented May 31, 2024

This is a large refactoring PR, open for discussion. The main goal is to unify the API across different model types and to unify loss functions across different loss types.

Refactoring:

  • Fuses BaseWindows, BaseMultivariate and BaseRecurrent into BaseModel, removing the need for separate classes and unifying the model API across model types. Instead, this PR introduces two model attributes, RECURRENT (True/False) and MULTIVARIATE (True/False), yielding four possible model options. We currently have a model for every combination except a recurrent multivariate model (e.g. a multivariate LSTM), which is now relatively simple to add. In addition, a model can now be recurrent or not, or multivariate or not, on the fly based on the user's input, which simplifies modelling going forward (see the sketch after this list).
  • Unifies model API across all models, adding missing input variables to all model types.
  • Refactors losses, among other things removing unnecessary domain_map functions.
  • Moves loss.domain_map outside of the models into BaseModel.
  • Moves the RevINMultivariate used by TSMixer, TSMixerx and RMoK to common.modules.
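
As an illustration of the attribute-based design described in the first bullet, here is a minimal, hypothetical sketch; the attribute names follow the PR description, and the class names are made up for illustration:

```python
# Hypothetical sketch of the unified BaseModel design described above.
# Attribute names follow the PR description; the real code may differ,
# and the *Like class names are made up for illustration.


class BaseModel:
    RECURRENT = False      # forecast step by step (True) or in one shot (False)
    MULTIVARIATE = False   # model all series jointly (True) or each series separately (False)


class NHITSLike(BaseModel):       # direct, univariate
    RECURRENT = False
    MULTIVARIATE = False


class TSMixerLike(BaseModel):     # direct, multivariate
    RECURRENT = False
    MULTIVARIATE = True


class LSTMLike(BaseModel):        # recurrent, univariate
    RECURRENT = True
    MULTIVARIATE = False


# The missing combination, a recurrent multivariate model (e.g. a multivariate
# LSTM), would simply set RECURRENT = True and MULTIVARIATE = True once added.
```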

Features:

  • All losses are now compatible with all types of models (e.g. univariate/multivariate, direct/recurrent), or an appropriate protection has been added.
  • DistributionLoss now supports the use of quantiles in predict, allowing for easy quantile retrieval for all DistributionLosses.
  • Mixture losses (GMM, PMM and NBMM) now support learned weights for weighted mixture distribution outputs.
  • Mixture losses now support the use of quantiles in predict, allowing for easy quantile retrieval.
  • Improved stability of ISQF by adding softplus protection around some parameters instead of using .abs.
  • Unified API for any quantile or any confidence level during predict for both point and distribution losses (see the sketch after this list).
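
A hedged usage sketch of the unified quantile/level API described above; the exact keyword names accepted by predict (level, quantiles) follow the PR description and may differ in the merged code, and the model and hyperparameter choices are illustrative:

```python
# Minimal sketch, assuming the unified predict API described above; the
# level/quantiles keywords follow the PR description and may differ in the
# merged code, and the hyperparameters are illustrative.
from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS
from neuralforecast.losses.pytorch import DistributionLoss
from neuralforecast.utils import AirPassengersDF

nf = NeuralForecast(
    models=[NHITS(h=12, input_size=24,
                  loss=DistributionLoss(distribution='StudentT'),
                  max_steps=100)],
    freq='M',
)
nf.fit(df=AirPassengersDF)

# Retrieve prediction intervals / quantiles directly at predict time.
preds_levels = nf.predict(level=[80, 95])                   # confidence levels
# preds_quantiles = nf.predict(quantiles=[0.1, 0.5, 0.9])   # or explicit quantiles
```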

Bug fixes:

  • MASE loss now works.
  • Added various protections around parameter combinations that are invalid (e.g. regarding losses).
  • StudentT: increased the default DoF to 3 to reduce unbounded-variance issues.
  • All models are now tested using a test function on the AirPassengers dataset; most models previously had eval: false on their examples while having no other tests, so most models were effectively not tested at all.
  • IQLoss previously did not give monotonic quantiles; now it does (by quantiling the quantiles, see the sketch after this list).
  • When training with both a conformal and a non-conformal method, the latter was also cross-validated to compute conformity scores. This redundant training step is removed in this PR.
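
To illustrate the IQLoss fix, a minimal sketch of the general idea behind "quantiling the quantiles", i.e. rearranging predicted quantiles so they are non-decreasing; this is not the PR's implementation:

```python
# Minimal illustration of enforcing monotonic quantiles ("quantiling the
# quantiles"); this is not the PR's implementation.
import torch


def monotonic_quantiles(quantile_preds: torch.Tensor) -> torch.Tensor:
    """Sort along the last (quantile) dimension so lower quantiles
    never exceed higher ones."""
    return torch.sort(quantile_preds, dim=-1).values


preds = torch.tensor([[10.0, 9.5, 11.0]])    # crossing quantiles: q10 > q50
print(monotonic_quantiles(preds))            # tensor([[ 9.5000, 10.0000, 11.0000]])
```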

Breaking changes:

  • Rewrite of all recurrent models to get rid of the quadratic (in the sequence dimension) space complexity. As a result, it is impossible to load a recurrent model from a previous version into this version.
  • Recurrent models now require an input_size to be given (see the example after this list).
  • TCN and DRNN are now windows models, not recurrent models.
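
A small, hedged example of the new requirement on recurrent models; the hyperparameter values are illustrative only:

```python
# Hedged sketch: under this PR, recurrent models need an explicit input_size.
# Hyperparameter values are illustrative only.
from neuralforecast.models import LSTM

model = LSTM(
    h=12,            # forecast horizon
    input_size=24,   # now required for recurrent models
    max_steps=100,
)
```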

Tests:

  • Added common._model_checks.py, which includes a model testing function. This function runs on every model, ensuring that every model is tested on push (see the sketch below).
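
A hedged sketch of what such a per-model check might look like; the helper name check_model and the hyperparameters are assumptions, and the actual function in common._model_checks.py may differ:

```python
# Hypothetical sketch of a per-model smoke test on AirPassengers; the real
# check in common._model_checks.py may differ.
from neuralforecast import NeuralForecast
from neuralforecast.utils import AirPassengersDF


def check_model(model_cls, horizon: int = 12, **model_kwargs):
    model = model_cls(h=horizon, input_size=2 * horizon, max_steps=2, **model_kwargs)
    nf = NeuralForecast(models=[model], freq='M')
    nf.fit(df=AirPassengersDF)
    preds = nf.predict()
    assert len(preds) == horizon, f"{model_cls.__name__} returned {len(preds)} rows"


if __name__ == '__main__':
    from neuralforecast.models import MLP, NHITS
    for cls in (NHITS, MLP):
        check_model(cls)
```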

Todo:

  • Test models on speed/scaling as compared to current implementation across a set of datasets.
  • Make sure the docstrings of all multivariate models are updated to reflect the additional inputs.


@elephaint elephaint marked this pull request as ready for review July 15, 2024 18:48
@elephaint elephaint linked an issue Jul 22, 2024 that may be closed by this pull request
@marcopeix (Contributor) commented:

I ran my own small experiment, and models are slightly faster, with no degradation in performance. The refactoring looks good to me, but a second opinion would be great on this!

@elephaint (Contributor, Author) replied:

> I ran my own small experiment, and models are slightly faster, with no degradation in performance. The refactoring looks good to me, but a second opinion would be great on this!

Thanks!

@elephaint changed the title from [FIX] Code refactoring to [FIX] Unify API on Oct 17, 2024
Member commented:

i noticed that this file and action_files/test_models/src/evaluation2.py are quite similar. i have a couple of suggestions:

  • this might be a good opportunity to use the utilsforecast evaluation features. we could replace the mae and smape from the losses module and the evaluate function with their utilsforecast counterparts (a sketch follows at the end of this comment).
  • also, it looks like the only difference between this file and the second one is the list of models, correct? if that’s the case, we could combine them into a single file and use fire to pass the list of models. you could then call it in the .github/workflows/ci.yaml file.

the idea would be to abstract the code in the if __name__ == '__main__': clause, something like this:

# suggested refactor: abstract the evaluation into a main() function
from itertools import product

import pandas as pd

# `evaluate` is the existing evaluation helper defined in this module
def main(models: list):
    groups = ['Monthly']
    datasets = ['M3']
    # run every (model, group, dataset) combination and drop empty results
    evaluation = [evaluate(model, dataset, group) for model, group in product(models, groups) for dataset in datasets]
    evaluation = [eval_ for eval_ in evaluation if eval_ is not None]
    evaluation = pd.concat(evaluation)
    evaluation = evaluation[['dataset', 'model', 'time', 'mae', 'smape']]
    evaluation['time'] /= 60  # seconds -> minutes
    # reshape into a (dataset, metric) x model table
    evaluation = evaluation.set_index(['dataset', 'model']).stack().reset_index()
    evaluation.columns = ['dataset', 'model', 'metric', 'val']
    evaluation = evaluation.set_index(['dataset', 'metric', 'model']).unstack().round(3)
    evaluation = evaluation.droplevel(0, 1).reset_index()
    evaluation['AutoARIMA'] = [666.82, 15.35, 3.000]  # reference results
    evaluation.to_csv('data/evaluation.csv')
    print(evaluation.T)

and then you could use fire inside the main clause:

if __name__ == '__main__':
    import fire
    # expose main() as a command-line interface
    fire.Fire(main)

this way, we can run it for different models inside .github/workflows/ci.yaml: python -m action_files.test_models.src.evaluation --models <list of models>. wdyt?
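
For reference, a hedged sketch of the utilsforecast-based evaluation suggested in this comment; the wrapper name evaluate_forecasts and the input column layout are assumptions:

```python
# Hedged sketch of evaluating forecasts with utilsforecast, as suggested above.
# Assumes a long-format DataFrame with unique_id, ds, y and one column per model;
# the wrapper name evaluate_forecasts is made up for illustration.
import pandas as pd
from utilsforecast.evaluation import evaluate
from utilsforecast.losses import mae, smape


def evaluate_forecasts(forecasts_df: pd.DataFrame) -> pd.DataFrame:
    # one row per (unique_id, metric), one column per model
    return evaluate(forecasts_df, metrics=[mae, smape])
```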

Member commented:

this also could apply to action_files/test_models/src/multivariate_evaluation.py. since we are changing models and datasets, we could define main(models: list, dataset: str).

@elephaint (Contributor, Author) commented Nov 6, 2024:

Good idea, one remark - why favour ci over circleci? (I'm ambivalent; I don't know why we would prefer one over the other.)

Member commented:

maybe we could also merge action_files/test_models/src/models.py, action_files/test_models/src/models2.py, and action_files/test_models/src/multivariate_models.py. from what i can tell, the only real difference between them is the list of models. if that’s the case, we could add a new parameter to main—maybe config: str or something similar—and then have a dictionary with different models based on the config. for example: {"multivariate": , ...}, and then we could call it like this: python -m action_files.test_models.src.models --config multivariate.

let me know what you think.
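
A hedged sketch of the config-based model selection suggested above; the dictionary contents, module layout, and model groupings are illustrative only:

```python
# Hedged sketch of selecting model lists via a --config flag, as suggested above.
# The groupings and module layout are illustrative only.
from neuralforecast.models import MLP, NHITS, TSMixer

MODEL_CONFIGS = {
    'univariate': [NHITS, MLP],
    'multivariate': [TSMixer],
}


def main(config: str):
    models = MODEL_CONFIGS[config]
    ...  # run the shared evaluation for the selected models


if __name__ == '__main__':
    import fire
    # e.g. python -m action_files.test_models.src.models --config multivariate
    fire.Fire(main)
```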

@elephaint (Contributor, Author) replied:

Makes sense; then we can still fire up multiple runners (to keep test time under control it makes sense to split the tests).

Comment on lines +14 to +21
# from neuralforecast.models.rnn import RNN
# from neuralforecast.models.tcn import TCN
from neuralforecast.models.lstm import LSTM
from neuralforecast.models.dilated_rnn import DilatedRNN
from neuralforecast.models.deepar import DeepAR
from neuralforecast.models.mlp import MLP
from neuralforecast.models.nhits import NHITS
from neuralforecast.models.nbeats import NBEATS
# from neuralforecast.models.deepar import DeepAR
# from neuralforecast.models.mlp import MLP
# from neuralforecast.models.nhits import NHITS
# from neuralforecast.models.nbeats import NBEATS
Member commented:

is the commented code going to be restored in the future? if this change is permanent, maybe we could delete those lines instead.

@elephaint (Contributor, Author) replied:

I'm kind of treating this file as a local testing file too; we can delete it (it's mainly so that testing locally is faster and you don't need to type all of that every time).

Comment on lines +173 to +174
" n_series = 1\n",
" self.n_series = n_series \n",
Member commented:

should we set n_series = None in this case? using n_series = 1 might lead to unexpected bugs, though i'm not entirely sure if that's the case here.

@elephaint (Contributor, Author) replied:

We need n_series=1 for the univariate models in a few other places. I could introduce additional helper variables to deal with that situation, but that feels a bit unnecessary perhaps.


Successfully merging this pull request may close these issues.

[Recurrent Models] do not support step_size > 1; in-sample prediction (predict_insample) cannot be used