
[FEAT] Conformal Predictions in NeuralForecast #1171

Merged: 27 commits from feat/conformal-prediction into Nixtla:main on Oct 11, 2024

Conversation

JQGoh
Contributor

@JQGoh JQGoh commented Oct 3, 2024

Rationale and Changes

Caveats

  • This does not support Spark DataFrames (SparkDataFrame)
  • Quantile-type losses are not supported (the various quantile outputs are not conformalized)


@elephaint
Contributor

This is great. Let me know if I can help you with this.

@JQGoh JQGoh changed the title Feat/conformal prediction [FEAT] Conformal Predictions in NeuralForecast Oct 3, 2024
@JQGoh
Contributor Author

JQGoh commented Oct 3, 2024

@elephaint @marcopeix @jmoralez Please review

cc: @valeman This may be of interest to you

@JQGoh JQGoh marked this pull request as ready for review October 3, 2024 15:21
Contributor

@elephaint elephaint left a comment


Awesome work! I did a first pass, I'll clone the branch tomorrow to take a deeper dive!

Review comments (resolved) on: nbs/core.ipynb, nbs/docs/tutorials/20_conformal_prediction.ipynb, nbs/utils.ipynb, neuralforecast/core.py
@JQGoh
Contributor Author

JQGoh commented Oct 7, 2024

@elephaint Please check the revised PR: I have removed the UNSUPPORTED_LOSSES_CONFORMAL variable and introduced an optional argument to conformalize quantiles.

Review comment (resolved) on: neuralforecast/core.py
Contributor

@elephaint elephaint left a comment


Great new stuff! I think most of the elements are there. At a higher level, I'm still pondering whether this is the best way of including this in the API (I'll get back to that).

Couple of points for now:

  • Please remove enable_quantiles everywhere; I think it's unnecessary, and the distinction between the various point losses is arbitrary;
  • Please remove the -conformal tag in the output names, so that the output names are identical to the normal DistributionLoss output names;
  • cross_validation needs the option for conformal intervals too;
  • The example needs a bit more work; for example, below is a code snippet for creating a somewhat nicer plot.

Example code in the tutorial (this already assumes the -conformal tag will be removed from the output name):

import pandas as pd
import matplotlib.pyplot as plt

from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS
from neuralforecast.losses.pytorch import DistributionLoss
# ConformalIntervals was introduced in this PR (import location assumed)
from neuralforecast.utils import AirPassengersPanel, ConformalIntervals

# Hold out the last 12 months for forecasting
AirPassengersPanel_train = AirPassengersPanel[AirPassengersPanel['ds'] < AirPassengersPanel['ds'].values[-12]]

horizon = 12
input_size = 24

conformal_intervals = ConformalIntervals()

models = [
    NHITS(h=horizon, input_size=input_size, max_steps=100),
    NHITS(h=horizon, input_size=input_size, max_steps=100, loss=DistributionLoss("Normal", level=[90])),
]
nf = NeuralForecast(models=models, freq='ME')
nf.fit(AirPassengersPanel_train, conformal_intervals=conformal_intervals)
preds = nf.predict(level=[90])  # conformal intervals for the point-loss model (level argument per this PR)

fig, (ax1, ax2) = plt.subplots(2, 1, figsize=(20, 7))
plot_df = pd.concat([AirPassengersPanel_train, preds])
plot_df = plot_df[plot_df['unique_id'] == 'Airline1'].drop(['unique_id', 'trend', 'y_[lag12]'], axis=1).iloc[-50:]

ax1.plot(plot_df['ds'], plot_df['y'], c='black', label='True')
ax1.plot(plot_df['ds'], plot_df['NHITS1'], c='blue', label='median')
ax1.fill_between(x=plot_df['ds'][-12:],
                 y1=plot_df['NHITS1-lo-90'][-12:].values,
                 y2=plot_df['NHITS1-hi-90'][-12:].values,
                 alpha=0.4, label='level 90')
ax1.set_title('AirPassengers Forecast', fontsize=18)
ax1.set_ylabel('Monthly Passengers', fontsize=15)
ax1.legend(prop={'size': 10})
ax1.grid()

ax2.plot(plot_df['ds'], plot_df['y'], c='black', label='True')
ax2.plot(plot_df['ds'], plot_df['NHITS'], c='blue', label='median')
ax2.fill_between(x=plot_df['ds'][-12:],
                 y1=plot_df['NHITS-lo-90'][-12:].values,
                 y2=plot_df['NHITS-hi-90'][-12:].values,
                 alpha=0.4, label='level 90')
ax2.set_ylabel('Monthly Passengers', fontsize=15)
ax2.set_xlabel('Timestamp [t]', fontsize=15)
ax2.legend(prop={'size': 10})
ax2.grid()

Review comments (resolved) on: nbs/core.ipynb
@JQGoh
Contributor Author

JQGoh commented Oct 7, 2024

@elephaint Thanks for the detailed review.

cross_validation needs the option for conformal intervals too;

If I understand correctly, we want to enable users to call cross_validation and have the prediction outputs include conformal prediction intervals.

However, my understanding is that generating conformal predictions takes two steps:

  1. First, compute conformity scores and store them in _cs_df
  2. Second, during prediction, compute prediction intervals from the stored conformity scores and the given level

It seems counter-intuitive to me that we could get conformal predictions by directly executing cross_validation. I hope to hear your elaboration on this.

PS: Even if we want to revise this, could we do so in a subsequent PR?
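The two-step procedure described above can be sketched as follows. This is a minimal NumPy illustration of split-conformal intervals, not the PR's actual implementation; both function names are hypothetical:

```python
import numpy as np

def conformity_scores(y_calib, y_hat_calib):
    # Step 1: conformity scores = absolute residuals on a held-out
    # calibration window (what the PR stores in _cs_df)
    return np.abs(y_calib - y_hat_calib)

def conformal_interval(y_hat, scores, level=90):
    # Step 2: at prediction time, widen the point forecast by the
    # empirical (1 - alpha) quantile of the stored scores
    alpha = 1 - level / 100
    q = np.quantile(scores, 1 - alpha)
    return y_hat - q, y_hat + q

scores = conformity_scores(np.array([10., 12., 9., 11.]),
                           np.array([11., 11., 10., 10.]))
lo, hi = conformal_interval(np.array([12.0]), scores, level=90)
# all scores are 1.0 here, so the 90% interval is [11.0, 13.0]
```

This matches the split: the scores are computed once and cached, while the interval itself only depends on the requested level, which is why prediction can reuse stored scores.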

@elephaint
Contributor

elephaint commented Oct 8, 2024

@JQGoh I pushed most of the suggested fixes already, saving us some time.

I think we still need:

  • Cross validation prediction intervals (we should be able to follow MLForecast example)
  • Some more tests (I need to think about the relevant tests)
  • Spark DataFrame support (?)

Other than that I'm really happy with what you made and I think we're almost there.
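For the cross-validation item above, the conformity scores can come from out-of-sample residuals of rolling windows, in the spirit of the MLForecast example mentioned. A small self-contained sketch of that idea (the helper and the naive forecaster are hypothetical, not code from this PR):

```python
import numpy as np

def rolling_cv_residuals(y, horizon, n_windows, forecast_fn):
    # Collect out-of-sample absolute residuals from n_windows rolling
    # splits; these serve as conformity scores for interval calibration.
    residuals = []
    for w in range(n_windows, 0, -1):
        cutoff = len(y) - w * horizon
        train, test = y[:cutoff], y[cutoff:cutoff + horizon]
        y_hat = forecast_fn(train, horizon)
        residuals.append(np.abs(test - y_hat))
    return np.concatenate(residuals)

# naive last-value forecaster as a stand-in model
naive = lambda train, h: np.repeat(train[-1], h)
y = np.arange(20, dtype=float)
scores = rolling_cv_residuals(y, horizon=2, n_windows=3, forecast_fn=naive)
# each window contributes residuals [1.0, 2.0] for this linear series
```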

@JQGoh
Contributor Author

JQGoh commented Oct 8, 2024


@elephaint Thanks for your help with the revision. I will think about the items you mentioned. By the way, it appears that for now we only introduce conformal predictions for point losses, but not for the quantile outputs? If that is the case, I think we should mention this in the channel, since I previously said I wanted to introduce an optional parameter that supports this.

@elephaint
Contributor


Correct; users can still get prediction intervals over an arbitrary quantile output by fitting a model with e.g. QuantileLoss(q=0.8). Maybe we'll change this in the future, but for now it feels a bit too meta to include prediction intervals over prediction intervals.
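As background on why fitting a single QuantileLoss already yields an interval bound: the quantile (pinball) loss penalizes errors asymmetrically, so its minimizer is the q-th conditional quantile. A plain-NumPy illustration of the loss itself (not NeuralForecast's QuantileLoss class):

```python
import numpy as np

def pinball_loss(y, y_hat, q):
    # Asymmetric penalty: under-forecasting costs q per unit of error,
    # over-forecasting costs (1 - q) per unit
    diff = y - y_hat
    return float(np.mean(np.maximum(q * diff, (q - 1) * diff)))

# With q = 0.8, under-forecasting is penalized 4x more than
# over-forecasting, pushing the fitted output up to the 80th percentile
loss_under = pinball_loss(np.array([10.0]), np.array([8.0]), q=0.8)   # 1.6
loss_over = pinball_loss(np.array([10.0]), np.array([12.0]), q=0.8)   # 0.4
```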

@JQGoh
Contributor Author

JQGoh commented Oct 8, 2024


That is indeed neater than providing conformal predictions on the various quantile outputs (I also agree that would be too "meta"). A great improvement suggested by you 👍

Review comment (resolved) on: nbs/core.ipynb
@elephaint
Contributor

elephaint commented Oct 10, 2024

@JQGoh Thanks again! I added some protections and moved the conformity score calculation into the parts where we are able to compute it. I think it will work like this; the only situation I haven't fully covered is a stored dataset (df=None), although I'm not sure that's a major issue.

Let me know what you think.

@JQGoh
Contributor Author

JQGoh commented Oct 10, 2024


@elephaint Thanks for adding the changes regarding the protection measures, LGTM.

@elephaint
Contributor

Ok, thanks again @JQGoh - I'm happy to merge

@JQGoh
Contributor Author

JQGoh commented Oct 11, 2024

@elephaint Thanks for your help with the user experience improvement 🙏

@elephaint elephaint merged commit d0549b6 into Nixtla:main Oct 11, 2024
14 checks passed
@JQGoh JQGoh deleted the feat/conformal-prediction branch October 14, 2024 13:30
Successfully merging this pull request may close these issues.

Conformal Prediction in NeuralForecast
4 participants