'Recent' metrics: Should Metrics be Link-able? #498

pjz · 2021-03-04T20:26:42Z

pjz
Mar 4, 2021

Given that in many models the influence of older data tends to lessen, shouldn't we be able to apply the same logic to our metrics? For instance, the Accuracy of a model is somewhat interesting, but the recent accuracy is more interesting than the total accuracy. I thus thought I'd be able to do something like

recent_accuracy = metrics.Accuracy() | stats.EWMean()

to get a measure of recent accuracy, but alas, Accuracy is not a Link. So I'm reduced to doing a Rolling Accuracy over a kind of arbitrarily sized chosen 'recent' sample size. Is there something better? Am I Doing It Wrong?

Answered by raphaelsty

Mar 4, 2021

To calculate the accuracy with an exponential average I would do it like this:

from river import stats

metric = stats.EWMean(alpha = 0.5)

y_pred = [1, 1, 1, 1, 1]
y_true = [0, 0, 1, 1, 1]

for pred, true in zip(y_pred, y_true):
    metric.update(pred == true)
    
print(metric.get())

1.0

So I'm reduced to doing a Rolling Accuracy over a kind of arbitrarily sized chosen 'recent' sample size. You will choose an equally arbitrary alpha coefficient for the exponential average. Personally, I find the RollingAccuracy more easily interpretable.

from river import metrics

metric = metrics.Rolling(metrics.Accuracy(), window_size=3)

y_pred = [1, 1, 1, 1, 1]
y_true = [0, 0, 1, 1, 1]

for pred, true

View full answer

raphaelsty · 2021-03-04T21:06:52Z

raphaelsty
Mar 4, 2021
Maintainer

To calculate the accuracy with an exponential average I would do it like this:

from river import stats

metric = stats.EWMean(alpha = 0.5)

y_pred = [1, 1, 1, 1, 1]
y_true = [0, 0, 1, 1, 1]

for pred, true in zip(y_pred, y_true):
    metric.update(pred == true)
    
print(metric.get())

1.0

So I'm reduced to doing a Rolling Accuracy over a kind of arbitrarily sized chosen 'recent' sample size. You will choose an equally arbitrary alpha coefficient for the exponential average. Personally, I find the RollingAccuracy more easily interpretable.

from river import metrics

metric = metrics.Rolling(metrics.Accuracy(), window_size=3)

y_pred = [1, 1, 1, 1, 1]
y_true = [0, 0, 1, 1, 1]

for pred, true in zip(y_pred, y_true):
    metric.update(y_pred = pred, y_true = true)

Rolling of size 3 Accuracy: 100%

Raphaël

0 replies

MaxHalford · 2021-03-04T22:50:26Z

MaxHalford
Mar 4, 2021
Maintainer

Hey there. As @raphaelsty suggests (and it seems you already know) you can wrap a metric with Rolling to calculate the metric over window.

We could in theory allow you to compute exponentially weighted averages for metrics that are decomposable, but how would that be interpreted? I find it really confusing to say that the "exponentially weighted accuracy is equal to x". On the contrary, communicating the accuracy over a window of fixed size is much easier to interpret. What do you think?

1 reply

pjz Mar 7, 2021
Author

We could in theory allow you to compute exponentially weighted averages for metrics that are decomposable, but how would that be interpreted?

Accuracy with online learning feels a bit slippery to me because the model at the end isn't really the same model you had at the beginning - it has learned along the way. Thus recent values for accuracy would seem to be more valuable for comparing how well a model is doing than total values. An EWA seems like a bit of a better fit to me than an equally weighted window, though I'll admit that alpha selection is similar to window selection... but still without the weighting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

'Recent' metrics: Should Metrics be Link-able? #498

{{title}}

Replies: 2 comments 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

'Recent' metrics: Should Metrics be Link-able? #498

pjz Mar 4, 2021

Replies: 2 comments · 1 reply

raphaelsty Mar 4, 2021 Maintainer

MaxHalford Mar 4, 2021 Maintainer

pjz Mar 7, 2021 Author

pjz
Mar 4, 2021

Replies: 2 comments 1 reply

raphaelsty
Mar 4, 2021
Maintainer

MaxHalford
Mar 4, 2021
Maintainer

pjz Mar 7, 2021
Author