You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a request for evaluation of ordinal categorical predictions.
An example of predictions for an ordinal target can be found here. For that target, the category levels are "low", "moderate", "high", "very high".
To align with what's been proposed for nominal forecasts, we could have an input format as follows, where the predicted_label and observed columns are ordered factors:
Setting notation, let $f(k)$ and $F(k)$ be the submitted predictive pmf and the implied predictive cdf obtained via $F(k) = \sum_{j \leq k} f(k)$, with $K$ total categories so that $k \in {1, \ldots, K}$ and the observed value $y \in {1, \ldots, K}$. Additionally, adopt the convention that $f(0) = F(0) = 0$. Some scores/metrics that it would be nice to support for ordinal forecasts include:
log score: $\log[f(y)]$
ranked probability score: $\sum_j [F(j) - 1(y \leq j)]^2$, i.e. the sum across ordered category levels of the squared difference between the predictive cdf and the empirical cdf corresponding to a point mass at the observed category level
PIT values, with randomization: $F(k - 1) + U * f(k)$ where $U \sim Unif(0, 1)$
For both nominal and ordinal categorical forecasts, there are also all sorts of things based on summaries of confusion matrices, e.g. precision, recall, and F scores. For me personally those are less of a priority.
The text was updated successfully, but these errors were encountered:
Now that #837 this should be readily doable. @nikosbosse have you had any thoughts about if there is anything we can do here to reduce code duplication across nominal and ordinal predictions?
One option is just using internal functions fairly heavily and the other more complicated option is some kind of s3 class hierarchy but I am not totally sure that is worth it?
This is a request for evaluation of ordinal categorical predictions.
An example of predictions for an ordinal target can be found here. For that target, the category levels are
"low", "moderate", "high", "very high"
.To align with what's been proposed for nominal forecasts, we could have an input format as follows, where the
predicted_label
andobserved
columns are ordered factors:Setting notation, let$f(k)$ and $F(k)$ be the submitted predictive pmf and the implied predictive cdf obtained via $F(k) = \sum_{j \leq k} f(k)$ , with $K$ total categories so that $k \in {1, \ldots, K}$ and the observed value $y \in {1, \ldots, K}$ . Additionally, adopt the convention that $f(0) = F(0) = 0$ . Some scores/metrics that it would be nice to support for ordinal forecasts include:
For both nominal and ordinal categorical forecasts, there are also all sorts of things based on summaries of confusion matrices, e.g. precision, recall, and F scores. For me personally those are less of a priority.
The text was updated successfully, but these errors were encountered: