908: Flexible model column #915

sbfnk · 2024-09-18T14:09:45Z

Description

This PR closes #908. It also closes #914.

It removes all hard coding of the model column, which introduces breaking changes both in summarise_scores() and get_pairwise_comparison() as discussed in #908 (comment)

The change in summarise_scores() could be made unbroken by setting the default for by back to by = "model", perhaps for a transition period.

In get_pairwise_comparison() I realised this change by splitting the existing by argument into by and compare arguments, rather than (as previously) interpreting the first vector element to be special if equal "model" or otherwise not. This means that this can now make pairwise comparisons amongst everything but also that it breaks existing code (which could be mitigated by a default compare = "model" as above).

Checklist

My PR is based on a package issue and I have explicitly linked it.
I have included the target issue or issues in the PR title as follows: issue-number: PR title
I have tested my changes locally.
I have added or updated unit tests where necessary.
I have updated the documentation if required.
I have built the package locally and run rebuilt docs using roxygen2.
My code follows the established coding standards and I have run lintr::lint_package() to check for style issues introduced by my changes.
I have added a news item linked to this PR.
I have reviewed CI checks for this PR and addressed them as far as I am able.

sbfnk · 2024-09-18T15:24:47Z

R-CMD-check for v4.0 fails for unrelated reasons, reported at mitchelloharawild/distributional#128

R/summarise_scores.R

seabbs

I like compare alot and more generally like this change.

See suggestion for having "model" as a soft requirement and so using it as a default with appropriate warnings.

Does this change mean that all current forecast_units need to be updated to include "model"?

seabbs · 2024-09-18T15:49:44Z

We also might want to solicit some user feedback here.

sbfnk · 2024-09-18T16:12:03Z

Does this change mean that all current forecast_units need to be updated to include "model"?

Don't they already?

> get_forecast_unit(score(example_sample_continuous))
[1] "location"        "location_name"   "target_end_date" "target_type"    
[5] "forecast_date"   "model"           "horizon"

seabbs · 2024-09-18T16:17:28Z

isn't that because when it was saved model was being treated as a special var and we were inserting it into the forecast unit?

seabbs · 2024-09-18T16:20:08Z

The answer to my question is no we had no special handling for model in forecast unit and so this will work without changes (

scoringutils/R/convenience-functions.R

Line 276 in 2c363c8

set_forecast_unit <- function(data, forecast_unit) {

)

nikosbosse

Nice, thanks a lot!

My intuition is we might want to set "model" as the default for compare (or by, where appropriate) and just have an informative error if you don't have a "model" column available.
That would make things simple for everyone who has a model column and everyone else has to specify the argument anyway.

I don't have a very strong preference though. What do you think?

R/pairwise-comparisons.R

R/summarise_scores.R

seabbs · 2024-09-21T14:27:54Z

I don't have a very strong preference though. What do you think?

+1 for this option

nikosbosse · 2024-09-22T15:18:45Z

@sbfnk what do you think? Also I'm happy to do the leg-work of updating the PR if you prefer

sbfnk · 2024-09-23T09:52:29Z

So just to check we're all in agreement we want to have

summarise_scores(scores) ## summarise by model, error if column not present (1)
summarise_scores(scores, by = NULL) ## summarise across everything (2)
summarise_scores(scores, by = "target_type") ## summarise by target type (3)
summarise_scores(scores, by = c("model", "target_type")) ## summarise by model and target type (4)

I'd find a by = NULL default (which will work whatever the columns are) a more intuitive choice but am happy to make the change if agreement is that this is preferred.

seabbs · 2024-09-23T09:58:41Z

I think 2 as the default is nice if we think more users will just want to summarise a single model than will be annoyed by the breaking change of going from model as default to not.

I don't really have a strong view either way and we can always change this default latter so I don't see it as absolutely crucial to be 💯 on the right choice.

nikosbosse · 2024-09-23T19:43:23Z

So just to check we're all in agreement we want to have

I think the following should error:

summarise_scores(scores, by = NULL) ## summarise across everything (2)

It currently error as well, but we're not catching it correctly.

In summarise_scores() we'd need to update

assert_subset(by, names(scores), empty.ok = TRUE)

to

assert_subset(by, names(scores), empty.ok = FALSE)

and add e.g. the following test:

test_that("summarise_scores() errors if `by = NULL", {
  expect_error(
    summarise_scores(scores_quantile, by = NULL),
    "Assertion on 'by' failed: Must be a subset of"
  )
})

(EDIT: Just did that in #919)

Other than that I'd suggest to set/keep the following defaults:

summarise_scores <- function(scores, by = "model", fun = mean, ...)
get_pairwise_comparisons <- function(scores,  compare,  by = NULL/character(0))

for get_pairwise_comparisons: we usually use NULL for empty arguments. character(0) has the advantage that it can be directly plugged in split(), whereas with NULL we'd have to handle that. I'm still leaning slightly towards NULL as the default for consistency with other functions. Not a very strong preference though

Co-authored-by: Nikos Bosse <[email protected]>

* use neg_log_score * update docs, revert previous change * update docs

nikosbosse

Nice, this looks great!
I added a suggestion to correct a typo. Other than that only one minor bit:

get_pairwise_comparisons() has currently a default by = NULL
add_relative_skill() has a default by = character(0) - that should probably be changed to by = NULL

R/pairwise-comparisons.R

Co-authored-by: Nikos Bosse <[email protected]>

nikosbosse

Nice!

This change was introduced in epiforecasts/scoringutils#915 and subsequently broke our tests.

* remove `model` parameter from scoringutils call This change was introduced in epiforecasts/scoringutils#915 and subsequently broke our tests. * hide warnings from tests A whole bunch of new warnings started popping up because scoringutils was trying to take the mean of character columns. Wrapping everything in `suppressWarnings()` helps. * okay linter

sbfnk added 8 commits September 18, 2024 13:45

remove check for model column

1f65943

remove default from summarise scores

42d149c

split by argument to pairwise_comparison()

72b05c0

update unit tests

6cec8be

add global

f7ce235

update documentation

9c4d7ea

fix check errors

0258d86

update manuscript

3d0cca2

seabbs reviewed Sep 18, 2024

View reviewed changes

R/summarise_scores.R Outdated Show resolved Hide resolved

seabbs reviewed Sep 18, 2024

View reviewed changes

sbfnk added 2 commits September 19, 2024 09:59

add warnings if compare/by is unset

a6da128

update tests

bcc7d69

nikosbosse reviewed Sep 20, 2024

View reviewed changes

Merge branch 'main' into flexible-model-column

5567621

nikosbosse and others added 3 commits September 24, 2024 09:19

Merge branch 'main' into flexible-model-column

0d5c78f

add test and fix bug

d8e123a

improve explanation in the docs

d776de2

Co-authored-by: Nikos Bosse <[email protected]>

sbfnk mentioned this pull request Sep 25, 2024

remove check_columns_present #922

Open

sbfnk and others added 2 commits September 25, 2024 10:44

restore model default

2be8f53

fix names (#920)

0df4173

nikosbosse and others added 2 commits September 25, 2024 10:45

#896 Update documentation of log score (#918)

8244923

* use neg_log_score * update docs, revert previous change * update docs

Merge remote-tracking branch 'origin/main' into flexible-model-column

964e92c

sbfnk enabled auto-merge (squash) September 25, 2024 10:23

sbfnk requested a review from nikosbosse September 25, 2024 10:24

nikosbosse reviewed Sep 25, 2024

View reviewed changes

R/pairwise-comparisons.R Outdated Show resolved Hide resolved

R/pairwise-comparisons.R Outdated Show resolved Hide resolved

sbfnk and others added 2 commits September 25, 2024 17:48

fix typo

b178503

Co-authored-by: Nikos Bosse <[email protected]>

make NULL the default

2c5a298

sbfnk requested a review from nikosbosse September 25, 2024 16:49

render doc

26a6270

nikosbosse approved these changes Sep 25, 2024

View reviewed changes

sbfnk merged commit 06ab379 into main Sep 25, 2024
9 checks passed

sbfnk deleted the flexible-model-column branch September 25, 2024 17:15

zkamvar mentioned this pull request Sep 25, 2024

transform_pmf_model_out() broken by new scoringutils hubverse-org/hubEvals#56

Closed

zkamvar added a commit to hubverse-org/hubEvals that referenced this pull request Oct 3, 2024

remove model parameter from scoringutils call

0ab3702

This change was introduced in epiforecasts/scoringutils#915 and subsequently broke our tests.

zkamvar mentioned this pull request Oct 3, 2024

fix transform_pmf_model_out hubverse-org/hubEvals#57

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

908: Flexible model column #915

908: Flexible model column #915

sbfnk commented Sep 18, 2024 •

edited

Loading

sbfnk commented Sep 18, 2024

seabbs left a comment

seabbs commented Sep 18, 2024

sbfnk commented Sep 18, 2024

seabbs commented Sep 18, 2024

seabbs commented Sep 18, 2024

nikosbosse left a comment

seabbs commented Sep 21, 2024

nikosbosse commented Sep 22, 2024

sbfnk commented Sep 23, 2024

seabbs commented Sep 23, 2024

nikosbosse commented Sep 23, 2024 •

edited

Loading

nikosbosse left a comment

nikosbosse left a comment

908: Flexible model column #915

908: Flexible model column #915

Conversation

sbfnk commented Sep 18, 2024 • edited Loading

Description

Checklist

sbfnk commented Sep 18, 2024

seabbs left a comment

Choose a reason for hiding this comment

seabbs commented Sep 18, 2024

sbfnk commented Sep 18, 2024

seabbs commented Sep 18, 2024

seabbs commented Sep 18, 2024

nikosbosse left a comment

Choose a reason for hiding this comment

seabbs commented Sep 21, 2024

nikosbosse commented Sep 22, 2024

sbfnk commented Sep 23, 2024

seabbs commented Sep 23, 2024

nikosbosse commented Sep 23, 2024 • edited Loading

nikosbosse left a comment

Choose a reason for hiding this comment

nikosbosse left a comment

Choose a reason for hiding this comment

sbfnk commented Sep 18, 2024 •

edited

Loading

nikosbosse commented Sep 23, 2024 •

edited

Loading