Issue #585: allow users to specify columns and forecast unit in `as_forecast()` #641

nikosbosse · 2024-02-21T00:43:27Z

Description

This PR closes #585

As discussed in #585, it is convenient for users to be able to specify the forecast unit, as well as any column renaming they need, directly in as_forecast().

This PR:

- adds additional arguments observed, predicted, model, forecast_unit, quantile_level, sample_id to as_forecast() that allow users to specify the desired forecast unit as well as desired changes to column names
- adds checks to validate the inputs to as_forecast()
- adds tests to check the behaviour is as expected
- updates the NEWS file
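To make the idea behind the new arguments concrete, here is a minimal sketch in base R. It is not the scoringutils implementation: `standardise_columns()` is a hypothetical helper, and the rule that observed, predicted and model are always kept alongside the forecast unit is an assumption made for illustration.

```r
# Hypothetical helper for illustration; NOT the scoringutils implementation.
standardise_columns <- function(data,
                                forecast_unit = NULL,
                                observed = NULL,
                                predicted = NULL,
                                model = NULL) {
  # Map each standard column name to the user-supplied name (or NULL).
  mapping <- list(observed = observed, predicted = predicted, model = model)
  for (standard_name in names(mapping)) {
    user_name <- mapping[[standard_name]]
    if (is.null(user_name)) next
    if (!user_name %in% names(data)) {
      stop("Column '", user_name, "' not found in data.")
    }
    names(data)[names(data) == user_name] <- standard_name
  }
  # Assumption: keep the forecast unit columns plus the standard columns.
  if (!is.null(forecast_unit)) {
    protected <- c("observed", "predicted", "model")
    data <- data[, names(data) %in% c(forecast_unit, protected), drop = FALSE]
  }
  data
}

df <- data.frame(
  location = c("DE", "DE"),
  note = c("x", "y"),
  truth = c(1, 2),
  pred = c(1.5, 1.8),
  who = c("a", "b")
)
renamed <- standardise_columns(
  df,
  forecast_unit = "location",
  observed = "truth",
  predicted = "pred",
  model = "who"
)
names(renamed)
```

The renaming happens before the forecast unit is applied, so the forecast unit can refer to columns by the names they have in the original data.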

Additional thoughts and considerations:

- The order of the arguments could be different. For example, we might want forecast_unit to be the first argument. Strong opinions? At the moment I put it there because I felt it was more natural to first specify the columns to be renamed, then the forecast unit, and then the special columns. But 🤷
- We could in principle create extra methods for sample and quantile-based forecasts (and then move the arguments sample_id and quantile_level to those methods). As mentioned in #585, I feel this would lead to unnecessary complexity (having to call as_forecast() --> as_forecast.default() --> as_forecast() --> as_forecast.forecast_sample() just to hide an argument that is clearly explained in the docs).

Checklist

My PR is based on a package issue and I have explicitly linked it.
I have included the target issue or issues in the PR title as follows: issue-number: PR title
I have tested my changes locally.
I have added or updated unit tests where necessary.
I have updated the documentation if required.
I have built the package locally and rebuilt the docs using roxygen2.
My code follows the established coding standards and I have run lintr::lint_package() to check for style issues introduced by my changes.
I have added a news item linked to this PR.
I have reviewed CI checks for this PR and addressed them as far as I am able.

codecov · 2024-02-21T00:48:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.73%. Comparing base (4d3f003) to head (23b708e).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #641      +/-   ##
==========================================
+ Coverage   87.53%   87.73%   +0.20%     
==========================================
  Files          21       21              
  Lines        1757     1786      +29     
==========================================
+ Hits         1538     1567      +29     
  Misses        219      219

R/validate.R

seabbs

Aside from my comment about the default args I like this as is. Happy to watch the discussion play out.

toshiakiasakura · 2024-02-23T09:59:25Z

R/validate.R

+as_forecast.default <- function(data,
+                                observed = NULL,
+                                predicted = NULL,
+                                model = NULL,
+                                forecast_unit = NULL,
+                                quantile_level = NULL,
+                                sample_id = NULL,
+                                ...) {


Is there any reason to accept other arguments via ...? In case users misspell an argument name, it seems better to me to omit ... so that every unknown argument is rejected.

Would it be better to include a type = NULL argument for users who want to be sure about the forecast type? For example, if type = "quantile" is specified, we could raise an error if the quantile_level argument is not given, or simply raise an error if the returned class does not match this argument. I think this is related to #603.

Ah those are good points!

I think as_forecast.default() needs ... because the generic has it, and a method always needs to include all the arguments of its generic. But I like the idea of having a forecast_type argument.

Is that true for ... args though? I'm not sure it is.

Just checked, it's true for ... args as well:

checking S3 generic/method consistency (1.7s)
   as_forecast:
     function(data, ...)
   as_forecast.default:
     function(data, forecast_unit, forecast_type, observed, predicted, model, quantile_level, sample_id)
   See section ‘Generic functions and methods’ in the ‘Writing R Extensions’ manual.
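The consistency rule behind this check can be illustrated with a toy generic. The names `as_thing()` and `as_thing.default()` are made up for this sketch and are not part of scoringutils: the method adds its own argument but still carries `...` because the generic does.

```r
# Toy generic with `...` in its signature (hypothetical name).
as_thing <- function(data, ...) {
  UseMethod("as_thing")
}

# The method may add arguments of its own, but it keeps `...`
# so that its signature stays consistent with the generic.
as_thing.default <- function(data, label = NULL, ...) {
  list(data = data, label = label)
}

# `unused` is absorbed by `...` and has no effect on the result.
result <- as_thing(1:3, label = "a", unused = TRUE)
names(result)
```

A side effect of keeping `...` is visible in the last call: an argument the method does not know about is accepted silently, which is the behaviour the thread goes on to discuss.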

So instead, would it be better to raise the error for unexpected arguments manually?
For example, something like the following at the beginning of the function:

extra_args <- setdiff(names(list(...)), names(formals(as_forecast.default)))
if (length(extra_args) > 0) {
  stop(paste("Unknown argument(s):", paste(extra_args, collapse = ", ")))
}
# Current implementation.

I am also happy with it as it is!

Hm. Since additional arguments just have no effect at all, I would personally not throw an error for this. Given that the ... are there I would not expect an error as a user if I provide an additional argument.
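For reference, the manual check proposed above can be tried out on a toy function. This is only a sketch of the proposal, which the thread did not adopt; `as_thing_strict()` is a hypothetical name, and the handling of unnamed extra arguments is an addition made here for robustness.

```r
# Hypothetical function that rejects anything captured by `...`.
as_thing_strict <- function(data, label = NULL, ...) {
  extra <- list(...)
  if (length(extra) > 0) {
    extra_names <- names(extra)
    # Unnamed extras have no names at all, or an empty string as name.
    if (is.null(extra_names)) {
      extra_names <- rep("", length(extra))
    }
    extra_names[extra_names == ""] <- "<unnamed>"
    stop("Unknown argument(s): ", paste(extra_names, collapse = ", "))
  }
  list(data = data, label = label)
}

# A misspelled argument name now raises an error instead of being ignored.
msg <- tryCatch(
  as_thing_strict(1:3, labl = "a"),
  error = function(e) conditionMessage(e)
)
msg
```

Note that `...` only ever contains arguments that did not match a formal argument, so there is no need to compare against the formals of the function.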

nikosbosse · 2024-02-23T14:20:10Z

I just pushed a new commit that updates the documentation a bit.

Open questions (either before merging this PR or for a new PR):

- Should as_forecast() have a forecast_type argument? This could be used to warn the user if their forecast type does not match the one they want. In the case of binary/point this may be useful in particular.
- Do we want the arguments to be observed = NULL or observed = "observed" (see discussion above)?

seabbs · 2024-02-23T14:51:34Z

I agree on making it easy for the user here though good to have a back and forth on the different options.

I like the idea of allowing people to manually specify the type as a safety check.

I think either default arg option is fine so happy to leave as is

nikosbosse · 2024-02-23T18:26:53Z

Perfect. I made the following updates:

- as_forecast() now has an argument forecast_type
- added documentation and updated tests for that

Also note that compared to the very first proposal, I changed the order of the arguments, which now is

as_forecast.default <- function(data,
                                forecast_unit = NULL,
                                forecast_type = NULL,
                                observed = NULL,
                                predicted = NULL,
                                model = NULL,
                                quantile_level = NULL,
                                sample_id = NULL,
                                ...) {

It felt more natural that way, as forecast_unit and forecast_type are the arguments you should use almost every time, and then we have all arguments related to renaming lumped together.
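As a rough illustration of how a forecast_type safety check might behave, here is a self-contained sketch. It does not reproduce the package code: `infer_type()` and `check_forecast_type()` are hypothetical helpers, and inferring the type purely from the presence of a quantile_level or sample_id column is a simplifying assumption (it ignores the binary/point distinction, for instance).

```r
# Hypothetical type inference based only on which columns are present.
infer_type <- function(data) {
  if ("quantile_level" %in% names(data)) return("quantile")
  if ("sample_id" %in% names(data)) return("sample")
  "point"
}

# Hypothetical check: error if the requested type does not match.
check_forecast_type <- function(data, forecast_type = NULL) {
  actual <- infer_type(data)
  if (!is.null(forecast_type) && !identical(actual, forecast_type)) {
    stop(
      "Forecast type determined as '", actual,
      "', but '", forecast_type, "' was requested."
    )
  }
  actual
}

quantile_data <- data.frame(
  observed = c(1, 1),
  predicted = c(0.8, 1.3),
  quantile_level = c(0.25, 0.75)
)

# Passes, because the requested type matches the inferred one.
detected <- check_forecast_type(quantile_data, forecast_type = "quantile")
detected
```

With forecast_type left as NULL the check is skipped, so existing calls would keep working unchanged under this design.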

nikosbosse · 2024-02-23T18:57:59Z

Unsure where the failing snapshots on macOS-latest are coming from. Can't reproduce that locally...
Seems like some update somewhere outside of our control triggered this... Even old tests that were previously passing fail now. I guess we just have to wait?

nikosbosse · 2024-02-24T00:22:16Z

Update: given that macOS-latest is failing on main as well I'd be tempted to merge in this PR and #633 regardless...
That would allow me to keep developing :)

seabbs · 2024-02-26T10:38:55Z

This snapshot failure is due to the recent ggplot2 update. I think we can ignore it.

R/validate.R

nikosbosse · 2024-02-26T16:56:25Z

Are we happy to merge as is (bypassing branch protection)?

nikosbosse requested review from seabbs and toshiakiasakura February 21, 2024 02:46

seabbs reviewed Feb 22, 2024

seabbs approved these changes Feb 22, 2024

toshiakiasakura reviewed Feb 23, 2024

seabbs reviewed Feb 26, 2024

nikosbosse and others added 8 commits February 26, 2024 22:42

Update functionality of as_forceast()

b695529

Update tests

af48859

Update readme

64a3e89

Update News

e0c4eb4

Automatic readme update [ci skip]

3c95baa

Nonsensical commit to trigger CI changes

ecad6f2

Improve documentation

00952b5

Add an argument forecast_type to as_forecast()

842feac

Fix linting issue, update docs

23b708e

seabbs force-pushed the update-as_forecast() branch from 393cc54 to 23b708e on February 26, 2024 22:42

nikosbosse merged commit 66f139f into main Feb 26, 2024
10 of 12 checks passed

nikosbosse deleted the update-as_forecast() branch February 26, 2024 23:54
