More additional tests on data in regional data #312

RichardMN · 2021-04-24T18:31:41Z

Implementation of more tests from #302, intended to be merged into #307

Checks that the number of regions at level 2 >= the number of regions at level 1
Checks that there is at most 1 level 1 region coded to NA
Checks that there is at most 1 level 2 region coded to NA
Checks that the number of level 1 regions is identical in download of level 1 and level 2 dat

This is a rough implementation.

It currently has a stand-alone wrapper to apply purrr because it only is run once for each country class, but then needs to be told what is the maximum level available from that country class. Notionally, this could allow us to go beyond level 2, but the code doesn't have that recursive flexibility yet. It may be possible to merge the wrapper with the existing file. I've removed the download option because these tests only make sense with download = TRUE.

The files and possibly the functions probably should be renamed.

It calls get_regional_data and I think it doing this since this is the "front end" for most users and there could still be glitches between the data class and the output of get_regional_data.

It's not quick.

Checks that the number of regions at level 2 >= the number of regions at level 1 Checks that there is at most 1 level 1 region coded to NA Checks that there is at most 1 level 2 region coded to NA Checks that the number of level 1 regions is identical in download of level 1 and level 2 dat

…region_checks

RichardMN · 2021-06-07T19:15:41Z

It's now hard to see from the comparison against the older version of the branch what this changes, but the changes are fairly minor. I think it probably useful to have roughly three separate strands of tests (which may not all be framed as tests):

unit tests - does the code do what we expect it to do under various (mostly canned) inputs, working mostly at small unit levels (we have this)
data availability tests - does our system successfully extract "some data" from the sources (we have this and run it nightly and I like the dashboard)
data sanity checks - does the data that our system pulls make some sort of sense, and has there possibly been some change to the underlying source which doesn't break the data availability test but may mean the information we are providing isn't reliable, whether for new data, or for older data, or for both

I think this PR and #307 are both trying to provide something like 3, and that it's useful to have. It's sort of a canary.

It's not necessary and we've done without it for a long time. I can tinker with my PR and am happy to try to work with @joseph-palmer on #307, but also content if the feeling right now is that this can be put aside in favour of other efforts.

github-actions · 2021-08-07T01:48:56Z

This PR has been flagged as stale due to lack of activity

joseph-palmer and others added 3 commits April 23, 2021 11:27

test that level 1 and level 2 regions are not all NA

fa7c252

Merge branch 'master' into additional_tests

e41b13c

RichardMN mentioned this pull request Apr 24, 2021

Additional tests for get_regional_data #302

Open

8 tasks

RichardMN added 3 commits April 24, 2021 21:45

Narrow to avoid non Country-specific classes

16f0f99

Qualifying select and filter in test code

c854d2d

Merge remote-tracking branch 'upstream/master' into additional_tests_…

32e2051

…region_checks

seabbs mentioned this pull request Jun 4, 2021

Additional tests #307

Closed

Adjust to only be run on regional type datasets

0801639

joseph-palmer mentioned this pull request Jun 14, 2021

[REVIEW]: covidregionaldata: Subnational data for COVID-19 epidemiology openjournals/joss-reviews#3290

Closed

40 tasks

github-actions bot added the no-pr-activity label Aug 7, 2021

seabbs changed the base branch from additional_tests to master February 4, 2022 15:03

Bisaloo self-assigned this Feb 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More additional tests on data in regional data #312

More additional tests on data in regional data #312

RichardMN commented Apr 24, 2021

RichardMN commented Jun 7, 2021

github-actions bot commented Aug 7, 2021

More additional tests on data in regional data #312

Are you sure you want to change the base?

More additional tests on data in regional data #312

Conversation

RichardMN commented Apr 24, 2021

RichardMN commented Jun 7, 2021

github-actions bot commented Aug 7, 2021