Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide data package for running the regression test #101

Open
pdebuyl opened this issue Oct 4, 2024 · 3 comments
Open

Provide data package for running the regression test #101

pdebuyl opened this issue Oct 4, 2024 · 3 comments

Comments

@pdebuyl
Copy link
Contributor

pdebuyl commented Oct 4, 2024

It would be extremely nice if a sample of data to run ORAC would be provided.

Getting the correct set of input files (emissivity, etc) for a specific scan is not that trivial and it would allow the user to focus on making sure that they have compiled and configured the software properly.

The data for the regression.py program + maybe a few examples (including a GEO) would be nice. I am willing to provide a full configuration for SEVIRI.

It is possible to upload a few GB to a zenodo entry, which could be pointed to in the ORAC documentation.

@adamcpovey
Copy link
Collaborator

I agree that would be nice. Legally, we can't distribute someone else's data. I already have a folder that contains all of the files necessary to run regression.py so if someone else wants to work out the licences, I can package it.

@pdebuyl
Copy link
Contributor Author

pdebuyl commented Oct 4, 2024

For the purpose of running tests of ORAC, you could probably have the authorization to provide a copy of one dataset (I mean, one granule/scan/etc). @simonrp84 did this for his PyCoxMunk implementation and provides one SEVIRI file here https://zenodo.org/records/7886737

A lot of the american (NASA/NOAA) data is public domain, for instance. As a EUMETSAT member (at work I mean), I can contact them to ask for the authorization.

NOAA AVHRR is under the "Level-1 and Atmosphere Archive & Distribution System" DAAC: can't find the direct download link but the policy is here: https://modaps.modaps.eosdis.nasa.gov/services/faq/LAADS_Data-Use_Citation_Policies.pdf

Atmospheric Science Data Center requests citation but does not limit distrbution: https://asdc.larc.nasa.gov/citing-data

CAMEL is under the "land processes" DAAC: https://lpdaac.usgs.gov/products/cam5k30cfv003/ -- no restriction: https://lpdaac.usgs.gov/data/data-citation-and-policies/

@simonrp84
Copy link
Collaborator

Under article 10.2 of the license agreement I believe that you and I are allowed to redistribute any original numerical SEVIRI data, Pierre. Adam and the others are only allowed to distribute the "core" SEVIRI data, which is the hourly L1.5 data.

The NASA ancillary data used by ORAC is, to my knowledge, freely redistributable. As is the ERA5 data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants