Change directory structure to better accommodate different data sources and drivers #17

AndyMcAliley · 2022-03-24T20:07:16Z

Following up on this discussion, it will be cumbersome to add new data sources and drivers. The folders created when the pipeline is run could be fully organized by data source and driver type, but they are not.

For example, after the pipeline is run, the directory structure in 1_fetch looks like this:

├─out
├───dynamic_mntoha/
├───obs_mntoha/
├───lake_metadata.csv
├─tmp
├───dynamic_mntoha/
└───obs_mntoha/

Some issues with this organization system:

No subfolders in tmp/ or out/. At best, future data sources must be identified based on a suffix (e.g. _mntoha). At worst, there is no suffix (as with lake_metadata.csv), so the situation is ripe for file collisions that result in a file being overwritten or used for the wrong data source.
There's no distinction between NLDAS drivers and GCM drivers.

The text was updated successfully, but these errors were encountered:

AndyMcAliley · 2022-03-24T20:13:06Z

A better way to organize the files might be this:

├─out
├───mntoha
├─────nldas
├─────gcm_access
├─────gcm_gfdl
├─────gcm_... (four more GCM types)
├─────clarity
├─────ice_flags
├─────temperature_observations
├─────lake_metadata.csv
├───large_midwest_footprint
├─────nldas
├─────gcm_access
├─────gcm_gfdl
├─────gcm_... (four more GCM types)
├─────clarity
├─────ice_flags
├─────temperature_observations
├─────lake_metadata.csv
├─tmp
├───mntoha
├─────nldas
├─────gcm_access
├─────gcm_gfdl
├─────gcm_... (four more GCM types)
├─────clarity
├─────ice_flags
├─────temperature_observations
├───large_midwest_footprint
├─────nldas
├─────gcm_access
├─────gcm_gfdl
├─────gcm_... (four more GCM types)
├─────clarity
├─────ice_flags
├─────temperature_observations

lindsayplatt · 2022-03-28T15:47:01Z

I have been using the suffix/prefix approach over in lake-temperature-out. I'm not super satisfied by it because you end up having to scroll through a lot of files, so I like the idea of a nested approach!

AndyMcAliley self-assigned this Mar 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change directory structure to better accommodate different data sources and drivers #17

Change directory structure to better accommodate different data sources and drivers #17

AndyMcAliley commented Mar 24, 2022

AndyMcAliley commented Mar 24, 2022 •

edited

Loading

lindsayplatt commented Mar 28, 2022

Change directory structure to better accommodate different data sources and drivers #17

Change directory structure to better accommodate different data sources and drivers #17

Comments

AndyMcAliley commented Mar 24, 2022

AndyMcAliley commented Mar 24, 2022 • edited Loading

lindsayplatt commented Mar 28, 2022

AndyMcAliley commented Mar 24, 2022 •

edited

Loading