-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consolidating existing stores? #22
Comments
I think there's a third option of automated use of linked PR against existing stores? Where the input PCollection is just the zarr.storage.FSStore of the existing store. |
Also pangeo-forge/pangeo-forge-recipes#556 falls into this category. |
I think this is what I will do for the ClimSim |
I am afraid I do not quite understand what that third option is? |
Run a pipeline like this on Dataflow: from pangeo_forge_recipes.transforms import ConsolidateCoordinateDimensions
existing_paths: list[str] = get_existing_paths_from_bigquery(...)
def path_to_fsstore(path: str) -> zarr.storage.FSStore:
...
return store
recipe = (
beam.Create(existing_paths)
| beam.Map(path_to_fsstore)
| ConsolidateCoordinateDimensions()
) |
Ahhhhh, yes that makes sense. I could do that in retrospect once, and then add such a stage to new recipes. |
Still relevant. I am copying the successful ingestions over to the public buckets and catalog them in |
Just going through old issues. I think this might actually be addressed by our current QC (i.e. unconsolidated stores are not passing the tests?), but would need to check that |
Since we are currently not performing consolidation (waiting for pangeo-forge/pangeo-forge-recipes#575), we have two options for the future:
The text was updated successfully, but these errors were encountered: