You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Zarr is a storage layer for multidimensional numerical arrays. It allows array data to be chunked, compressed, and read to / written from a range of different storage mediums. Via gcsfs, zarr can store its data directly in google cloud storage buckets. Xarray recently gained the ability to read and write to zarr stores. We will need to figure out how to use zarr in Azure.
The following combination of packages will likely provide an ideal way to work with CAMHD datasets in the cloud: xarray + dask + zarr + gcsfs (or whatever the equivalent is on Azure).
Zarr is a storage layer for multidimensional numerical arrays. It allows array data to be chunked, compressed, and read to / written from a range of different storage mediums. Via gcsfs, zarr can store its data directly in google cloud storage buckets. Xarray recently gained the ability to read and write to zarr stores. We will need to figure out how to use zarr in Azure.
The following combination of packages will likely provide an ideal way to work with CAMHD datasets in the cloud: xarray + dask + zarr + gcsfs (or whatever the equivalent is on Azure).
To get spun up on zarr, take the following steps:
The text was updated successfully, but these errors were encountered: