Replies: 2 comments
-
I think it would be great to have a plugin mechanism for the dataset, similar to the model. When I used the codebase in a new computing environment, this was a large part of the integration. Such a mechanism would be very useful. Do you think it would be worth supporting the ERA5 mirror in the format it's stored in here? Because this data has a globus endpoint, it's the most common mirror of ERA5 that I have seen on a few different computing environments. If you think it would be worth supporting, I wrote a version of This format is fine for initial conditions, but it isn't too sustainable for the scoring pipeline. I originally loaded the data from this directory structure. Eventually, though, I ended up reworking the data loader into a script to transform the ERA5 data into a 73-channel preprocessed version. For the |
Beta Was this translation helpful? Give feedback.
-
Great to hear from you @ankurmahesh. Indeed implementing a plugin as an entrypoint for data sources make sense. However, for now it is possible to score with a custom data source using the python API like this: https://github.com/NVIDIA/earth2mip#basic-inference. We've been discussing the merits of CLIs vs python APIs. We'll plan to support both, but python APIs will always be more flexible for new use cases. We could use your feedback on this. |
Beta Was this translation helpful? Give feedback.
-
@NickGeneva How do you think we should support custom data sources? We can support certain file formats (i.e. hdf5+json) in the main code base, but for sometimes users won't want to transform the data. Should we have a similar plugin mechanism for the data as the models? Do we need a config layer for this?
@ankurmahesh Any thoughts?
Beta Was this translation helpful? Give feedback.
All reactions