may use SDMX interface #39
Comments
Thanks for opening this issue @epogrebnyak

The SDMX approach has some advantages, most importantly standardisation of data and metadata. It also brings national and regional data together, which the package currently lacks.

There are some limitations. SDMX data is not available before 2017, which can be a limiting factor because a big advantage of the package is that it gives access to historical releases. To keep this advantage you would need to integrate the SDMX and xls data. I also came across an issue with one of the 2021 releases (corrupted files), and I'm not sure whether the IMF team has fixed those files.

In terms of implementation, this could be problematic because the field names differ slightly between the SDMX data and the xls data. The package also does some renaming and reformatting that would need to be refactored. I'm not sure of everything that would need to change, but I imagine there would be some breaking changes to the UI. One example is the

In terms of the priorities for our work at the ONE Campaign, we are most interested in the data extraction part, which becomes a component of our ETL. We would also need to be quite reactive to new releases, so we would still likely rely on some of our own tooling in that process, in case of breakages and the need to maintain the tool. The advantages of

To benefit both of our purposes, I propose that I repackage the tool we created into a thin API for the SDMX data, which

There are some other enhancements to

Let me know your thoughts on my proposition and if you have other ideas.
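To make the integration point above concrete, here is a minimal sketch of mapping both sources onto one shared set of column names before combining them. It is not the package's or bblocks' actual code. Every field name in the two mappings is a hypothetical placeholder (the thread does not list the real ones), and the xls rows are assumed to be already in long format.

```python
# Hypothetical mappings: the real SDMX and xls field names are not given in
# this thread, so these keys are placeholders for illustration only.
SDMX_TO_COMMON = {
    "REF_AREA": "country_code",
    "CONCEPT": "indicator",
    "TIME_PERIOD": "year",
    "OBS_VALUE": "value",
}
XLS_TO_COMMON = {
    "WEO Country Code": "country_code",
    "WEO Subject Code": "indicator",
    "Year": "year",
    "Value": "value",
}


def harmonise(rows, mapping, source):
    """Rename known fields, drop unknown ones, and tag each row with its source."""
    out = []
    for row in rows:
        renamed = {mapping[key]: val for key, val in row.items() if key in mapping}
        renamed["source"] = source
        out.append(renamed)
    return out


def combine(xls_rows, sdmx_rows):
    """Return pre-2017 style xls rows followed by SDMX rows, in one schema."""
    return harmonise(xls_rows, XLS_TO_COMMON, "xls") + harmonise(
        sdmx_rows, SDMX_TO_COMMON, "sdmx"
    )
```

Keeping a `source` column is one possible way to trace which releases came from which format. Whether that suits the package's existing interface is an open question.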
All good ideas, what is the entry point for SDMX and how is it documented?
The releases come along with an SDMX Data Structure Definition. I would start there. This helper class we created parses the data to a dataframe. You can look at our implementation there.
Is the main action happening here? The SDMX is a zip file and then you process it into a dataframe? Is it roughly a URL -> ZipFile -> pd.DataFrame?
Partly yes. The full extraction pipeline is run by this function
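The URL -> ZipFile -> dataframe flow asked about above could look roughly like the sketch below. It is not the bblocks implementation. It assumes the archive holds an SDMX-ML file whose `Series` elements carry the dimensions as attributes and whose `Obs` children carry the observations. The function names, the "first .xml file" rule and the demo values are all made up for illustration.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from urllib.request import urlopen


def download(url: str) -> bytes:
    """Fetch the raw bytes of the zip archive; the caller supplies the URL."""
    with urlopen(url) as response:
        return response.read()


def local_name(tag: str) -> str:
    """Drop the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def zip_to_rows(payload: bytes) -> list[dict]:
    """Flatten Series/Obs elements of the first .xml file into one dict per Obs."""
    with zipfile.ZipFile(io.BytesIO(payload)) as archive:
        # Assumption: the first .xml entry is the data file, not the DSD.
        xml_name = next(n for n in archive.namelist() if n.lower().endswith(".xml"))
        with archive.open(xml_name) as handle:
            root = ET.parse(handle).getroot()
    rows = []
    for series in root.iter():
        if local_name(series.tag) != "Series":
            continue
        for obs in series:
            if local_name(obs.tag) == "Obs":
                # Series attributes are repeated on every observation row.
                rows.append({**series.attrib, **obs.attrib})
    return rows


def demo_archive() -> bytes:
    """Build a tiny in-memory zip with invented values, so no network is needed."""
    xml = (
        '<DataSet><Series REF_AREA="AAA" CONCEPT="X">'
        '<Obs TIME_PERIOD="2020" OBS_VALUE="1.0"/>'
        '<Obs TIME_PERIOD="2021" OBS_VALUE="2.0"/>'
        "</Series></DataSet>"
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("demo.xml", xml)
    return buffer.getvalue()


rows = zip_to_rows(demo_archive())
```

With pandas installed, `pd.DataFrame(rows)` would turn the list of dicts into a dataframe, and `zip_to_rows(download(url))` would cover the download step for a real release URL.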
See the tools developed by the @ONEcampaign team (@jm-rivera, @lpicci96):
https://github.com/ONEcampaign/bblocks/blob/main/bblocks/import_tools/imf_weo.py