-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade to V1 #437
Upgrade to V1 #437
Conversation
Pull Request Test Coverage Report for Build 9635884851Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9635881058Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9635919510Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636056013Details
💛 - Coveralls |
Flash-loader is consistently slower with the new packages. Not sure why. |
Pull Request Test Coverage Report for Build 9636181156Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636181338Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636221019Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636258361Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636290716Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636630541Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636663837Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636664048Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636700237Details
💛 - Coveralls |
Pull Request Test Coverage Report for Build 9636700470Details
💛 - Coveralls |
This directly comes from the update of dask from 2023.5.0 to 2023.12.0, tested on python 3.9. |
Are you using the refactored branch or the one on main? Considering that the creation of buffer files has is independant of dask, I assume it's another package that causes this issue (maybe pandas?) |
It is from dask, the performance changes if I only change the version of dask (from 2024.12.0 to 2024.5.0). There seem to be some other issues with the map_overlap stuff after loading the dataframe. see #448 |
…itt from offset dict to avoid applying twice.
fix spelling in all files using Code Spell Checker
Pydantic error handling
- Update GitHub Actions workflow for documentation - Modify pyproject.toml for release adjustments
change user and system config files to config_v1.yaml add system wide .env
update tests for env variables
Fix release and update documentation
use user platformdir also for user config
@zain-sohail I made a pre-release for v1 from this now. If the package looks good, and the docs build also looks fine, I suggest you approve this PR, we make a final pre-v1 release and then merge and release v1, what do you think? Reviewing the whole thing again seems unfeasible to me. |
@zain-sohail Did you test the package? What is your plan on how and when we can merge this? |
Hi Laurenz. I have tested the package in a fresh env and also looked at the docs and everything seems fine. Dima just wanted #534 to be merged before we put out v1. I'd hence suggest we just merge it for now. Since the lab loader doesnt have integration tests right now, I can't promise no bugs but the flash loader should run bug free. And I can fix anything later as well. |
But what is the advantage with adding #534 still here, compared to first merging this, releasing 1.0, and then add it in 1.1? This is not a breaking change, but adds functionality. So it should be a minor release, from my point of view. This pr is already way too large, and adding here will make it more and more difficult to locate potential problems. |
That's also true but I'd actually say it is a breaking change due to me moving some hard coded values to the config files, which the users definitely need to have for the loader to work. |
I see, I was not aware of this. But then let's complete #534 with tests (mostly dataframe module missing) and careful reviews. This might take some time though from my side, maybe @kutnyakhov also can add a review. |
Yes, I am also waiting on Dima's review but currently he is on holiday. I'll try to add the tests today or latest this weekend. |
Having had a brief look at the config changes in #534, they don't appear to be breaking changes to me, I. E. A config pre-534 should work with the code there (or we can make it work). Thus I would still argue to merge this one now, and then build further on that release. We should not extend this pr any further if not absolutely necessary. |
I'd suggest that we merge the config model and config file changes to v1 at least before merging with main. |
Why? I don't see the need for this. Adding optional config elements is not a breaking change. #551 does something similar. Let's not complicate things unnecessarily. |
Because these are not optional changes to flash loaders, only for general sed they are optional. Earlier, these three fields were hard coded and I moved them out of flash.utils.get_channels and into the config. index: Optional[Union[Sequence[str], str]] = None
formats: Optional[Union[Sequence[str], str]] = None
fill_formats: Optional[Union[Sequence[str], str]] = None This is definitely a breaking change in the schema at least for flash data. |
Why don't you provide default values for these in #534 ? Then it would not break anything. |
I thought we weren't including loader specific defaults in default.yaml but if that's the case then you are right. |
Yes, we used to handle it like this, true. I don't completely oversee why you want to make these parameters part of the config, but I suggest that you keep the code in a shape where it would also work without. That should be the case anyways for all optional parameters, I think. Maybe we should make sure that the code itself contains defaults for all optional parameters, and not the default config, which contains defaults for all required parameters. |
Collection PR for the update to V1.
This contains breaking changes to current behavior and config file layouts.
Items to be addressed here:
src
layout Src layout #508