Fix normed dtype #557

rettigl · 2025-01-26T20:08:29Z

Sets the dtype of the normalized data to that of the unnormalized data.
Currently, it takes the dtype of the normalization histogram.
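
For illustration, a minimal sketch of how such a dtype change can arise and how a cast back avoids it. This is not the code from this PR: plain NumPy arrays stand in for the binned data and the normalization histogram, the names `binned` and `norm_hist` are hypothetical, and the float32/float64 combination is an assumption chosen to make the effect visible.

```python
import numpy as np

# Hypothetical stand-ins: binned counts stored as float32,
# normalization histogram stored as float64.
binned = np.array([10, 20, 30], dtype=np.float32)
norm_hist = np.array([2.0, 4.0, 5.0], dtype=np.float64)

# Plain division promotes the result to the wider of the two dtypes,
# so the normalized data ends up with the histogram's dtype.
promoted = binned / norm_hist

# Casting back keeps the dtype of the unnormalized data.
normalized = (binned / norm_hist).astype(binned.dtype)

print(promoted.dtype, normalized.dtype)
```

The cast is applied after the division, so the division itself still runs at the higher precision and only the stored result is narrowed.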

coveralls · 2025-01-26T20:15:34Z

Pull Request Test Coverage Report for Build 13205728880

Details

7 of 9 (77.78%) changed or added relevant lines in 3 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.003%) to 92.177%

| Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
| --- | --- | --- | --- |
| src/sed/core/processor.py | 5 | 7 | 71.43% |

Totals
Change from base Build 13167417292:	0.003%
Covered Lines:	7706
Relevant Lines:	8360

💛 - Coveralls

use sed binning for histogram computation

zain-sohail · 2025-02-07T17:33:44Z

src/sed/core/processor.py

                )
            else:
                self._normalization_histogram = normalization_histogram_from_timed_dataframe(
                    self._timed_dataframe,
                    axis,
                    self._binned.coords[axis].values,
                    self._config["dataframe"]["timed_dataframe_unit_time"],
+                    hist_mode=self.config["binning"]["hist_mode"],


This seems repeated. It can probably be moved out of the loop.

I have now changed the structure to repeat less code.

zain-sohail · 2025-02-07T17:34:32Z

src/sed/binning/binning.py


    Returns:
        xr.DataArray: Calculated normalization histogram.
    """
-    bins = df[axis].map_partitions(


Is this removed due to the updated dask version?

I am now using our optimized binning for the timed dataframe. It is somewhat faster, does the sequential binning using the num_cores parameter, and shows the progress bar. The previous solution used pandas cut to define the bins, which requires bin edges rather than the bin centers that our function takes.
I once checked that the two produce very similar results (there was a very tiny difference, I think because values on a bin edge are assigned to either the left or the right bin depending on the method).
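
To make the two points above concrete, here is a small sketch using NumPy only. It does not reproduce the sed binning function or the removed `map_partitions` code: `centers_to_edges` and `count_in_bins` are hypothetical helpers, and equally spaced bins are assumed. It shows how edges follow from centers, and how values that sit exactly on an edge are counted differently under left-closed and right-closed bins (`pandas.cut` uses right-closed intervals by default).

```python
import numpy as np

def centers_to_edges(centers):
    """Hypothetical helper: edges of equally spaced bins from their centers."""
    centers = np.asarray(centers, dtype=float)
    step = centers[1] - centers[0]
    return np.concatenate(([centers[0] - step / 2], centers + step / 2))

def count_in_bins(values, edges, right):
    """Count values per bin; right=True means bins are (a, b], else [a, b)."""
    idx = np.digitize(values, edges, right=right) - 1
    valid = (idx >= 0) & (idx < len(edges) - 1)  # drop values outside all bins
    return np.bincount(idx[valid], minlength=len(edges) - 1)

centers = np.array([0.5, 1.5, 2.5])
edges = centers_to_edges(centers)

# Every value lies exactly on a bin edge, which is the case where
# the two conventions disagree.
values = np.array([0.0, 1.0, 1.0, 2.0, 3.0])

left_closed = count_in_bins(values, edges, right=False)
right_closed = count_in_bins(values, edges, right=True)
print(edges, left_closed, right_closed)
```

For data that rarely falls exactly on an edge, the two conventions give nearly identical histograms, which is consistent with the very small difference described above.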

rettigl changed the base branch from main to v1_feature_branch January 26, 2025 20:08

rettigl requested a review from zain-sohail February 3, 2025 21:12

rettigl changed the base branch from v1_feature_branch to main February 5, 2025 21:57

rettigl added 3 commits February 6, 2025 10:45

change dtype of normalized data to that of unnormalized data

5d41991

use sed binning for histogram computation

add test for dtype

c0053ff

pass config parameters to histogram calculation

ae464dd

rettigl force-pushed the fix_normed_dtype branch from 9962e78 to ae464dd February 6, 2025 09:45

zain-sohail reviewed Feb 7, 2025


select dataframe before binning call

6cc7655

zain-sohail approved these changes Feb 12, 2025

