Fillna does not work if fields_group is not None #1851

LeetaH666 · 2024-09-26T02:59:42Z

🐛 Bug Description

The Fillna processor has no effect when fields_group is not None: it assigns into df.values, which can be a copy of the data, so df itself is left unchanged.

To Reproduce

Use any model and specify fields_group for the Fillna processor.
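
A minimal standalone sketch of the failure mode, without qlib. The helper `buggy_fillna` and the column names are hypothetical and only mimic "write into df.values"; they are not qlib's code. It assumes a frame whose columns have mixed dtypes, a case where pandas builds a fresh array for `df.values`; whether that is the exact trigger inside qlib is an assumption.

```python
import numpy as np
import pandas as pd


def buggy_fillna(df, cols, fill_value=0.0):
    """Hypothetical fill that writes into df.values (illustration only)."""
    nan_select = np.isnan(df.values)
    # Only touch the requested columns.
    nan_select[:, ~df.columns.isin(cols)] = False
    # df.values is a temporary array here, so this write never reaches df.
    df.values[nan_select] = fill_value
    return df


# Mixed dtypes (float + int) force pandas to assemble a new array for df.values.
df = pd.DataFrame({"f0": [1.0, np.nan], "f1": [np.nan, 2.0], "count": [1, 2]})

nan_before = int(df.isna().sum().sum())
buggy_fillna(df, ["f0", "f1"])
nan_after = int(df.isna().sum().sum())

print(nan_before, nan_after)
```

If both printed counts are the same, the fill was silently dropped, which is the behavior described above.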

Expected Behavior

No NaN values remain in the selected fields_group after calling Fillna.

Additional Notes

Same as the issue here: #1307 (comment).

LeetaH666 · 2024-09-26T03:11:44Z

I think simply assigning back through .loc would be enough:

    def __call__(self, df):
        cols = get_group_columns(df, self.fields_group)
        df.loc[:, cols] = df.loc[:, cols].fillna(self.fill_value)
        return df
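
A self-contained sketch of the same idea that can be run without qlib. `FillnaSketch` and the local `get_group_columns` are hypothetical stand-ins written for this example (the real helper may behave differently); the frame is assumed to use two-level columns of the form (group, name).

```python
import numpy as np
import pandas as pd


def get_group_columns(df, group):
    """Hypothetical stand-in: columns under a top-level group, or all columns if group is None."""
    if group is None:
        return df.columns
    return df.columns[df.columns.get_loc(group)]


class FillnaSketch:
    """Illustrative processor that fills NaN only inside one column group."""

    def __init__(self, fields_group=None, fill_value=0):
        self.fields_group = fields_group
        self.fill_value = fill_value

    def __call__(self, df):
        cols = get_group_columns(df, self.fields_group)
        # Assign through .loc so the result is written into df itself.
        df.loc[:, cols] = df.loc[:, cols].fillna(self.fill_value)
        return df


columns = pd.MultiIndex.from_tuples(
    [("feature", "f0"), ("feature", "f1"), ("label", "y")]
)
df = pd.DataFrame(
    [[1.0, np.nan, np.nan], [np.nan, 2.0, 3.0]],
    columns=columns,
)

out = FillnaSketch(fields_group="feature")(df)
print(out)
```

Only the "feature" group should be filled; the NaN under "label" is left alone because it is outside fields_group.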

LeetaH666 · 2024-09-26T04:13:00Z

Or, if you want to use numpy for speed (I get about a 10x speedup), first assign df.values (or df.to_numpy()) to a variable, then fill that array and assign it back:

    def __call__(self, df):
        if self.fields_group is None:
            df.fillna(self.fill_value, inplace=True)
        else:
            cols = get_group_columns(df, self.fields_group)
            # this implementation is extremely slow
            # df.fillna({col: self.fill_value for col in cols}, inplace=True)

            # Like qlib.data.dataset.processor.Fillna, use numpy to speed things up,
            # but first bind the array to a variable instead of writing into df.values.
            # copy=True guarantees a writable array that is independent of df.
            df_values = df[cols].to_numpy(copy=True)
            nan_select = np.isnan(df_values)
            # Fill the NaN entries, then assign the whole block back to df.
            df_values[nan_select] = self.fill_value
            df.loc[:, cols] = df_values
        return df
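
A quick way to check that the numpy variant gives the same result as the .loc variant, as a standalone sketch. The function names, the random test frame, and the plain list `cols` (standing in for the output of get_group_columns) are assumptions made for this example; no timing numbers are claimed here.

```python
import numpy as np
import pandas as pd


def fillna_loc(df, cols, fill_value=0.0):
    """Pure pandas variant: fillna on the selected columns, written back via .loc."""
    df.loc[:, cols] = df.loc[:, cols].fillna(fill_value)
    return df


def fillna_numpy(df, cols, fill_value=0.0):
    """numpy variant: fill an explicit copy of the block, then assign it back."""
    values = df[cols].to_numpy(copy=True)
    values[np.isnan(values)] = fill_value
    df.loc[:, cols] = values
    return df


# Random float frame with roughly 20% NaN entries.
rng = np.random.default_rng(0)
data = rng.normal(size=(1000, 6))
data[rng.random(data.shape) < 0.2] = np.nan
base = pd.DataFrame(data, columns=[f"c{i}" for i in range(6)])

cols = ["c0", "c1", "c2"]
others = ["c3", "c4", "c5"]

a = fillna_loc(base.copy(), cols)
b = fillna_numpy(base.copy(), cols)

print(a.equals(b))
```

To compare speed on your own data, both functions can be wrapped in `timeit.timeit` with a fresh `base.copy()` per call.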

LeetaH666 added the bug (Something isn't working) label on Sep 26, 2024
