Improve MulEven performance for RVV #2334

lsrcz · 2024-09-27T20:23:12Z

This pull request improves the performance of MulEven by using masked multiplication instead of merging the results after computing the higher and lower parts.

A similar optimization can be applied to the MulOdd operator, though this would require a MaskedMulHighOr operator, which is currently unavailable.

For 8-bit elements, MulEven currently uses the immediate value 0x5555, which requires two instructions to construct on RISC-V. There may be a potential optimization by constructing the mask manually instead of relying on Dup128MaskFromMaskBits, allowing the use of the smaller immediate 0x55. However, it's just a scalar instruction, so I am unsure if we want to do so.

jan-wassenberg

Nice, looks like we're fusing OddEven into the computation. Thanks for sending the PR!
Is this also automatically found?

I would expect scalar instructions are fine, the scalar pipes might be running ahead and/or idle.

lsrcz · 2024-09-30T14:18:58Z

Yes, a synthesizer automatically finds this. I have no idea why the CI fails though -- it seems that it doesn't come from this pull request.

For MulOdd, do you think adding the masked MulHigh and do the optimization is worthwhile?

jan-wassenberg · 2024-09-30T14:27:24Z

Nice. I agree CI failures are unrelated and will fix some of that shortly.

I think MulOdd would be rarely used, so let's leave it as-is for now :)

improve MulEven performance

c6a4bd3

jan-wassenberg approved these changes Sep 30, 2024

View reviewed changes

jan-wassenberg added the ready to pull label Sep 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve MulEven performance for RVV #2334

Improve MulEven performance for RVV #2334

lsrcz commented Sep 27, 2024

jan-wassenberg left a comment

lsrcz commented Sep 30, 2024

jan-wassenberg commented Sep 30, 2024

Improve MulEven performance for RVV #2334

Are you sure you want to change the base?

Improve MulEven performance for RVV #2334

Conversation

lsrcz commented Sep 27, 2024

jan-wassenberg left a comment

Choose a reason for hiding this comment

lsrcz commented Sep 30, 2024

jan-wassenberg commented Sep 30, 2024