Allow optimizing mask conversions on x64 as well #110195

tannergooding · 2024-11-26T18:20:41Z

This mostly just extends the mask conversion optimization to light up on x64 as well. To achieve that, it adds the slightly different handling x64 needs for the conversion cost and ensures the right operand is accessed.

It additionally adds support for one more important scenario: recognizing that ConditionalSelect, despite taking a vector in IR, has special support to be lowered/contained such that the mask can be consumed directly. The same is technically also possible for the various bitwise operations, but those are less important to handle initially.
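As a rough illustration of the trade-off described above, here is a minimal sketch of weighing conversions removed against conversions added when a vector-typed local is retyped as a mask. Everything in it is hypothetical: `UseKind`, `Use`, `NetConversionsSaved`, and `ShouldRetypeAsMask` are illustrative names, the weights are assumed block weights, and treating a ConditionalSelect mask operand as cost-neutral is an assumption, not a statement about how the JIT actually scores it.

```cpp
#include <cassert> // only needed when checking the sketch with assert

// Hypothetical classification of how a SIMD-typed local is touched.
// These are not the JIT's real types or names.
enum class UseKind
{
    StoreOfMaskToVector,  // local = ConvertMaskToVector(mask)
    UseUnderVectorToMask, // ConvertVectorToMask(local)
    CndSelMaskOperand,    // ConditionalSelect(local, a, b)
    OtherVectorUse,       // any use that really needs the vector form
};

struct Use
{
    UseKind kind;
    double  weight; // assumed execution weight of the containing block
};

// Net weighted number of conversions removed if the local becomes a mask.
double NetConversionsSaved(const Use* uses, int count)
{
    double saved = 0.0;
    for (int i = 0; i < count; i++)
    {
        switch (uses[i].kind)
        {
            case UseKind::StoreOfMaskToVector:
            case UseKind::UseUnderVectorToMask:
                // The existing conversion disappears once the local is a mask.
                saved += uses[i].weight;
                break;
            case UseKind::CndSelMaskOperand:
                // Assumption: the mask is consumed directly, so nothing is
                // added and nothing is removed at this use.
                break;
            case UseKind::OtherVectorUse:
                // A new mask-to-vector conversion would be required here.
                saved -= uses[i].weight;
                break;
        }
    }
    return saved;
}

// Retype only when the change removes more weighted conversions than it adds.
bool ShouldRetypeAsMask(const Use* uses, int count)
{
    return NetConversionsSaved(uses, count) > 0.0;
}
```

Under these assumptions, a local that is stored from a mask and then only feeds a ConditionalSelect is worth retyping, while one that also has a heavier plain vector use is not.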

dotnet-policy-service · 2024-11-26T18:21:14Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

jakobbotsch · 2024-11-29T10:20:14Z

src/coreclr/jit/optimizemaskconversions.cpp

+                        // We don't actually have a convert here, but we do have a case where
+                        // the mask is being used in a ConditionalSelect and therefore can be
+                        // consumed directly as a mask. While the IR shows TYP_SIMD, it gets
+                        // handled in lowering as part of the general embedded-mask support.


In other words the conditional select operations support both TYP_SIMD and TYP_MASK for the mask operand, right? With the TYP_SIMD one being a 0/1 in each lane, and the TYP_MASK one being a bit mask.

tannergooding · 2024-11-30

> In other words the conditional select operations support both TYP_SIMD and TYP_MASK for the mask operand, right?

Right.

> With the TYP_SIMD one being a 0/1 in each lane, and the TYP_MASK one being a bit mask.

Rather, TYP_SIMD is Zero or AllBitsSet in each lane, and TYP_MASK is a compressed form: a bitmask with 1 bit per element.

This comes about because SIMD comparisons return Zero/AllBitsSet per lane, so the result can be used with all bitmask operations, not just with conditional select or similar.
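To make the two representations concrete, here is a small scalar model in C++. It is only a sketch: `Vec`, `kLanes`, `CompareEqual`, `VectorToMask`, `MaskToVector`, `SelectWithVector`, and `SelectWithMask` are hypothetical names for illustration, with four 32-bit lanes assumed, and none of them are JIT or hardware intrinsic APIs.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

constexpr int kLanes = 4; // assumed lane count for the sketch
using Vec = std::array<uint32_t, kLanes>;

// Per-lane comparison: each lane becomes AllBitsSet on equality, else Zero.
Vec CompareEqual(const Vec& a, const Vec& b)
{
    Vec r{};
    for (int i = 0; i < kLanes; i++)
        r[i] = (a[i] == b[i]) ? 0xFFFFFFFFu : 0u;
    return r;
}

// Compress the vector form into one bit per element (top bit of each lane).
uint8_t VectorToMask(const Vec& v)
{
    uint8_t mask = 0;
    for (int i = 0; i < kLanes; i++)
        mask |= static_cast<uint8_t>((v[i] >> 31) << i);
    return mask;
}

// Expand one bit per element back into Zero/AllBitsSet lanes.
Vec MaskToVector(uint8_t mask)
{
    Vec v{};
    for (int i = 0; i < kLanes; i++)
        v[i] = ((mask >> i) & 1) ? 0xFFFFFFFFu : 0u;
    return v;
}

// Select driven by the vector form: plain bitwise operations suffice.
Vec SelectWithVector(const Vec& m, const Vec& a, const Vec& b)
{
    Vec r{};
    for (int i = 0; i < kLanes; i++)
        r[i] = (m[i] & a[i]) | (~m[i] & b[i]);
    return r;
}

// Select driven by the compressed form: one bit picks each element.
Vec SelectWithMask(uint8_t m, const Vec& a, const Vec& b)
{
    Vec r{};
    for (int i = 0; i < kLanes; i++)
        r[i] = ((m >> i) & 1) ? a[i] : b[i];
    return r;
}
```

In this model both selects produce the same lanes for a comparison result, which is why a ConditionalSelect that can take the compressed form directly does not need the mask expanded back into a vector first.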

jakobbotsch

LGTM. Also cc @a74nh for awareness

* Allow optimizing mask conversions on x64 as well
* Ensure the right operand is accessed on xarch
* Minimally handle CndSel as part of optimizing mask conversions
* Add some additional comments and clean up the logic a bit
* Apply formatting patch

Allow optimizing mask conversions on x64 as well

329d8e4

dotnet-issue-labeler bot added the area-CodeGen-coreclr label (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) Nov 26, 2024

dotnet-policy-service bot assigned tannergooding Nov 26, 2024

Ensure the right operand is accessed on xarch

1cb24aa

tannergooding force-pushed the msk-lcl-x64 branch from e9f236c to 1cb24aa Compare November 27, 2024 01:55

Minimally handle CndSel as part of optimizing mask conversions

58e20f9

tannergooding force-pushed the msk-lcl-x64 branch from dea61f4 to 58e20f9 Compare November 27, 2024 05:36

build-analysis bot mentioned this pull request Nov 27, 2024

restarted. Azure DevOps can't recover from restarts. dotnet/dnceng#3879

Open

3 tasks

Add some additional comments and clean up the logic a bit

94841e9

tannergooding marked this pull request as ready for review November 27, 2024 17:46

build-analysis bot mentioned this pull request Nov 27, 2024

SIGKILL (OOM?) while running LibraryImportGenerator.Tests w/o actionable log messages or artifacts dotnet/dnceng#2496

Open

3 tasks

Apply formatting patch

578b171

jakobbotsch reviewed Nov 29, 2024


jakobbotsch approved these changes Nov 29, 2024


tannergooding merged commit e7d837d into dotnet:main Nov 30, 2024
107 of 108 checks passed

tannergooding deleted the msk-lcl-x64 branch November 30, 2024 17:49

amanasifkhalid mentioned this pull request Dec 2, 2024

windows/x64: Assertion failed 'unreached' during 'Physical promotion' #110326

Closed

github-actions bot locked and limited conversation to collaborators Dec 31, 2024
