Xnn s32 or #6823

umadevimcw · 2024-08-08T09:38:16Z

Bitwise OR implementation

fbarchard · 2024-08-09T02:53:19Z

src/amalgam/gen/avx512f.c

+    xnn_storeu_s32(output + 1 * xnn_simd_size_s32, vy_1);
+    output += 32;
+  }
+  for (; batch >= xnn_simd_bytes_s32; batch -= xnn_simd_bytes_s32) {


you use batch -= 32 * sizeof(int32_t)) { for the main loop. which is the correct amount for avx512
should this one be batch -= 16 * sizeof(int32_t)) {

@fbarchard vin1_0, vin1_1 two set of inputs are processed here, like loop unrolling of 2 so its 32

vin1 is from input_a and is 16 ints
vin2 is from input_b and is 16 ints
should the loop be doing:
for (; batch >= 16 * sizeof(int32_t); batch -= 16 * sizeof(int32_t)) {

fbarchard · 2024-08-09T02:55:33Z

src/amalgam/gen/avx512f.c

+    output += xnn_simd_size_s32;
+  }
+  if XNN_UNLIKELY(batch != 0) {
+    xnn_simd_s32_t vin1 = xnn_load_tail_s32(input_a, batch >> XNN_LOG2_SIZEOF_INT32_T);


normally for native we'd create a mask based on the remainder and use it for all the instructions.
the loop here would be the same as the previous loop, but with a mask
its also prudent to put an assert on the expected batch size to ensure we dont accidently have too much or too little for the remainder masking to work

umadevimcw · 2024-08-12T05:51:01Z

OR op is part of #6836. Hence closing it

umadevimcw added 6 commits August 8, 2024 12:49

Add wrapper instructions for OR op

4f96ac2

Add microkernel ops for OR

7b48f02

Add or s32 op

9cbcbbc

Add OR op subgraph

f6cc4db

Add or node and fix compilation issue

52c84e5

Fix avx512 mismatch issue

446136e

umadevimcw force-pushed the xnn_s32_or branch from e51a91c to 446136e Compare August 8, 2024 13:06

fbarchard reviewed Aug 9, 2024

View reviewed changes

umadevimcw closed this Aug 12, 2024

umadevimcw deleted the xnn_s32_or branch August 12, 2024 06:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xnn s32 or #6823

Xnn s32 or #6823

umadevimcw commented Aug 8, 2024

fbarchard Aug 9, 2024

umadevimcw Aug 9, 2024

fbarchard Aug 9, 2024

fbarchard Aug 9, 2024

umadevimcw commented Aug 12, 2024

Xnn s32 or #6823

Xnn s32 or #6823

Conversation

umadevimcw commented Aug 8, 2024

fbarchard Aug 9, 2024

Choose a reason for hiding this comment

umadevimcw Aug 9, 2024

Choose a reason for hiding this comment

fbarchard Aug 9, 2024

Choose a reason for hiding this comment

fbarchard Aug 9, 2024

Choose a reason for hiding this comment

umadevimcw commented Aug 12, 2024