[RFC] Multithread packing #7545

Open · wants to merge 7 commits into master from multithread_packing

Conversation

mcr229 (Contributor) commented Dec 2, 2024

Enable multithreaded packing routines in xnn_create. This allows us to multithread our packing routines at initialization, which can improve first-time model load performance.
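For illustration, here is a minimal sketch of the general idea, not the PR's actual code: packing is split over tiles of output channels and dispatched through pthreadpool, which XNNPACK already uses at inference time. The pack_context struct and hypothetical_pack_tile routine are assumptions for the example, not real XNNPACK symbols.

#include <stddef.h>
#include <pthreadpool.h>

struct pack_context {
  const void* weights;   // unpacked weights, nc x kc
  void* packed_weights;  // output buffer, tiled by nr output channels
  size_t nc;             // number of output channels
  size_t kc;             // number of input channels
  size_t nr;             // output-channel tile size of the GEMM microkernel
  size_t packed_stride;  // bytes per packed tile of nr channels
};

// Hypothetical per-tile packer: each call packs one independent tile of
// nr output channels, so tiles can be packed concurrently.
static void hypothetical_pack_tile(void* context, size_t tile_index) {
  struct pack_context* ctx = (struct pack_context*) context;
  const size_t n_start = tile_index * ctx->nr;
  // ... pack channels [n_start, min(n_start + nr, nc)) into
  // (char*) ctx->packed_weights + tile_index * ctx->packed_stride ...
  (void) n_start;
}

void pack_weights_multithreaded(struct pack_context* ctx,
                                pthreadpool_t threadpool) {
  const size_t num_tiles = (ctx->nc + ctx->nr - 1) / ctx->nr;  // round up
  // Runs hypothetical_pack_tile(ctx, i) for each i in [0, num_tiles);
  // a NULL threadpool runs everything on the calling thread.
  pthreadpool_parallelize_1d(threadpool, hypothetical_pack_tile, ctx,
                             num_tiles, /*flags=*/0);
}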

mcr229 (Contributor, Author) commented Dec 2, 2024

@alankelly @dsharlet @fbarchard Putting up an RFC for multithreaded packing at subgraph create. Please take a look at the commits labeled [MultiThreaded Packing][RFC].

mcr229 changed the title from Multithread packing to [RFC] Multithread packing on Dec 2, 2024
mcr229 (Contributor, Author) commented Dec 2, 2024

We see roughly a 4.5x performance win on first-time model load.

@@ -0,0 +1,9 @@

// Copyright 2024 Google LLC
Contributor commented:
😅

Comment on lines +27 to +28
size_t extra_bytes_bl,
size_t extra_bytes_n,
Contributor commented:

why do we need these from the API point of view?

src/packw.c Outdated
/*extra_bytes=*/context->extra_bytes_bl,
/*extra_bytes_n=*/context->extra_bytes_n,
/*params=*/context->params);

Collaborator commented:

rm empty line

const uint8_t* w0 = (const uint8_t*) weights;
const uint16_t* s0 = (const uint16_t*) scale;
size_t n = nc;
for (;n >= ${NR}; n -= ${NR}) {
Contributor commented:

indent 4


// KC/2 bytes is KC Nibbles
$for N in range(1, NR):
const uint8_t* w${N} = w${N-1} + (kc >> 1);
Contributor commented:

This kernel does not support odd KC? How do we ensure convolutions don't call this when kc is odd?

For the x4-pack scalar kernel, it still supports any KC, inefficiently; then from avxvnni I check if KC is odd and call the scalar kernel to handle it.
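A minimal sketch of that dispatch pattern, with assumed kernel names for illustration (in the real code the selection may live inside the kernels themselves):

#include <stddef.h>
#include <stdint.h>

// Hypothetical packer signatures for the example.
void pack_x4_avxvnni(size_t nc, size_t kc, const uint8_t* w, uint8_t* out);
void pack_x4_scalar(size_t nc, size_t kc, const uint8_t* w, uint8_t* out);

void pack_x4(size_t nc, size_t kc, const uint8_t* w, uint8_t* out) {
  if (kc & 1) {
    // Odd kc: fall back to the scalar kernel, which handles any kc,
    // just less efficiently.
    pack_x4_scalar(nc, kc, w, out);
  } else {
    pack_x4_avxvnni(nc, kc, w, out);
  }
}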

mcr229 (Contributor, Author) replied:

This kernel isn't used for convolutions, only for fully connected layers.

A constraint applied to blockwise quantization is that kc must be divisible by the block size, and the block size is a multiple of 32; since that makes kc always even, the kc >> 1 nibble stride is exact.

Contributor replied:

Code pointer? Let's make sure these are not just asserts and that we fail with a proper error message, if we're not doing that already.
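As a sketch of what such a check could look like at operator creation, using XNNPACK's xnn_log_error and xnn_status conventions (the surrounding context, variable names, and message text are assumptions):

if (block_size % 32 != 0) {
  xnn_log_error(
    "failed to create fully connected operator: block size (%zu) must be a multiple of 32",
    block_size);
  return xnn_status_invalid_parameter;
}
if (kc % block_size != 0) {
  xnn_log_error(
    "failed to create fully connected operator: input channels (%zu) must be divisible by block size (%zu)",
    kc, block_size);
  return xnn_status_invalid_parameter;
}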

mcr229 force-pushed the multithread_packing branch from bbb8982 to eb8384b on December 9, 2024 at 17:44
mcr229 force-pushed the multithread_packing branch from eb8384b to 30bd2ad on December 9, 2024 at 17:51
fbarchard (Contributor) left a comment:

I'd prefer the scalar kernel be in its own PR, but the kernel part of this looks OK.
