[NNPA] Memory reduction of stickified constant by stickifying at file writing #2917

imaihal · 2024-08-26T01:49:10Z

This PR reduces memory usage for NNPA compilation. Current main branch creates stickified data in ZHighConstPropagationPass and keeps the data until compilation finish. This PR sets original data, not stickified data, in ZHighConstPropagationPass. Then, in the KrnlToLLVMPass, stickfied data is created and stored in the file, and deleted after writing into the file.

Signed-off-by: Haruki Imai <[email protected]>

imaihal · 2024-09-11T09:02:10Z

@jenkins-droid test this please.

Signed-off-by: Haruki Imai <[email protected]>

chentong319 · 2024-10-29T14:06:31Z

src/Accelerators/NNPA/Conversion/ZHighToZLow/ZHighToZLow.cpp

-           "The stickified tensor's buffer size and MemRef's size mismatched");
-
-    // Create a KrnlGlobalOp.
-    KrnlGlobalOp constantGlobal =


Keep the previous implementation with KrnlGlobalOp in comment or if false branch, if you do not want to create an option to control the choice. You can define an option '--disable-krnl-constant-to-file' with default value of 'false'.

OK. Is this because we may reuse the previous implementation in the future?

I created directive NNPA_ZHIGH_STICKIFIEDCONST_GEN to keep the original implementation. Currently commented out, but I confirmed it works when enabling this code.

yes, we may reuse or compare the original implementation.

Signed-off-by: Haruki Imai <[email protected]>

imaihal · 2024-10-31T08:47:19Z

Could you review again?

AlexandreEichenberger · 2024-10-31T13:06:51Z

@chentong319 Do you mind reviewing it? You have been very involved with this PR. Thanks

Signed-off-by: Haruki Imai <[email protected]>

chentong319

LGTM!

imaihal · 2024-11-05T08:47:39Z

@jenkins-droid test this please.

jenkins-droid · 2024-11-05T14:49:30Z

Jenkins Linux s390x Build #15951 [push] [NNPA] Memory reduction ... started at 09:49

jenkins-droid · 2024-11-05T14:49:31Z

Jenkins Linux amd64 Build #15948 [push] [NNPA] Memory reduction ... started at 08:49

jenkins-droid · 2024-11-05T14:49:34Z

Jenkins Linux ppc64le Build #14978 [push] [NNPA] Memory reduction ... started at 10:03

jenkins-droid · 2024-11-05T16:09:30Z

Jenkins Linux amd64 Build #15948 [push] [NNPA] Memory reduction ... passed after 1 hr 20 min

jenkins-droid · 2024-11-05T16:14:09Z

Jenkins Linux s390x Build #15951 [push] [NNPA] Memory reduction ... passed after 1 hr 24 min

jenkins-droid · 2024-11-05T17:08:11Z

Jenkins Linux ppc64le Build #14978 [push] [NNPA] Memory reduction ... passed after 2 hr 18 min

… at file writing (onnx#2917)" This reverts commit 33b466e. Signed-off-by: Tung D. Le <[email protected]>

… at file writing (onnx#2917)" This reverts commit 33b466e.

… at file writing (onnx#2917)" This reverts commit 33b466e. Signed-off-by: Tung D. Le <[email protected]>

imaihal added 9 commits August 4, 2024 23:48

Add and use ConstantOpInterface for KrnlGlobalOps.

a589aa2

Signed-off-by: Haruki Imai <[email protected]>

Add and use ConstantOpInterface in lowering of KrnlGlobalOp to LLVMIR.

76e9dcb

Signed-off-by: Haruki Imai <[email protected]>

Initial implementation for NNPA.

84c13c0

Signed-off-by: Haruki Imai <[email protected]>

Update to handle stickifiedConstantOp initialized with zero

42cb5a1

Signed-off-by: Haruki Imai <[email protected]>

Update to free memory correctly

5ac7b1f

Signed-off-by: Haruki Imai <[email protected]>

Clean up

e8b935d

Signed-off-by: Haruki Imai <[email protected]>

Clean up

82ada4e

Signed-off-by: Haruki Imai <[email protected]>

Merge branch 'main' into mem_reduction_stickified

5dff7cf

Signed-off-by: Haruki Imai <[email protected]>

format

9c5dd88

Signed-off-by: Haruki Imai <[email protected]>

imaihal changed the title ~~[NNPA] Memory reduction by running stickification at file wrting~~ [NNPA] Memory reduction of stickified constant by stickifying at file writing Aug 26, 2024

imaihal added 16 commits August 26, 2024 03:31

Merge branch 'main' into mem_reduction_stickified

2f23ff8

Fix for lstm and gru.

7a5fb6d

Signed-off-by: Haruki Imai <[email protected]>

Merge branch 'main' into mem_reduction_stickified

fd82e47

Fix the case totalsize is less than or equal to totalThreshold.

748517f

Signed-off-by: Haruki Imai <[email protected]>

Merge branch 'main' into mem_reduction_stickified

4cb46dd

Fix the case without setting --store-constants-to-file option.

434272a

Signed-off-by: Haruki Imai <[email protected]>

Fix lit tests.

18b9919

Signed-off-by: Haruki Imai <[email protected]>

Fix getBuffersize() for CategoryMapperOp

b99a334

Signed-off-by: Haruki Imai <[email protected]>

Update attributes in ZHigh/ZLowStickfiedConstantOp.

16773e7

Signed-off-by: Haruki Imai <[email protected]>

Use stickified attribute and zeroconst attribute

9e236c9

Signed-off-by: Haruki Imai <[email protected]>

Attribute name change: zeroconst to allzero

56bc50d

Signed-off-by: Haruki Imai <[email protected]>

Update lit tests.

4f08bef

Signed-off-by: Haruki Imai <[email protected]>

clean up.

67d0f20

Signed-off-by: Haruki Imai <[email protected]>

Add an option.

dbf4c82

Signed-off-by: Haruki Imai <[email protected]>

Merge branch 'main' into mem_reduction_stickified

208020b

Signed-off-by: Haruki Imai <[email protected]>

The option is true by default.

ac37742

Signed-off-by: Haruki Imai <[email protected]>

imaihal added 3 commits September 11, 2024 21:41

Set the option false by default for testing.

ad06734

Signed-off-by: Haruki Imai <[email protected]>

Revert lit-tests for testing.

5dcc2f7

Signed-off-by: Haruki Imai <[email protected]>

Set false in store-constants-to-file for testing.

d18539e

Signed-off-by: Haruki Imai <[email protected]>

Remove #pragma

b5415c3

Signed-off-by: Haruki Imai <[email protected]>

chentong319 reviewed Oct 29, 2024

View reviewed changes

imaihal added 3 commits October 29, 2024 23:48

Set stickified attr as mandatory attr.

f0b92f0

Signed-off-by: Haruki Imai <[email protected]>

Update descriptions for the OpInterface.

5ee9e77

Signed-off-by: Haruki Imai <[email protected]>

Keep original implementation

3d167da

Signed-off-by: Haruki Imai <[email protected]>

imaihal requested review from chentong319, AlexandreEichenberger and tungld October 31, 2024 08:47

imaihal added 2 commits October 31, 2024 23:55

clean up

8192993

Signed-off-by: Haruki Imai <[email protected]>

Merge branch 'main' into mem_reduction_stickified

5147329

chentong319 approved these changes Nov 4, 2024

View reviewed changes

imaihal added 2 commits November 4, 2024 21:16

Merge branch 'main' into mem_reduction_stickified

40ef9a2

Merge branch 'main' into mem_reduction_stickified

51e5e3d

chentong319 approved these changes Nov 5, 2024 •

edited

Loading

View reviewed changes

imaihal merged commit 33b466e into onnx:main Nov 5, 2024
7 checks passed

imaihal deleted the mem_reduction_stickified branch November 5, 2024 14:48

tungld added a commit to tungld/onnx-mlir that referenced this pull request Nov 13, 2024

Revert "[NNPA] Memory reduction of stickified constant by stickifying…

f5a25af

… at file writing (onnx#2917)" This reverts commit 33b466e. Signed-off-by: Tung D. Le <[email protected]>

tungld mentioned this pull request Nov 14, 2024

[Testing] Use DisposableElementsAttr in doing constant propagation for zTensor #3009

Closed

tungld added a commit to tungld/onnx-mlir that referenced this pull request Nov 18, 2024

Revert "[NNPA] Memory reduction of stickified constant by stickifying…

89122a4

… at file writing (onnx#2917)" This reverts commit 33b466e.

tungld mentioned this pull request Nov 18, 2024

Use DisposableElementsAttr for ZHigh constant propagation #3013

Open

tungld added a commit to tungld/onnx-mlir that referenced this pull request Nov 18, 2024

Revert "[NNPA] Memory reduction of stickified constant by stickifying…

c99e50d

… at file writing (onnx#2917)" This reverts commit 33b466e. Signed-off-by: Tung D. Le <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NNPA] Memory reduction of stickified constant by stickifying at file writing #2917

[NNPA] Memory reduction of stickified constant by stickifying at file writing #2917

imaihal commented Aug 26, 2024 •

edited

Loading

imaihal commented Sep 11, 2024

chentong319 Oct 29, 2024 •

edited

Loading

imaihal Oct 31, 2024

imaihal Oct 31, 2024

chentong319 Nov 4, 2024

imaihal commented Oct 31, 2024

AlexandreEichenberger commented Oct 31, 2024

chentong319 left a comment

imaihal commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

[NNPA] Memory reduction of stickified constant by stickifying at file writing #2917

[NNPA] Memory reduction of stickified constant by stickifying at file writing #2917

Conversation

imaihal commented Aug 26, 2024 • edited Loading

imaihal commented Sep 11, 2024

chentong319 Oct 29, 2024 • edited Loading

Choose a reason for hiding this comment

imaihal Oct 31, 2024

Choose a reason for hiding this comment

imaihal Oct 31, 2024

Choose a reason for hiding this comment

chentong319 Nov 4, 2024

Choose a reason for hiding this comment

imaihal commented Oct 31, 2024

AlexandreEichenberger commented Oct 31, 2024

chentong319 left a comment

Choose a reason for hiding this comment

imaihal commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

jenkins-droid commented Nov 5, 2024

imaihal commented Aug 26, 2024 •

edited

Loading

chentong319 Oct 29, 2024 •

edited

Loading