
ConvInteger: fix parsing for x_zero_point and w_zero_point #3763

Merged — 24 commits merged into develop from parse_qconv_bias_fix on Jan 21, 2025

Conversation

kahmed10
Collaborator

@kahmed10 kahmed10 commented Jan 16, 2025

  • The previous test did not properly expose the bug when parsing the ConvInteger found in SD3. Updated the parser and the unit test accordingly.
  • I also renamed some functions and types for clarity.
  • I also needed to add an exception for layout in find_inner_broadcast. The unit test otherwise would not run: the zero points were already broadcast, and applying a layout with a 4d permutation to a 1d tensor would not work.
  • Feel free to suggest better variable names.
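To illustrate the find_inner_broadcast layout exception above, here is a minimal Python sketch (the `permute_shape` helper is hypothetical, not MIGraphX code): a layout pass reorders dimensions with a permutation, so the permutation rank must match the tensor rank, which a broadcast 1d zero point cannot satisfy against a 4d permutation.

```python
# Hypothetical sketch (not MIGraphX code): a layout pass applies a
# permutation with one entry per dimension, so the permutation rank
# must equal the tensor rank. A 1d zero-point tensor therefore cannot
# take a 4d permutation.
def permute_shape(shape, perm):
    if len(shape) != len(perm):
        raise ValueError("permutation rank does not match tensor rank")
    return [shape[p] for p in perm]

print(permute_shape([1, 3, 5, 5], [0, 2, 3, 1]))  # [1, 5, 5, 3]
# permute_shape([3], [0, 2, 3, 1]) raises ValueError: a 1d tensor has no
# dimensions 1..3 to reorder.
```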

I noticed allocation segment sizes approaching the uint64 max value, which would clearly throw an out-of-memory error when trying to allocate on the GPU. This happened when MIGRAPHX_NSTREAMS was set to 2 or greater and the model was large enough to trigger it.

Changing from `auto` to `size_t` seems to fix the issue.
@kahmed10 kahmed10 requested a review from causten as a code owner January 16, 2025 03:38
@kahmed10 kahmed10 requested review from TedThemistokleous and lakhinderwalia and removed request for causten January 16, 2025 03:38
@kahmed10 kahmed10 self-assigned this Jan 16, 2025
@kahmed10 kahmed10 added bugfix Fixes a bug found in the code. simple small or simple changes labels Jan 16, 2025
@kahmed10 kahmed10 force-pushed the parse_qconv_bias_fix branch from ab9d13f to 276d9d4 Compare January 16, 2025 13:10
@kahmed10
Collaborator Author

Some further details on the bug:
The previous logic after checking whether x_zp and w_zp were symmetric was faulty. Because add_common_op already performs an implicit multibroadcast, no errors were thrown, and the shapes of the old test inputs just happened to satisfy the broadcasting rules.

Previous input shapes:

x -> [1,3,5,5]
w -> [1,3,2,2]
conv(x,w) -> [1,1,3,3]

The previous logic would then look at x_zp and find it's not symmetric.
It would then use add_common_op to compute conv(x_zp,w), which would multibroadcast x_zp to shape [1,3,2,2].
The resulting shape of conv(x_zp,w) would be [1,1,1,1].
ret=conv(x,w)-conv(x_zp,w) -> [1,1,3,3] - [1,1,1,1] -> [1,1,3,3] - [1,1,3,3] technically works.
Similarly, using add_common_op for conv(x,w_zp) would multibroadcast w_zp to shape [1,3,5,5].
The resulting shape of conv(x,w_zp) would also be [1,1,1,1].
ret=ret-conv(x,w_zp) -> [1,1,3,3] - [1,1,1,1] -> [1,1,3,3] - [1,1,3,3] also technically works.
Then the final add_common_op for adding conv(x_zp,w_zp) would also technically work:
ret=ret+conv(x_zp,w_zp) -> [1,1,3,3] + [1,1,1,1] -> [1,1,3,3] + [1,1,3,3]

But now if you have the new input shapes:

x_new -> [2,3,10,10]
w_new -> [4,3,3,3]
conv(x_new,w_new) -> [2,4,8,8]

Now if you tried to use add_common_op for conv(x_zp,w_new), you'd get the following:

ret=conv(x_new,w_new)-conv(x_zp,w_new) -> [2,4,8,8] - [4,4,1,1] -> incompatible shapes

TL;DR: The previous logic of using add_common_op was wrong. The old test case just happened to produce broadcastable shapes, so the bad test case never uncovered the bug in the logic.
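The shape walkthrough above can be reproduced with a short sketch of the multidirectional (NumPy-style) broadcasting rule that ONNX follows; `broadcast_shapes` here is an illustrative helper, not MIGraphX's actual add_common_op implementation:

```python
# Illustrative sketch of multidirectional (NumPy-style) broadcasting,
# which ONNX follows; not MIGraphX's add_common_op implementation.
def broadcast_shapes(a, b):
    """Return the broadcast shape of a and b, or None if incompatible."""
    out = []
    # Walk both shapes from the trailing dimension, padding the shorter with 1s.
    for i in range(1, max(len(a), len(b)) + 1):
        da = a[-i] if i <= len(a) else 1
        db = b[-i] if i <= len(b) else 1
        if da != db and da != 1 and db != 1:
            return None  # dimensions neither equal nor broadcastable
        out.append(max(da, db))
    return out[::-1]

# Old test shapes: the implicit multibroadcast happened to succeed.
print(broadcast_shapes([1, 1, 3, 3], [1, 1, 1, 1]))  # [1, 1, 3, 3]
# New SD3-like shapes: fails on the leading dimension (2 vs 4).
print(broadcast_shapes([2, 4, 8, 8], [4, 4, 1, 1]))  # None
```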


// multibroadcast (or broadcast) zero points according to spec
// x_zp should be a scalar or literal with one element
// w_zp can be either a single element or a 1d tensor with size out_channels
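The rules stated in the comment above could be checked along these lines; this is a hedged Python sketch with an illustrative `check_zero_point_shapes` helper (the actual parser is C++ and may structure this differently):

```python
# Hedged sketch of the zero-point shape rules from the comment above
# (per the ONNX ConvInteger spec): x_zero_point must hold one element;
# w_zero_point may hold one element or be 1d with size out_channels.
# Names are illustrative, not from the MIGraphX parser.
def check_zero_point_shapes(x_zp_shape, w_zp_shape, out_channels):
    # x_zp: scalar (rank 0) or any shape with a single element
    x_ok = len(x_zp_shape) == 0 or all(d == 1 for d in x_zp_shape)
    # w_zp: single element, or 1d with one entry per output channel
    w_ok = (len(w_zp_shape) == 0
            or all(d == 1 for d in w_zp_shape)
            or (len(w_zp_shape) == 1 and w_zp_shape[0] == out_channels))
    return x_ok and w_ok

print(check_zero_point_shapes([], [4], out_channels=4))   # True
print(check_zero_point_shapes([1], [3], out_channels=4))  # False
```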
Comment: Good to add the comment here!

@lakhinderwalia lakhinderwalia left a comment


Thanks for adding the general description; it helps a lot. Thanks also for modifying the test cases. Approved.

@migraphx-bot

| Test | Batch | Rate new (1ce1f9) | Rate old (150105) | Diff |
|---|---|---|---|---|
| torchvision-resnet50 | 64 | 3,253.57 | 3,254.33 | -0.02% |
| torchvision-resnet50_fp16 | 64 | 6,927.88 | 6,927.70 | 0.00% |
| torchvision-densenet121 | 32 | 2,454.74 | 2,454.03 | 0.03% |
| torchvision-densenet121_fp16 | 32 | 4,182.86 | 4,166.00 | 0.40% |
| torchvision-inceptionv3 | 32 | 1,630.27 | 1,630.12 | 0.01% |
| torchvision-inceptionv3_fp16 | 32 | 2,714.81 | 2,716.54 | -0.06% |
| cadene-inceptionv4 | 16 | 763.30 | 763.13 | 0.02% |
| cadene-resnext64x4 | 16 | 813.38 | 813.13 | 0.03% |
| slim-mobilenet | 64 | 7,463.06 | 7,460.93 | 0.03% |
| slim-nasnetalarge | 64 | 208.67 | 208.67 | -0.00% |
| slim-resnet50v2 | 64 | 3,444.86 | 3,446.18 | -0.04% |
| bert-mrpc-onnx | 8 | 1,148.90 | 1,146.26 | 0.23% |
| bert-mrpc-tf | 1 | 489.20 | 481.46 | 1.61% |
| pytorch-examples-wlang-gru | 1 | 484.53 | 470.60 | 2.96% |
| pytorch-examples-wlang-lstm | 1 | 454.03 | 443.44 | 2.39% |
| torchvision-resnet50_1 | 1 | 807.66 | 804.59 | 0.38% |
| cadene-dpn92_1 | 1 | 429.19 | 430.86 | -0.39% |
| cadene-resnext101_1 | 1 | 385.86 | 386.12 | -0.07% |
| onnx-taau-downsample | 1 | 373.45 | 372.92 | 0.14% |
| dlrm-criteoterabyte | 1 | 33.31 | 33.33 | -0.07% |
| dlrm-criteoterabyte_fp16 | 1 | 52.36 | 52.63 | -0.51% |
| agentmodel | 1 | 8,700.12 | 8,589.15 | 1.29% |
| unet_fp16 | 2 | 58.56 | 58.37 | 0.33% |
| resnet50v1_fp16 | 1 | 1,028.12 | 1,036.76 | -0.83% |
| resnet50v1_int8 | 1 | 1,022.10 | 1,039.07 | -1.63% |
| bert_base_cased_fp16 | 64 | 1,180.85 | 1,182.01 | -0.10% |
| bert_large_uncased_fp16 | 32 | 365.23 | 365.27 | -0.01% |
| bert_large_fp16 | 1 | 201.65 | 202.67 | -0.50% |
| distilgpt2_fp16 | 16 | 2,227.31 | 2,227.86 | -0.02% |
| yolov5s | 1 | 537.79 | 523.22 | 2.79% |
| tinyllama | 1 | 43.60 | 43.57 | 0.06% |
| vicuna-fastchat | 1 | 157.73 | 173.92 | -9.31% 🔴 |
| whisper-tiny-encoder | 1 | 417.40 | 418.33 | -0.22% |
| whisper-tiny-decoder | 1 | 429.83 | 430.88 | -0.24% |

This build is not recommended to merge 🔴

@migraphx-bot


✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance
✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance
✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance
✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance
✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance
✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance
✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance
✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance
✅ agentmodel: PASSED: MIGraphX meets tolerance
✅ unet: PASSED: MIGraphX meets tolerance
✅ resnet50v1: PASSED: MIGraphX meets tolerance
✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance
🔴 bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output
✅ bert_large: PASSED: MIGraphX meets tolerance
✅ yolov5s: PASSED: MIGraphX meets tolerance
✅ tinyllama: PASSED: MIGraphX meets tolerance
✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance
✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance
✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance
✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance


codecov bot commented Jan 20, 2025

Codecov Report

Attention: Patch coverage is 92.30769% with 1 line in your changes missing coverage. Please review.

Project coverage is 92.28%. Comparing base (976ae75) to head (314b6ba).
Report is 1 commit behind head on develop.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/onnx/parse_convolution.cpp | 90.90% | 1 Missing ⚠️ |
Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #3763   +/-   ##
========================================
  Coverage    92.28%   92.28%           
========================================
  Files          519      519           
  Lines        22222    22227    +5     
========================================
+ Hits         20507    20512    +5     
  Misses        1715     1715           


@kahmed10 kahmed10 merged commit f36eba4 into develop Jan 21, 2025
17 of 21 checks passed
@kahmed10 kahmed10 deleted the parse_qconv_bias_fix branch January 21, 2025 04:21