-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ConvInteger: fix parsing for x_zero_point and w_zero_point #3763
Conversation
I noticed allocation segments reaching close to uint64 max value, which clearly would throw an out of memory error when trying to allocate on the GPU. This happened when MIGRAPHX_NSTREAMS was set to 2 or greater and the model somehow was large enough to trigger it. Changing from `auto` to `size_t` seems to fix the issue.
…m_coloring_fix
…rse_qconv_bias_fix
ab9d13f
to
276d9d4
Compare
Some further details on the bug: Previous input shapes:
The previous logic would then look at x_zp and find it's not symmetric. But now if you have the new input shapes:
Now if you tried to use
TL;DR: The previous logic of using |
|
||
// multibroadcast (or broadcast) zero points according to spec | ||
// x_zp should be a scalar or literal with one element | ||
// w_zp can be either a single element or a 1d tensor with size out_channels |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comment: Good to add the comment here!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding the general description, and it helps a lot. And for modifying the test_cases. Approved.
…rse_qconv_bias_fix
This build is not recommended to merge 🔴 |
🔴bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #3763 +/- ##
========================================
Coverage 92.28% 92.28%
========================================
Files 519 519
Lines 22222 22227 +5
========================================
+ Hits 20507 20512 +5
Misses 1715 1715 ☔ View full report in Codecov by Sentry. |
find_inner_broadcast
. The unit test otherwise would not run because the zero points were broadcasted, and applying layout to a 1d tensor with 4d permutation would not work.