Fix Vulkan Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 #8855

0cc4m · 2024-08-04T15:57:30Z

This fixes a Vulkan quantized matrix vector multiplication test failure on AMD GPUs (warp size 64) that occurs when there are not enough blocks to fill the warp. The failure was caught by the tests added in #8800, but I noticed that for k-quants those tests run the same test twice, so I added a check for whether the new test is actually required. Let me know if that's okay.
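For readers unfamiliar with this class of bug, here is a minimal CPU-side sketch of one way a reduction can go wrong when there are fewer columns than lanes. It is an illustration under assumptions, not the actual shader code or the actual fix in this PR: the function names `dot_truncating` and `dot_guarded` and the parameter `wg_size` are hypothetical, and the truncating division shown is only one plausible failure mode.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CPU model of a workgroup dot product (not the real shader).
// Each of wg_size "lanes" accumulates a partial sum; the partials are then summed.

// Variant A: the iteration count comes from a truncating division.
// When ncols < wg_size the count is 0, so no column is ever read.
float dot_truncating(const std::vector<float>& a, const std::vector<float>& b,
                     std::size_t wg_size) {
    const std::size_t ncols = a.size();
    std::vector<float> partial(wg_size, 0.0f);
    const std::size_t iters = ncols / wg_size;  // 0 when ncols < wg_size
    for (std::size_t tid = 0; tid < wg_size; ++tid) {
        for (std::size_t i = 0; i < iters; ++i) {
            const std::size_t col = i * wg_size + tid;
            partial[tid] += a[col] * b[col];
        }
    }
    float sum = 0.0f;
    for (float p : partial) sum += p;
    return sum;
}

// Variant B: a strided loop with a bounds check.
// Lanes without a valid column keep a zero partial and do not affect the result.
float dot_guarded(const std::vector<float>& a, const std::vector<float>& b,
                  std::size_t wg_size) {
    const std::size_t ncols = a.size();
    std::vector<float> partial(wg_size, 0.0f);
    for (std::size_t tid = 0; tid < wg_size; ++tid) {
        for (std::size_t col = tid; col < ncols; col += wg_size) {
            partial[tid] += a[col] * b[col];
        }
    }
    float sum = 0.0f;
    for (float p : partial) sum += p;
    return sum;
}
```

With 32 columns and a simulated width of 64, variant A silently returns zero while variant B returns the correct dot product; with a width of 32 both agree. That matches the symptom reported here (correct on narrower warps, wrong on warp size 64), though the real cause in the shader may differ.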

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High


…ov#8855)

* Fix Vulkan mul mat vec invalid results when ncols < warp size
* Only run backend ops mul mat vec block size test if block size not already covered

0cc4m added 2 commits August 4, 2024 17:44

Fix Vulkan mul mat vec invalid results when ncols < warp size

ecabd54

Only run backend ops mul mat vec block size test if block size not already covered

6c75cb9
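The idea behind this commit can be sketched as follows. This is a self-contained illustration, not the code in the test suite: `QuantType` and `k_values_for` are hypothetical names, and the sketch assumes the pre-existing test uses k = 256, which would explain why k-quants (block size 256) ended up running the same case twice.

```cpp
#include <vector>

// Hypothetical stand-in for a quantization type; only the block size matters here.
struct QuantType {
    const char* name;
    int block_size;
};

// Returns the k values to test for one type.
// The block-size case is added only if it differs from the default k
// (assumed to be 256), so no case is generated twice.
std::vector<int> k_values_for(const QuantType& t, int default_k = 256) {
    std::vector<int> ks;
    if (t.block_size != default_k) {
        ks.push_back(t.block_size);  // extra case: exactly one block per row
    }
    ks.push_back(default_k);         // the case that was already covered
    return ks;
}
```

Under these assumptions, a type with block size 32 gets two cases (k = 32 and k = 256), while a k-quant with block size 256 gets only the single k = 256 case.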

github-actions bot added the testing Everything test related label Aug 4, 2024

0cc4m changed the title ~~0cc4m/vulkan fix mmv tests~~ Fix Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 Aug 4, 2024

JohannesGaessler added the Vulkan Issues specific to the Vulkan backend label Aug 4, 2024

0cc4m changed the title ~~Fix Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64~~ Fix Vulkan Quantized Matrix Vector Multiplication on AMD GPUs when ncols < 64 Aug 5, 2024

ggerganov approved these changes Aug 5, 2024


ggerganov merged commit 064cdc2 into master Aug 5, 2024
54 checks passed

0cc4m deleted the 0cc4m/vulkan-fix-mmv-tests branch August 5, 2024 06:03
