ggml : fix odd blocks for ARM_NEON #8556

ggerganov · 2024-07-18T07:55:36Z

target #8549

Same fix as ggml : fix iq4_nl dot product with odd number of blocks #8549 for Q4_0, Q4_1, Q5_0, Q5_1 and Q8_0
Fix IQ4_NL Metal kernel handling odd ne01
Remove Q4_0 SSE3 and Longarch special handling of first 2 blocks

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

ggml-ci

ggml/src/ggml-quants.c

ggml-ci

ggml/src/ggml-quants.c

ggerganov · 2024-07-19T11:03:12Z

Thanks, will merge after #8549 is merged

slaren · 2024-07-19T13:56:15Z

@ggerganov the target of this PR is #8549, so I believe this would need to be merged first. Do you intend to create a new PR targeting master afterwards?

ggerganov · 2024-07-19T13:59:08Z

Correct, it's better to merge this PR into #8549 first - will update now

* ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix odd blocks for ARM_NEON (#8556) * ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix q4_1 * ggml : fix q5_0 * ggml : fix q5_1 * ggml : fix iq4_nl metal ggml-ci * ggml : fix q4_0 * ggml : fix q8_0 ggml-ci * ggml : remove special Q4_0 code for first 2 blocks * ggml : fix sumf redefinition --------- Co-authored-by: slaren <[email protected]> --------- Co-authored-by: Georgi Gerganov <[email protected]>

* ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix odd blocks for ARM_NEON (ggerganov#8556) * ggml : fix iq4_nl dot product with odd number of blocks * ggml : fix q4_1 * ggml : fix q5_0 * ggml : fix q5_1 * ggml : fix iq4_nl metal ggml-ci * ggml : fix q4_0 * ggml : fix q8_0 ggml-ci * ggml : remove special Q4_0 code for first 2 blocks * ggml : fix sumf redefinition --------- Co-authored-by: slaren <[email protected]> --------- Co-authored-by: Georgi Gerganov <[email protected]>

slaren and others added 5 commits July 18, 2024 02:54

ggml : fix iq4_nl dot product with odd number of blocks

90e8f81

ggml : fix q4_1

e5e7a24

ggml : fix q5_0

67b079f

ggml : fix q5_1

f6f2ff9

ggml : fix iq4_nl metal

3f68842

ggml-ci

ggerganov changed the title ~~ggml : fix q4_1~~ ggml : fix odd blocks for ARM_NEON Jul 18, 2024

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Jul 18, 2024

JohannesGaessler reviewed Jul 18, 2024

View reviewed changes

ggml/src/ggml-quants.c Show resolved Hide resolved

ggml/src/ggml-quants.c Show resolved Hide resolved

ggerganov added 3 commits July 18, 2024 14:59

ggml : fix q4_0

79b95e3

ggml : fix q8_0

62a3185

ggml-ci

ggml : remove special Q4_0 code for first 2 blocks

974410a

ggerganov force-pushed the gg/fix-odd-blocks-arm branch from 3a515b8 to 974410a Compare July 18, 2024 12:00

slaren force-pushed the sl/fix-iqnl-odd-blocks branch from 90e8f81 to cc6a0f5 Compare July 18, 2024 20:11

slaren reviewed Jul 18, 2024

View reviewed changes

ggml/src/ggml-quants.c Outdated Show resolved Hide resolved

ggml : fix sumf redefinition

32e9c41

github-actions bot added the testing Everything test related label Jul 19, 2024

mofosyne added the Review Complexity : High Generally require indepth knowledge of LLMs or GPUs label Jul 19, 2024

Merge branch 'sl/fix-iqnl-odd-blocks' into gg/fix-odd-blocks-arm

5ff83bd

ggerganov requested a review from slaren July 19, 2024 14:01

slaren approved these changes Jul 19, 2024

View reviewed changes

ggerganov merged commit 8cc26be into sl/fix-iqnl-odd-blocks Jul 19, 2024
47 of 52 checks passed

ggerganov deleted the gg/fix-odd-blocks-arm branch July 19, 2024 14:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : fix odd blocks for ARM_NEON #8556

ggml : fix odd blocks for ARM_NEON #8556

ggerganov commented Jul 18, 2024 •

edited

Loading

ggerganov commented Jul 19, 2024

slaren commented Jul 19, 2024

ggerganov commented Jul 19, 2024 •

edited

Loading

ggml : fix odd blocks for ARM_NEON #8556

ggml : fix odd blocks for ARM_NEON #8556

Conversation

ggerganov commented Jul 18, 2024 • edited Loading

ggerganov commented Jul 19, 2024

slaren commented Jul 19, 2024

ggerganov commented Jul 19, 2024 • edited Loading

ggerganov commented Jul 18, 2024 •

edited

Loading

ggerganov commented Jul 19, 2024 •

edited

Loading