ggml : do not crash when quantizing q4_x_x with an imatrix #9192

slaren · 2024-08-26T14:58:59Z

bartowski1182 · 2024-08-26T17:09:41Z

to confirm, does this just makes passing imatrix a no-op? or will it actually be useful in other parts of this quant schema that don't specifically use q4_0_x_x (and therefore still worth passing/using, just not as useful)

slaren · 2024-08-26T17:44:34Z

I don't actually know what types are used in this quant schema, but if there are other types, they will still be able to use the imatrix, it will only be the q4_x_x (and q8_0) tensors that will not use the imatrix.

…#9192)

ggml : do not crash when quantizing q4_x_x with an imatrix

e279ce0

ggerganov approved these changes Aug 26, 2024

View reviewed changes

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 26, 2024

slaren merged commit 7d787ed into master Aug 26, 2024
53 checks passed

slaren deleted the sl/fix-q4xx-imatrix branch August 26, 2024 17:44

ThomasBaruzier mentioned this pull request Aug 26, 2024

Bug: Coredump when quanting to Q4_0_*_* with imatrix #8767

Closed

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

ggml : do not crash when quantizing q4_x_x with an imatrix (ggerganov…

5975ac4

…#9192)

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

ggml : do not crash when quantizing q4_x_x with an imatrix (ggerganov…

b114805

…#9192)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : do not crash when quantizing q4_x_x with an imatrix #9192

ggml : do not crash when quantizing q4_x_x with an imatrix #9192

slaren commented Aug 26, 2024

bartowski1182 commented Aug 26, 2024

slaren commented Aug 26, 2024

ggml : do not crash when quantizing q4_x_x with an imatrix #9192

ggml : do not crash when quantizing q4_x_x with an imatrix #9192

Conversation

slaren commented Aug 26, 2024

bartowski1182 commented Aug 26, 2024

slaren commented Aug 26, 2024