fix: Fixes wrong input type for raw_dtype in ggml to gguf scripts #8928

Merged: 1 commit into ggerganov:master on Aug 16, 2024

Conversation

farbodbj (Contributor) commented Aug 8, 2024

A wrong data type was being passed from add_tensor and add_tensor_info to this function, causing a second exception to be raised while the first one was being raised. So I changed the input types to match the name raw_dtype and converted the value to a GGMLQuantizationType object afterwards, which can also handle invalid arguments itself.
Related issue: #8929
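
For anyone reading along, here is a rough, self-contained sketch of that failure pattern. The enum and function below are simplified stand-ins, not the actual gguf code:

```python
# Simplified stand-ins, not the real gguf code: this only illustrates how a raw
# int where a GGMLQuantizationType member is expected produces a second
# exception while the first one is still being handled.
from enum import IntEnum

class GGMLQuantizationType(IntEnum):  # stand-in for gguf.GGMLQuantizationType
    F32 = 0
    F16 = 1

def add_tensor_info(raw_dtype) -> None:  # hypothetical, heavily simplified
    try:
        GGMLQuantizationType(raw_dtype)  # fails for an unknown value
    except ValueError:
        # Building this message needs .name, which a plain int does not have,
        # so an AttributeError fires while the ValueError is being handled.
        raise ValueError(f"unsupported dtype {raw_dtype.name}")

add_tensor_info(GGMLQuantizationType.F16)  # fine: a real enum member
add_tensor_info(7)  # ValueError, then AttributeError "during handling of the above exception"
```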

@github-actions github-actions bot added the python (python script changes) label Aug 8, 2024
@mofosyne mofosyne added the bugfix and Review Complexity: Low labels Aug 9, 2024
compilade (Collaborator) commented Aug 10, 2024

Thanks for finding this and fixing it. There have been many refactors lately during which the old convert_llama_ggml_to_gguf.py was not tested at all (mostly because I don't have old GGML models around to test with).

But I think the type of the raw_dtype parameter of GGUFWriter.add_tensor and GGUFWriter.add_tensor_info should stay GGMLQuantizationType (because it's easier to figure out how to use that parameter when its type is more specific than int), and that convert_llama_ggml_to_gguf.py should be fixed instead.

Would this work?

```diff
diff --git a/convert_llama_ggml_to_gguf.py b/convert_llama_ggml_to_gguf.py
index 7b00b439..701df869 100755
--- a/convert_llama_ggml_to_gguf.py
+++ b/convert_llama_ggml_to_gguf.py
@@ -116,7 +116,7 @@ class Tensor:
         assert quant is not None, 'Unknown tensor type'
         (blksize, tysize) = quant
         offset += 12
-        self.dtype= dtype
+        self.dtype= gguf.GGMLQuantizationType(dtype)
         self.dims = struct.unpack(f'<{n_dims}I', data[offset:offset + (4 * n_dims)])
         offset += 4 * n_dims
         self.name = bytes(data[offset:offset + name_len])
```
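
If it helps, here is a quick standalone check of what that one-line change does. This is a minimal sketch run against the gguf package from gguf-py (the values are arbitrary):

```python
import gguf  # the gguf-py package this script already depends on

raw = int(gguf.GGMLQuantizationType.F16)   # pretend this int came from struct.unpack
member = gguf.GGMLQuantizationType(raw)    # raw int -> enum member
print(member, member.name)                 # .name now works anywhere downstream

try:
    gguf.GGMLQuantizationType(10_000)      # a value no GGML tensor type uses
except ValueError as e:
    print(e)                               # invalid values fail loudly at load time
```

So the raw int read from the GGML file becomes a proper GGMLQuantizationType member right where it is parsed, and unknown values surface as a clear ValueError instead of propagating further.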

@farbodbj farbodbj force-pushed the fix-ggml-to-gguf-script branch from 451e52f to 66a4225 on August 10, 2024 18:16
farbodbj (Contributor, Author) commented Aug 10, 2024

@compilade
You're right, using GGMLQuantizationType is more readable and idempotent. I tested your suggested fix (created a patch and applied it) and it resolves this issue, so I used it and committed the changes again.
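
As a quick sanity check of the "idempotent" part, a minimal sketch, assuming the gguf package from gguf-py is installed:

```python
# Wrapping a value in GGMLQuantizationType more than once is harmless:
# an existing member comes back as the very same member.
import gguf

member = gguf.GGMLQuantizationType.F16
assert gguf.GGMLQuantizationType(member) is member        # member in, same member out
assert gguf.GGMLQuantizationType(int(member)) is member   # raw int in, same member out
print("idempotent:", member.name)
```
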
There was another problem with this script too; I'll open an issue for it soon and try to fix it, but I first need to get to know the GGML and GGUF formats better.
Thanks for your review.

farbodbj (Contributor, Author) commented:

Is the failed CI check required for merging this PR, and do I need to do anything about it? It does not seem to be related to this PR.

compilade (Collaborator) commented:

> Is the failed CI check required for merging this PR, and do I need to do anything about it? It does not seem to be related to this PR.

You don't need to do anything about it; a fix is pending in #8982, and the source of the problem was identified in #7599 (comment).

@compilade compilade added the merge ready label Aug 12, 2024
farbodbj (Contributor, Author) commented Aug 16, 2024

@compilade @ggerganov
Since this PR has been approved and labeled merge-ready but has had no activity in the past 5 days, I'm mentioning you here to keep it from becoming stale and to prompt some action on it.

@ggerganov ggerganov merged commit ee2984b into ggerganov:master Aug 16, 2024
8 of 9 checks passed
ggerganov (Owner) commented:

Thanks for the reminder!

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Labels: bugfix, merge ready, python, Review Complexity: Low

4 participants