
Gemma models quantized using llamacpp not working in lm studio #5706

Closed
rombodawg opened this issue Feb 25, 2024 · 8 comments

Comments

@rombodawg

rombodawg commented Feb 25, 2024

Gemma models that have been quantized using llama.cpp are not working. Please look into this issue.

Error:

"llama.cpp error: 'create_tensor: tensor 'output.weight' not found'"

I will open an issue on the LM Studio GitHub as well addressing this:

lmstudio-ai/configs#21

System:
Ryzen 5600X
RTX 3080 GPU
B550 motherboard
64 GB DDR4 RAM
Windows 10

@Yefori-Go

Perhaps you should try using the latest llama.cpp to convert the Gemma model:

python .\convert-hf-to-gguf.py models\gemma-2b-it\ --outfile gemma-2b-it-f16.gguf

@JohannesGaessler
Collaborator

I can only speak for myself, but I 100% refuse to debug a problem unless it can be reproduced entirely with open-source code.

@rombodawg
Author

rombodawg commented Feb 26, 2024

No, it literally doesn't work.

I just built this version of llama.cpp, and that .py script doesn't work for Gemma.

Plus, that's not even the script you are supposed to use according to the documentation. You are supposed to use convert.py.

E:\Open_source_ai_chatbot\Llamacpp-3\llama.cpp>python E:\Open_source_ai_chatbot\Llamacpp-mixtral\llamacpp-clone-mixtral\convert-hf-to-gguf.py E:\Open_source_ai_chatbot\OOBA_10\text-generation-webui-main\models\Gemma-EveryoneLLM-7b-test --outfile Gemma-EveryoneLLM-7b-test.gguf --outtype f16
Loading model: Gemma-EveryoneLLM-7b-test
Traceback (most recent call last):
  File "E:\Open_source_ai_chatbot\Llamacpp-mixtral\llamacpp-clone-mixtral\convert-hf-to-gguf.py", line 1033, in <module>
    model_instance = model_class(dir_model, ftype_map[args.outtype], fname_out, args.bigendian)
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\Open_source_ai_chatbot\Llamacpp-mixtral\llamacpp-clone-mixtral\convert-hf-to-gguf.py", line 48, in __init__
    self.model_arch = self._get_model_architecture()
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\Open_source_ai_chatbot\Llamacpp-mixtral\llamacpp-clone-mixtral\convert-hf-to-gguf.py", line 225, in _get_model_architecture
    raise NotImplementedError(f'Architecture "{arch}" not supported!')
NotImplementedError: Architecture "GemmaForCausalLM" not supported!
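Worth noting (an observation, not from the thread): the path in the traceback points at an older clone (`Llamacpp-mixtral\llamacpp-clone-mixtral`), and the `NotImplementedError` is exactly what a converter from before Gemma support would raise. A minimal sketch, assuming a dispatch table like the one the script uses (names and mappings here are illustrative, not llama.cpp's actual code):

```python
# Illustrative sketch of how a HF-to-GGUF converter maps the
# architecture string from config.json to a handler, raising for
# architectures it does not know about (as in the traceback above).
SUPPORTED_ARCHITECTURES = {
    "LlamaForCausalLM": "llama",
    "MixtralForCausalLM": "llama",
    # An up-to-date checkout also maps "GemmaForCausalLM" here.
}

def get_model_architecture(arch: str) -> str:
    if arch not in SUPPORTED_ARCHITECTURES:
        raise NotImplementedError(f'Architecture "{arch}" not supported!')
    return SUPPORTED_ARCHITECTURES[arch]
```

This is consistent with Yefori-Go's suggestion: pulling the latest llama.cpp extends the table and the conversion proceeds.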

@rombodawg
Author

@JohannesGaessler I totally understand that; hopefully openai is willing to reach out and work with you to fix this.

@rombodawg
Author

rombodawg commented Feb 26, 2024

I'm uploading the model files for the merges if anyone wants to do some debugging. They should be up in the next 10 hours or so; sorry, slow internet.

Follow the linked threads, and check out my model for debugging.

Thread links:
lmstudio-ai/configs#21
#5706
arcee-ai/mergekit#181
oobabooga/text-generation-webui#5562

https://huggingface.co/rombodawg/Gemme-Merge-Test-7b

@hiepxanh

hiepxanh commented Mar 15, 2024

@rombodawg did you try the latest version?
#6051

It is already supported.

@github-actions github-actions bot added the stale label Apr 15, 2024
Contributor

This issue was closed because it has been inactive for 14 days since being marked as stale.

@DementedWeasel1971

I think that, like me, people are considering other options. I will, however, keep watching the release notes to see when this is fixed.
