Bug: Converted HF LoRA adapter on Llama 3.1 not loading. #9114
Labels
bug-unconfirmed
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
Comments
Ujjawal-K-Panchal added the bug-unconfirmed and critical severity labels on Aug 21, 2024
Can you try the conversion script from #9117?
Thank you so much for the quick response! Testing this.
The procedure described above now works perfectly. I also tried different quantizations and saw no problems. Putting the log snippet of the received output below:
Thanks for the help!
Keeping this open until the PR is merged.
What happened?
In short: Using the standard procedure from the documentation, I am unable to attach a converted LoRA adapter (HF -> GGUF) to a Llama 3.1 GGUF model.
Procedure:
1. Saved the LoRA adapter to lora-dir/ for later access. (Using trl.SFTTrainer; saved using the output_dir parameter.)
2. Converted the model to .gguf format via the prescribed method (convert_hf_to_gguf.py).
3. Converted the LoRA adapter: python convert_lora_to_gguf.py ../lora-dir/ --outfile ../lora-dir/llama31-lora.gguf --outtype bf16 --base ../models/models--meta-llama--Meta-Llama-3.1-8B-Instruct/snapshots/8c22764a7e3675c50d4c7c9a4edb474456022b16/
4. Ran: ./llama-cli -m ../modelstore/llama31-Q4_K_M-v2.gguf --lora ./lora-dir/llama31-freedom-lora-v010.gguf
Additional notes:
- convert_lora_to_gguf.py.
- snapshot: 8c22764a7e3675c50d4c7c9a4edb474456022b16 is the current default in llama 3.1.
Name and Version
version: 3484 (4730fac)
built with cc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0 for x86_64-linux-gnu
What operating system are you seeing the problem on?
Linux
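For anyone reproducing this, the two commands from the procedure can be sketched as a small Python helper that only builds and prints the command lines. This is a minimal sketch, not part of llama.cpp: the function names lora_convert_cmd and llama_cli_cmd are hypothetical, and the flags and paths are copied from the commands in this report. It passes a single adapter path to both steps, so the converter's --outfile and the --lora argument are guaranteed to name the same file.

```python
import shlex

# Paths copied from the commands in this report; adjust to your layout.
LORA_DIR = "../lora-dir/"
ADAPTER_GGUF = "../lora-dir/llama31-lora.gguf"
MODEL_GGUF = "../modelstore/llama31-Q4_K_M-v2.gguf"
BASE_DIR = (
    "../models/models--meta-llama--Meta-Llama-3.1-8B-Instruct/"
    "snapshots/8c22764a7e3675c50d4c7c9a4edb474456022b16/"
)


def lora_convert_cmd(lora_dir, outfile, base_dir, outtype="bf16"):
    """Hypothetical helper: argv for convert_lora_to_gguf.py as used above."""
    return [
        "python", "convert_lora_to_gguf.py", lora_dir,
        "--outfile", outfile,
        "--outtype", outtype,
        "--base", base_dir,
    ]


def llama_cli_cmd(model_gguf, lora_gguf):
    """Hypothetical helper: argv for llama-cli with a LoRA adapter attached."""
    return ["./llama-cli", "-m", model_gguf, "--lora", lora_gguf]


# Build both commands from the same adapter path.
conv = lora_convert_cmd(LORA_DIR, ADAPTER_GGUF, BASE_DIR)
run = llama_cli_cmd(MODEL_GGUF, ADAPTER_GGUF)

# Print shell-quoted command lines; nothing is executed here.
print(shlex.join(conv))
print(shlex.join(run))
```

The printed lines can be pasted into a shell from the llama.cpp checkout, or passed to subprocess.run once the paths are confirmed to exist.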
Relevant log output