
lora : fix llama conversion script with model having ROPE_FREQS #9117

Merged: 1 commit into ggerganov:master on Aug 23, 2024

Conversation

@ngxson (Collaborator) commented Aug 21, 2024

Resolves #9114


@ngxson requested a review from compilade on Aug 21, 2024 at 12:55
@github-actions bot added the testing (Everything test related) and python (python script changes) labels on Aug 21, 2024
@Ujjawal-K-Panchal (Contributor) commented

This fixes issue #9114, which I raised. Please look at the final output log here.

Comment on lines +1598 to +1599:

    if not self.is_lora:
        self.gguf_writer.add_tensor(self.format_tensor_name(gguf.MODEL_TENSOR.ROPE_FREQS), np.array(rope_factors, dtype=np.float32))
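For context, here is a self-contained sketch of what this guard protects: the Llama 3.1-style rope-factor computation that the conversion script performs before this snippet. The helper name `llama3_rope_factors` and the default values are assumptions for illustration, not verbatim code from convert_hf_to_gguf.py:

```python
# Sketch only: approximates the Llama 3.1 rope-scaling factors that the
# conversion script derives from the HF config's "rope_scaling" block.
import math
import numpy as np

def llama3_rope_factors(rope_scaling: dict, head_dim: int,
                        rope_theta: float = 500000.0) -> np.ndarray:
    """Per-frequency scaling factors written to the rope_freqs tensor."""
    factor = rope_scaling.get("factor", 8.0)
    low_freq_factor = rope_scaling.get("low_freq_factor", 1.0)
    high_freq_factor = rope_scaling.get("high_freq_factor", 4.0)
    old_context_len = rope_scaling.get("original_max_position_embeddings", 8192)

    # Base RoPE frequencies for the even dimensions, as in standard RoPE.
    freqs = 1.0 / (rope_theta ** (np.arange(0, head_dim, 2, dtype=np.float32) / head_dim))
    low_freq_wavelen = old_context_len / low_freq_factor
    high_freq_wavelen = old_context_len / high_freq_factor

    factors = []
    for freq in freqs:
        wavelen = 2 * math.pi / freq
        if wavelen < high_freq_wavelen:    # high-frequency band: leave unscaled
            factors.append(1.0)
        elif wavelen > low_freq_wavelen:   # low-frequency band: fully scaled
            factors.append(factor)
        else:                              # smooth interpolation in between
            smooth = (old_context_len / wavelen - low_freq_factor) \
                     / (high_freq_factor - low_freq_factor)
            factors.append(1.0 / ((1.0 - smooth) / factor + smooth))
    return np.array(factors, dtype=np.float32)

# The fix: only a full-model conversion writes this tensor; a LoRA adapter
# carries adapter weights only, so rope_freqs must be skipped (issue #9114).
rope_factors = llama3_rope_factors(
    {"factor": 8.0, "low_freq_factor": 1.0, "high_freq_factor": 4.0,
     "original_max_position_embeddings": 8192},
    head_dim=128,  # e.g. Llama 3.1 8B: hidden_size 4096 / 32 heads
)
for is_lora in (False, True):
    if not is_lora:
        print(f"full model: write rope_freqs, shape {rope_factors.shape}")
    else:
        print("LoRA adapter: skip rope_freqs")
```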
@compilade (Collaborator) commented Aug 21, 2024

For Phi-3, vocab-only conversion is also affected by these rope_freqs tensors, because this code lives in set_gguf_parameters. As a result, vocab-only conversion of Phi-3-128k models produces invalid GGUF files (this is already a problem on master).

A more general solution that covers both LoRA and vocab-only conversions should be possible.

Maybe some kind of self.generate_extra_tensors(), which self.prepare_tensors() would call before it calls self.get_tensors(). LoraModel could then simply override generate_extra_tensors() as a no-op (and vocab-only conversion does not call prepare_tensors at all). That can be done in a follow-up PR, though.
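A minimal sketch of that suggested hook, assuming Model, LlamaModel, and LoraModel are the relevant classes in the conversion scripts; this illustrates the idea only, not the refactor that eventually landed:

```python
# Sketch of the proposed generate_extra_tensors() hook (assumed names).
class Model:
    def generate_extra_tensors(self):
        # Default: no synthesized tensors. Subclasses that emit tensors not
        # present in the HF checkpoint (e.g. rope_freqs) override this.
        pass

    def get_tensors(self):
        # Yields (name, tensor) pairs from the checkpoint (stubbed here).
        yield from ()

    def prepare_tensors(self):
        # Extra tensors are generated before the checkpoint tensors are
        # mapped and written, replacing the add_tensor call that currently
        # lives in set_gguf_parameters.
        self.generate_extra_tensors()
        for name, tensor in self.get_tensors():
            ...  # map names, convert dtypes, write to the GGUF file

class LlamaModel(Model):
    def generate_extra_tensors(self):
        # Full-model conversion would emit the rope_freqs tensor here.
        ...

class LoraModel(Model):
    def generate_extra_tensors(self):
        # A LoRA adapter must not carry base-model tensors: no-op.
        pass
```

Because vocab-only conversion never calls prepare_tensors(), moving the tensor generation into this hook would also keep rope_freqs out of vocab-only GGUF files.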

@ngxson (Collaborator, Author) commented

OK I'll merge this now and will let you refactor this further in a follow-up PR.
Thank you for the help!

@ngxson merged commit 3ba780e into ggerganov:master on Aug 23, 2024
9 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Successfully merging this pull request may close these issues.

Bug: Converted HF LoRA adapter on Llama 3.1 not loading. (#9114)