Replies: 1 comment 9 replies
-
Do you mean generate a new model file, or actually modify the second one directly? Swapping tensors (or layers which are sets of tensors) in the second case would only be possible if the sizes were exactly the same. So you could only swap tensors from an f16 source to an f16 destination, etc. |
Beta Was this translation helpful? Give feedback.
9 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Suggestion: If you can make a tool that pick up a range of layers of a modelA.gguf and swap/overwrite directly where is specified in the modelB.gguf, on f16, q8_0 and other quants if possible. if it could use mmapping to do it without using too much memory.
Beta Was this translation helpful? Give feedback.
All reactions