quantized model ? Llama cpp? #2

thiswillbeyourgithub · 2024-02-19T09:22:26Z

Hi,

Reading your articles made me really curious about trying that but I was wondering of it was possible to use HuggingFace's quantized models or even llamacpp or if that required deep changes.

Thanks!

vgel · 2024-02-22T07:44:01Z

Working on a llama.cpp implementation!

vgel · 2024-03-10T07:17:41Z

There's now a PR live on the llama.cpp repo: ggerganov/llama.cpp#5970

vgel · 2024-03-26T06:02:21Z

That PR is merged, so closing this issue.

vgel closed this as completed Mar 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

quantized model ? Llama cpp? #2

quantized model ? Llama cpp? #2

thiswillbeyourgithub commented Feb 19, 2024

vgel commented Feb 22, 2024

vgel commented Mar 10, 2024

vgel commented Mar 26, 2024

quantized model ? Llama cpp? #2

quantized model ? Llama cpp? #2

Comments

thiswillbeyourgithub commented Feb 19, 2024

vgel commented Feb 22, 2024

vgel commented Mar 10, 2024

vgel commented Mar 26, 2024