Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

quantized model ? Llama cpp? #2

Closed
thiswillbeyourgithub opened this issue Feb 19, 2024 · 3 comments
Closed

quantized model ? Llama cpp? #2

thiswillbeyourgithub opened this issue Feb 19, 2024 · 3 comments

Comments

@thiswillbeyourgithub
Copy link

Hi,

Reading your articles made me really curious about trying that but I was wondering of it was possible to use HuggingFace's quantized models or even llamacpp or if that required deep changes.

Thanks!

@vgel
Copy link
Owner

vgel commented Feb 22, 2024

Working on a llama.cpp implementation!

@vgel
Copy link
Owner

vgel commented Mar 10, 2024

There's now a PR live on the llama.cpp repo: ggerganov/llama.cpp#5970

@vgel
Copy link
Owner

vgel commented Mar 26, 2024

That PR is merged, so closing this issue.

@vgel vgel closed this as completed Mar 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants