Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to load leaked LLaMA weights? #55

Open
jontstaz opened this issue Apr 12, 2023 · 1 comment
Open

How to load leaked LLaMA weights? #55

jontstaz opened this issue Apr 12, 2023 · 1 comment

Comments

@jontstaz
Copy link

Hi,

Your project looks very promising. I'm curious how I can leverage the leaked LLaMA weights with Alpaca-Turbo, specifically the 65B model. Does anyone have any idea what the correct process is?

I have the following files:

  • tokenizer_checklist.chk
  • tokenizer.model
  • 65B/params.json
  • 65B/checklist.chk
  • 65B/consolidated.0X.pth (8 consolidated .pth files, numbered 00-07)

Thanks in advance,

@mvsite
Copy link

mvsite commented Apr 13, 2023

You are looking at the wrong files. Alpaca-Turbo is a web UI for the "main.exe" aka "chat.exe" binary which is the llama.cpp program for using models on CPU - so, basically long story short you want the 4bit quantized files which you can find all over on the huggingface website. Those files aren't here in most cases because the filesize is large, but also that all the ones derivative from the llama leak are legally problematic (meta owns the code and so it can't be leaked without risk of DMCA issue). You will have a pretty decent success rate by finding large files where it's a single file that ends with the ".bin" extension, and better luck where the file has "q1" or "4bit" in it. For optimal results, you could go past these hints to actually knowing what you're doing (which for the most part I don't either). Anyway new tools just dropped while I was typing this, easier to use and bundled with latest models... so not trying to be a dick and cross-promote in this guy's github but all you have to do is be in youtube or tiktok and express interest in ai and you'll be bombarded with it, everywhere, so just look around. New easy-to-use one click tool for this just dropped.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants