ChatGLM-6B is an open bilingual language model based on the General Language Model (GLM) framework, with 6.2 billion parameters. Thanks to quantization, users can deploy it locally on consumer-grade graphics cards (as little as 6GB of GPU memory at the INT4 quantization level).
ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese QA and dialogue. The model was trained on about 1T tokens of Chinese and English corpus, supplemented by supervised fine-tuning, feedback bootstrap, and reinforcement learning with human feedback. With only about 6.2 billion parameters, the model is able to generate answers that align with human preferences.
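The memory figures above follow from simple arithmetic on the parameter count. A rough sketch (weights only; real usage adds activations, KV cache, and framework overhead, so treat these as lower bounds):

```python
# Back-of-the-envelope weight-memory estimate for a 6.2B-parameter model.
# Illustrative only: runtime memory also includes activations and overhead,
# which is why the INT4 deployment target is quoted as 6GB, not ~3GB.

PARAMS = 6.2e9  # parameter count of ChatGLM-6B


def weight_memory_gb(bits_per_param: int) -> float:
    """GB needed just to hold the weights at the given precision."""
    return PARAMS * bits_per_param / 8 / 1e9


for name, bits in [("FP16", 16), ("INT8", 8), ("INT4", 4)]:
    print(f"{name}: ~{weight_memory_gb(bits):.1f} GB")
```

At INT4 the weights alone take about 3.1 GB, which is why a 6GB consumer GPU is enough once overhead is added.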
- 16-bit model on HuggingFace Hub (requires 16GB of RAM or VRAM)
- 4-bit model on HuggingFace Hub (requires 6GB of RAM or VRAM)
- Full GitHub Repo with local GPU & CPU Deployment instructions
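For local deployment, the checkpoints above can be loaded through Hugging Face `transformers`. A minimal sketch, assuming `transformers` is installed and a CUDA GPU is available; the repo ids `THUDM/chatglm-6b` (16-bit) and `THUDM/chatglm-6b-int4` (4-bit) are the published checkpoints:

```python
# Sketch: loading ChatGLM-6B locally via Hugging Face transformers.
# trust_remote_code=True is required because the model class ships
# with the checkpoint rather than with the transformers library.

def model_id_for(quantization: str) -> str:
    """Map a precision choice to the matching Hub checkpoint."""
    ids = {"fp16": "THUDM/chatglm-6b", "int4": "THUDM/chatglm-6b-int4"}
    return ids[quantization]


def load_chatglm(quantization: str = "int4"):
    """Download and prepare the model; INT4 fits a 6GB consumer GPU."""
    from transformers import AutoModel, AutoTokenizer  # deferred: heavy import

    repo = model_id_for(quantization)
    tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
    model = AutoModel.from_pretrained(repo, trust_remote_code=True).half().cuda().eval()
    return tokenizer, model


# Usage (requires a GPU and a one-time checkpoint download):
#   tokenizer, model = load_chatglm("int4")
#   response, history = model.chat(tokenizer, "Hello", history=[])
```

CPU-only deployment is also possible (see the GitHub repo's instructions), but is considerably slower than running on a GPU.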