Add GPU example for GLM-4 #11267
Conversation
Resolved review threads on python/llm/example/GPU/HF-Transformers-AutoModels/Model/glm4/generate.py
## Example 2: Stream Chat using `stream_chat()` API
In the example [streamchat.py](./streamchat.py), we show a basic use case for a GLM-4 model to stream chat, with IPEX-LLM INT4 optimizations.
### 1. Install
#### 1.1 Installation on Linux
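The diff excerpt above is truncated at the install heading. For illustration only, here is a minimal sketch of what the stream-chat flow described in that excerpt might look like; the model id, prompt, and the assumption that GLM-4's remote modeling code exposes a ChatGLM-style `stream_chat()` yielding `(response, history)` pairs are not taken from this PR.

```python
import torch
from transformers import AutoTokenizer
from ipex_llm.transformers import AutoModel  # IPEX-LLM drop-in loader with low-bit support

model_path = "THUDM/glm-4-9b-chat"  # illustrative model id, not from this PR

# load_in_4bit=True applies IPEX-LLM INT4 optimizations at load time;
# trust_remote_code pulls in the GLM-4 modeling code that provides stream_chat()
model = AutoModel.from_pretrained(model_path,
                                  load_in_4bit=True,
                                  trust_remote_code=True)
model = model.to("xpu")  # run on an Intel GPU
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

question = "What is AI?"
printed = ""
with torch.inference_mode():
    # stream_chat() is assumed to yield (response, history) pairs, where each
    # response is the full text generated so far (ChatGLM-family convention)
    for response, history in model.stream_chat(tokenizer, question, history=[]):
        print(response[len(printed):], end="", flush=True)
        printed = response
print()
```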
The environment setup in Example 2 duplicates Example 1. Can we merge this with Example 1?
This follows the existing template. We keep it in case a user only refers to Example 2, and for cases where the environments differ between examples. :)
We may update the README of the ChatGLM3 series in future PRs.
Fix pretrained arguments in generate.py and streamchat.py
Update install instructions: tiktoken is required for GLM-4
Description
- Add and update GPU example for GLM-4
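For context, a hedged sketch of the usual shape of such a GPU generate example, assuming the common IPEX-LLM `AutoModelForCausalLM` low-bit loading pattern; the model id, prompt, and generation settings below are illustrative rather than taken from this PR's generate.py.

```python
import torch
from transformers import AutoTokenizer
from ipex_llm.transformers import AutoModelForCausalLM  # IPEX-LLM low-bit loader

model_path = "THUDM/glm-4-9b-chat"  # illustrative model id, not from this PR

# load_in_4bit=True enables IPEX-LLM INT4 optimizations at load time
model = AutoModelForCausalLM.from_pretrained(model_path,
                                             load_in_4bit=True,
                                             trust_remote_code=True,
                                             use_cache=True)
model = model.to("xpu")  # move the optimized model to the Intel GPU
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

prompt = "What is AI?"
with torch.inference_mode():
    input_ids = tokenizer.encode(prompt, return_tensors="pt").to("xpu")
    output = model.generate(input_ids, max_new_tokens=32)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```

GLM-4 chat-template handling (e.g. building the prompt via the tokenizer's chat template) is omitted here for brevity; the actual example in this PR may differ.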