update NPU examples #11540

leonardozcm · 2024-07-09T07:31:59Z

Description

Update verified models.

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md

jason-dai · 2024-07-09T09:21:03Z

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md

-In this directory, you will find examples on how you could apply IPEX-LLM INT4 or INT8 optimizations on LLM models on [Intel NPUs](../../../README.md). For illustration purposes, we utilize the [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) as reference Llama2 models. For more verified models please refer to the [Verification Models](#verification-models) section.
+In this directory, you will find examples on how you could apply IPEX-LLM INT4 or INT8 optimizations on LLM models on [Intel NPUs](../../../README.md). For illustration purposes, we utilize the [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) as reference Llama2 models. In this directory, you will find examples on how you could apply IPEX-LLM INT4 or INT8 optimizations on LLM models on Intel NPUs. See the table blow for verified models.
+
+## Verification Models


Verified Models

Updated in #11548

* update NPU examples

leonardozcm added 2 commits July 9, 2024 15:30

update NPU examples

72892b9

unverified model

6e0a56a

leonardozcm requested review from jason-dai and MeouSker77 July 9, 2024 07:33

MeouSker77 approved these changes Jul 9, 2024

View reviewed changes

jason-dai reviewed Jul 9, 2024

View reviewed changes

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md Outdated Show resolved Hide resolved

jason-dai reviewed Jul 9, 2024

View reviewed changes

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md Outdated Show resolved Hide resolved

update

de400d8

leonardozcm merged commit 76a5802 into intel-analytics:main Jul 9, 2024
1 check passed

leonardozcm deleted the update_npu_example branch July 9, 2024 09:19

jason-dai reviewed Jul 9, 2024

View reviewed changes

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md Show resolved Hide resolved

jason-dai reviewed Jul 9, 2024

View reviewed changes

RyuKosei pushed a commit to RyuKosei/ipex-llm that referenced this pull request Jul 19, 2024

update NPU examples (intel-analytics#11540)

80d08e9

* update NPU examples

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update NPU examples #11540

update NPU examples #11540

leonardozcm commented Jul 9, 2024

jason-dai Jul 9, 2024

leonardozcm Jul 10, 2024

update NPU examples #11540

update NPU examples #11540

Conversation

leonardozcm commented Jul 9, 2024

Description

jason-dai Jul 9, 2024

Choose a reason for hiding this comment

leonardozcm Jul 10, 2024

Choose a reason for hiding this comment