Skip to content

Actions: intel/ipex-llm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
8,491 workflow run results
8,491 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

LLM: Update qkv fusion for GGUF-IQ2
Python Style Check #14260: Pull request #10271 synchronize by rnwang04
February 28, 2024 11:05 3m 44s update_qkv_fusion
February 28, 2024 11:05 3m 44s
LLM: Update qkv fusion for GGUF-IQ2
LLM Example Test #1107: Pull request #10271 synchronize by rnwang04
February 28, 2024 11:05 23m 28s update_qkv_fusion
February 28, 2024 11:05 23m 28s
[LLM] Add quantize_kv optimization for yuan2 model
LLM Unit Tests #4090: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:31 3h 45m 38s quantize-kv-yuan2
February 28, 2024 10:31 3h 45m 38s
[LLM] Add quantize_kv optimization for yuan2 model
Python Style Check #14259: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:31 3m 44s quantize-kv-yuan2
February 28, 2024 10:31 3m 44s
LLM: Update qkv fusion for GGUF-IQ2
LLM Unit Tests #4089: Pull request #10271 opened by rnwang04
February 28, 2024 10:30 34m 58s update_qkv_fusion
February 28, 2024 10:30 34m 58s
LLM: Update qkv fusion for GGUF-IQ2
LLM Example Test #1106: Pull request #10271 opened by rnwang04
February 28, 2024 10:30 26m 28s update_qkv_fusion
February 28, 2024 10:30 26m 28s
LLM: Update qkv fusion for GGUF-IQ2
Python Style Check #14258: Pull request #10271 opened by rnwang04
February 28, 2024 10:30 3m 50s update_qkv_fusion
February 28, 2024 10:30 3m 50s
[LLM] Add quantize_kv optimization for yuan2 model
LLM Unit Tests #4088: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:08 23m 31s quantize-kv-yuan2
February 28, 2024 10:08 23m 31s
[LLM] Add quantize_kv optimization for yuan2 model
Python Style Check #14257: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:08 4m 0s quantize-kv-yuan2
February 28, 2024 10:08 4m 0s
[LLM] Add quantize_kv optimization for yuan2 model
Python Style Check #14256: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:07 1m 4s quantize-kv-yuan2
February 28, 2024 10:07 1m 4s
[LLM] Add quantize_kv optimization for yuan2 model
LLM Unit Tests #4087: Pull request #10243 synchronize by sgwhat
February 28, 2024 10:07 1m 10s quantize-kv-yuan2
February 28, 2024 10:07 1m 10s
Add webUI quickstart
Performance Regression Test #4735: Issue comment #10266 (comment) created by shane-huang
February 28, 2024 09:44 53s
February 28, 2024 09:44 53s
LLM: relax batch check of flash atttention by double check attention mask
LLM Unit Tests #4085: Pull request #10270 synchronize by rnwang04
February 28, 2024 09:28 23h 55m 41s flash_batch
February 28, 2024 09:28 23h 55m 41s
LLM: relax batch check of flash atttention by double check attention mask
Python Style Check #14254: Pull request #10270 synchronize by rnwang04
February 28, 2024 09:28 3m 48s flash_batch
February 28, 2024 09:28 3m 48s
LLM: relax batch check of flash atttention by double check attention mask
Python Style Check #14253: Pull request #10270 synchronize by rnwang04
February 28, 2024 09:21 4m 1s flash_batch
February 28, 2024 09:21 4m 1s
LLM: relax batch check of flash atttention by double check attention mask
LLM Unit Tests #4084: Pull request #10270 synchronize by rnwang04
February 28, 2024 09:21 7m 34s flash_batch
February 28, 2024 09:21 7m 34s
fix baichuan2 13b 2k input (#10267)
LLM Unit Tests #4083: Commit 64f0cc4 pushed by MeouSker77
February 28, 2024 09:20 23h 39m 33s main
February 28, 2024 09:20 23h 39m 33s
fix baichuan2 13b 2k input (#10267)
Python Style Check #14252: Commit 64f0cc4 pushed by MeouSker77
February 28, 2024 09:20 3m 42s main
February 28, 2024 09:20 3m 42s
LLM: relax batch check of flash atttention by double check attention mask
LLM Unit Tests #4082: Pull request #10270 opened by rnwang04
February 28, 2024 09:19 2m 11s flash_batch
February 28, 2024 09:19 2m 11s
LLM: relax batch check of flash atttention by double check attention mask
Python Style Check #14251: Pull request #10270 opened by rnwang04
February 28, 2024 09:19 2m 17s flash_batch
February 28, 2024 09:19 2m 17s
Fix Arc StarCoder wrong query_shape when input is long (#10268)
LLM Unit Tests #4081: Commit bde8e5c pushed by Uxito-Ada
February 28, 2024 09:07 23h 27m 7s main
February 28, 2024 09:07 23h 27m 7s
Fix Arc StarCoder wrong query_shape when input is long (#10268)
Python Style Check #14250: Commit bde8e5c pushed by Uxito-Ada
February 28, 2024 09:07 3m 46s main
February 28, 2024 09:07 3m 46s
Fix gptj failed to extend issue
LLM Unit Tests #4080: Pull request #10269 opened by cyita
February 28, 2024 09:01 23h 10m 0s cyita:fix-gptj-extend
February 28, 2024 09:01 23h 10m 0s
Fix gptj failed to extend issue
Python Style Check #14249: Pull request #10269 opened by cyita
February 28, 2024 09:01 4m 1s cyita:fix-gptj-extend
February 28, 2024 09:01 4m 1s
Fix Arc StarCoder wrong query_shape when input is long
LLM Unit Tests #4079: Pull request #10268 synchronize by Uxito-Ada
February 28, 2024 08:58 22h 49m 56s Uxito-Ada-patch-2
February 28, 2024 08:58 22h 49m 56s