Modify arc nightly perf to fp16 (#11275)
* change api

* move to pr mode and remove the build

* add batch4 yaml and remove the bigcode

* remove batch4

* revert the starcode

* remove the exclude

* revert

---------

Co-authored-by: Yishuo Wang <[email protected]>
MargarettMao and MeouSker77 authored Jun 17, 2024
1 parent a2a5890 commit bca5cbd
Showing 4 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion python/llm/test/benchmark/arc-perf-test-batch2.yaml
@@ -29,7 +29,7 @@ in_out_pairs:
- '1024-128'
- '2048-256'
test_api:
- - "transformer_int4_gpu" # on Intel GPU
+ - "transformer_int4_fp16_gpu" # on Intel GPU
cpu_embedding: False # whether put embedding to CPU (only avaiable now for gpu win related test_api)
exclude:
- 'bigcode/starcoder-15.5b-4bit:2048'
2 changes: 1 addition & 1 deletion python/llm/test/benchmark/arc-perf-test.yaml
@@ -29,7 +29,7 @@ in_out_pairs:
- '1024-128'
- '2048-256'
test_api:
- - "transformer_int4_gpu" # on Intel GPU
+ - "transformer_int4_fp16_gpu" # on Intel GPU
cpu_embedding: False # whether put embedding to CPU (only avaiable now for gpu win related test_api)
exclude:
# - 'fnlp/moss-moon-003-sft-4bit:1024'
@@ -15,5 +15,5 @@ in_out_pairs:
- '1024-128'
- '2048-256'
test_api:
- - "transformer_int4_gpu" # on Intel GPU
+ - "transformer_int4_fp16_gpu" # on Intel GPU
cpu_embedding: False # whether put embedding to CPU (only avaiable now for gpu win related test_api)
2 changes: 1 addition & 1 deletion python/llm/test/benchmark/arc-perf-transformers-437.yaml
@@ -15,5 +15,5 @@ in_out_pairs:
- '1024-128'
- '2048-256'
test_api:
- - "transformer_int4_gpu" # on Intel GPU
+ - "transformer_int4_fp16_gpu" # on Intel GPU
cpu_embedding: False # whether put embedding to CPU (only avaiable now for gpu win related test_api)
