Skip to content

Commit

Permalink
added new configs
Browse files Browse the repository at this point in the history
  • Loading branch information
IlyasMoutawwakil committed Nov 24, 2023
1 parent 612fcbe commit 5668117
Show file tree
Hide file tree
Showing 66 changed files with 32 additions and 4,000 deletions.
4 changes: 1 addition & 3 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -168,6 +168,4 @@ data/
version.txt

actions-runner/
experiments/
examples/
results/
experiments/
6 changes: 3 additions & 3 deletions examples/running-llamas/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,16 +7,16 @@ A set of benchmarks on Meta's LLaMA2's inference.
You will need to install these quantization packages:

```bash
pip install auto-gptq # or install it from source
pip install auto-gptq
```

## Running

Then run these commands from this directory:

```bash
optimum-benchmark --config-dir configs/ --config-name _base_ --multirun
optimum-benchmark --config-dir configs/ --config-name gptq --multirun
optimum-benchmark --config-dir configs/ --config-name fp16 --multirun
optimum-benchmark --config-dir configs/ --config-name bnb-4bit --multirun
```

This will create a folder called `experiments` with the results of the benchmarks with an inference `batch_size` ranging from 1 to 16 and an input `sequence_length` (prompt size) of 256.
Expand Down
Binary file not shown.
Binary file not shown.
11 changes: 0 additions & 11 deletions examples/running-llamas/artifacts/A100-80GB/full_report.csv

This file was deleted.

Binary file not shown.
Binary file not shown.
Loading

0 comments on commit 5668117

Please sign in to comment.