Adds parallel MLPerf accuracy eval. #1059

patemotter · 2024-11-22T22:14:59Z

For MLPerf accuracy runs we need to compute the ROUGE scores and compare against expected outputs.

This change parallelizes the evaluation code, which gives a speedup of about 20x (10m -> 0.5m wall-clock) when evaluating the 25k MLPerf dataset for Llama2-70B. Since this doesn't change the underlying computation of the scores, we get the same results as the serial version.
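For readers who want the general shape of the approach, here is a minimal sketch of the pattern, not the code in this PR. It assumes the metric is an average of independent per-sample scores, so the samples can be scored in worker processes and combined in the original order. `unigram_f1` is a hypothetical stand-in metric (it is not ROUGE), and `score_serial` / `score_parallel` are illustrative names. The `fork` start method is also an assumption and is POSIX-only.

```python
import multiprocessing as mp
from collections import Counter


def unigram_f1(pair):
    """Hypothetical stand-in per-sample metric (NOT ROUGE): unigram-overlap F1."""
    pred, ref = pair
    pred_counts, ref_counts = Counter(pred.split()), Counter(ref.split())
    overlap = sum((pred_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


def score_serial(preds, refs):
    """Score every (prediction, reference) pair in one process."""
    scores = [unigram_f1(pair) for pair in zip(preds, refs)]
    return round(100 * sum(scores) / len(scores), 4)


def score_parallel(preds, refs, workers=4, chunksize=64):
    """Same computation, with per-sample scoring spread across processes."""
    # Assumption: "fork" is available (Linux/macOS). On other platforms use
    # the default context and call this under `if __name__ == "__main__":`.
    ctx = mp.get_context("fork")
    with ctx.Pool(workers) as pool:
        # pool.map preserves input order, so the aggregation below sums the
        # same numbers in the same order as the serial version.
        scores = pool.map(unigram_f1, zip(preds, refs), chunksize=chunksize)
    return round(100 * sum(scores) / len(scores), 4)
```

Because only the scheduling of the per-sample work changes, `score_parallel(preds, refs)` returns the same value as `score_serial(preds, refs)` for the same inputs.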

Original Serial Results
{'rouge1': 44.3842, 'rouge2': 21.8723, 'rougeL': 28.5027, 'rougeLsum': 41.947, 'gen_len': 28135648, 'gen_num': 24576, 'gen_tok_len': 7171079, 'tokens_per_sample': 291.8}

real    10m4.329s
user    10m8.792s
sys     0m8.158s


Parallel Results
{'rouge1': 44.3842, 'rouge2': 21.8723, 'rougeL': 28.5027, 'rougeLsum': 41.947, 'gen_len': 28135648, 'gen_num': 24576, 'gen_tok_len': 7171079, 'tokens_per_sample': 291.8}

real    0m29.519s
user    13m2.545s
sys     2m43.586s
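As a quick check on the numbers above, this short sketch recomputes the speedup from the two `time` outputs. The helper name `to_seconds` is illustrative; the timing strings are copied from the runs above.

```python
import re


def to_seconds(t):
    """Convert a `time`-style string such as '10m4.329s' to seconds."""
    m = re.fullmatch(r"(\d+)m([\d.]+)s", t)
    return int(m.group(1)) * 60 + float(m.group(2))


# Values copied from the serial and parallel runs above.
serial = {"real": "10m4.329s", "user": "10m8.792s", "sys": "0m8.158s"}
parallel = {"real": "0m29.519s", "user": "13m2.545s", "sys": "2m43.586s"}

serial_cpu = to_seconds(serial["user"]) + to_seconds(serial["sys"])
parallel_cpu = to_seconds(parallel["user"]) + to_seconds(parallel["sys"])

# Wall-clock speedup of the parallel run over the serial run.
wall_speedup = to_seconds(serial["real"]) / to_seconds(parallel["real"])
# Extra CPU time spent by the parallel run (process startup, IPC, etc.).
cpu_ratio = parallel_cpu / serial_cpu
# Average number of cores kept busy during the parallel run.
avg_busy_cores = parallel_cpu / to_seconds(parallel["real"])

print(f"wall-clock speedup: {wall_speedup:.2f}x")
print(f"CPU time ratio:     {cpu_ratio:.2f}x")
print(f"avg busy cores:     {avg_busy_cores:.1f}")
```

This works out to a wall-clock speedup of about 20.5x, at the cost of roughly 1.5x the total CPU time, with about 32 cores busy on average during the parallel run.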

MaxText/inference_mlperf/run_v6e_microbenchmark.sh

MaxText/inference_mlperf/evaluate-accuracy.py

vipannalla · 2024-11-23T00:21:41Z

Can you add a way (an env var or an option in llama_offline_run.sh) to use this new, faster eval file?

patemotter requested review from gobbleturk, jonb377, khatwanimohit, bvandermoon and vipannalla as code owners November 22, 2024 22:15

patemotter force-pushed the patemotter_acc_eval branch from cc847b8 to 6f28f24 Compare November 22, 2024 22:19

patemotter assigned vipannalla and singh-mitali Nov 22, 2024

patemotter force-pushed the patemotter_acc_eval branch from 6f28f24 to 0177eb0 Compare November 22, 2024 22:23

singh-mitali reviewed Nov 22, 2024

MaxText/inference_mlperf/run_v6e_microbenchmark.sh (outdated, resolved)

patemotter force-pushed the patemotter_acc_eval branch from 0177eb0 to d9f468b Compare November 22, 2024 22:38

singh-mitali reviewed Nov 22, 2024

MaxText/inference_mlperf/evaluate-accuracy.py (outdated, resolved)

Parallelizes accuracy eval.

104d5ad

patemotter force-pushed the patemotter_acc_eval branch from d9f468b to 104d5ad Compare November 22, 2024 23:29

patemotter changed the title ~~Parallelizes MLPerf accuracy eval.~~ Adds parallel MLPerf accuracy eval. Nov 22, 2024

singh-mitali approved these changes Nov 23, 2024

vipannalla approved these changes Nov 23, 2024

github-actions bot added the pull ready label Nov 23, 2024
