Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,811 workflow runs
1,811 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Sync Math-verify
Tests #2117: Pull request #535 synchronize by hynky1999
February 5, 2025 00:47 39m 5s sync_math_verify
February 5, 2025 00:47 39m 5s
Sync Math-verify
Tests #2116: Pull request #535 synchronize by hynky1999
February 5, 2025 00:45 42m 22s sync_math_verify
February 5, 2025 00:45 42m 22s
Sync Math-verify
Tests #2115: Pull request #535 opened by hynky1999
February 5, 2025 00:26 44m 10s sync_math_verify
February 5, 2025 00:26 44m 10s
Add GPQA for instruct models
Tests #2114: Pull request #534 synchronize by lewtun
February 4, 2025 16:14 40m 37s add-gpqa-generative
February 4, 2025 16:14 40m 37s
Fix loading of vllm model from files
Tests #2113: Pull request #533 synchronize by NathanHB
February 4, 2025 15:10 38m 49s nathan-fix-vllm-from-file
February 4, 2025 15:10 38m 49s
Add GPQA for instruct models
Tests #2112: Pull request #534 synchronize by lewtun
February 4, 2025 14:57 39m 3s add-gpqa-generative
February 4, 2025 14:57 39m 3s
Add GPQA for instruct models
Tests #2111: Pull request #534 opened by lewtun
February 4, 2025 14:55 40m 49s add-gpqa-generative
February 4, 2025 14:55 40m 49s
Fix loading of vllm model from files
Tests #2110: Pull request #533 synchronize by NathanHB
February 4, 2025 14:07 39m 58s nathan-fix-vllm-from-file
February 4, 2025 14:07 39m 58s
Fix loading of vllm model from files
Tests #2109: Pull request #533 opened by NathanHB
February 4, 2025 14:05 40m 54s nathan-fix-vllm-from-file
February 4, 2025 14:05 40m 54s
Add custom task (bac-fr) for evaluation of models in french (#518)
Tests #2108: Commit d7a1f11 pushed by clefourrier
February 3, 2025 16:08 41m 20s main
February 3, 2025 16:08 41m 20s
Update french_evals.py
Tests #2107: Commit be7da17 pushed by clefourrier
February 3, 2025 12:13 39m 21s main
February 3, 2025 12:13 39m 21s
Add swiss legal evals as new community tasks
Tests #2106: Pull request #389 synchronize by JoelNiklaus
February 1, 2025 10:55 Action required JoelNiklaus:add_swiss_legal_evals
February 1, 2025 10:55 Action required
Add swiss legal evals as new community tasks
Tests #2105: Pull request #389 synchronize by JoelNiklaus
February 1, 2025 10:37 Action required JoelNiklaus:add_swiss_legal_evals
February 1, 2025 10:37 Action required
Multi node vLLM
Tests #2104: Pull request #530 synchronize by ncassereau
February 1, 2025 08:41 Action required ncassereau:multi_node_vllm
February 1, 2025 08:41 Action required
Add custom task (bac-fr) for evaluation of models in french
Tests #2103: Pull request #518 synchronize by mdiazmel
January 31, 2025 16:57 37m 50s mdiazmel:main
January 31, 2025 16:57 37m 50s
adds olympiad bench (#521)
Tests #2102: Commit d332207 pushed by NathanHB
January 31, 2025 14:20 39m 4s main
January 31, 2025 14:20 39m 4s
Multi node vLLM
Tests #2101: Pull request #530 opened by ncassereau
January 31, 2025 13:53 Action required ncassereau:multi_node_vllm
January 31, 2025 13:53 Action required
Update links in readme
Tests #2100: Pull request #527 opened by jaysonfrancis
January 30, 2025 21:36 Action required jaysonfrancis:main
January 30, 2025 21:36 Action required
Humanity's last exam
Tests #2099: Pull request #520 synchronize by clefourrier
January 30, 2025 18:53 39m 36s clem_last_exam
January 30, 2025 18:53 39m 36s
adds olympiad bench
Tests #2097: Pull request #521 synchronize by NathanHB
January 30, 2025 13:22 37m 59s nathan-adds-olympiad-bench
January 30, 2025 13:22 37m 59s
Improve readability of the quick tour. (#501)
Tests #2096: Commit 515bd01 pushed by clefourrier
January 30, 2025 13:11 38m 33s main
January 30, 2025 13:11 38m 33s
Add Doc Strings to Config Files
Tests #2095: Pull request #465 synchronize by ParagEkbote
January 30, 2025 12:23 Action required ParagEkbote:Document-Custom-Model-Files
January 30, 2025 12:23 Action required
Implemented the possibility to load predictions from details files an…
Tests #2093: Commit 94fc5a2 pushed by NathanHB
January 29, 2025 14:59 38m 34s main
January 29, 2025 14:59 38m 34s
Humanity's last exam
Tests #2092: Pull request #520 synchronize by clefourrier
January 29, 2025 14:20 38m 18s clem_last_exam
January 29, 2025 14:20 38m 18s
Humanity's last exam
Tests #2091: Pull request #520 synchronize by clefourrier
January 29, 2025 14:17 40m 14s clem_last_exam
January 29, 2025 14:17 40m 14s