Skip to content

Actions: huggingface/lighteval

Quality

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,812 workflow runs
1,812 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Sync Math-verify
Quality #2118: Pull request #535 synchronize by hynky1999
February 5, 2025 00:47 2m 11s sync_math_verify
February 5, 2025 00:47 2m 11s
Sync Math-verify
Quality #2117: Pull request #535 synchronize by hynky1999
February 5, 2025 00:45 2m 25s sync_math_verify
February 5, 2025 00:45 2m 25s
Sync Math-verify
Quality #2116: Pull request #535 opened by hynky1999
February 5, 2025 00:26 2m 9s sync_math_verify
February 5, 2025 00:26 2m 9s
Add GPQA for instruct models
Quality #2115: Pull request #534 synchronize by lewtun
February 4, 2025 16:14 2m 6s add-gpqa-generative
February 4, 2025 16:14 2m 6s
Fix loading of vllm model from files
Quality #2114: Pull request #533 synchronize by NathanHB
February 4, 2025 15:10 2m 13s nathan-fix-vllm-from-file
February 4, 2025 15:10 2m 13s
Add GPQA for instruct models
Quality #2113: Pull request #534 synchronize by lewtun
February 4, 2025 14:57 2m 13s add-gpqa-generative
February 4, 2025 14:57 2m 13s
Add GPQA for instruct models
Quality #2112: Pull request #534 opened by lewtun
February 4, 2025 14:55 2m 9s add-gpqa-generative
February 4, 2025 14:55 2m 9s
Fix loading of vllm model from files
Quality #2111: Pull request #533 synchronize by NathanHB
February 4, 2025 14:07 2m 10s nathan-fix-vllm-from-file
February 4, 2025 14:07 2m 10s
Fix loading of vllm model from files
Quality #2110: Pull request #533 opened by NathanHB
February 4, 2025 14:05 2m 13s nathan-fix-vllm-from-file
February 4, 2025 14:05 2m 13s
Add custom task (bac-fr) for evaluation of models in french (#518)
Quality #2109: Commit d7a1f11 pushed by clefourrier
February 3, 2025 16:08 2m 19s main
February 3, 2025 16:08 2m 19s
Update french_evals.py
Quality #2108: Commit be7da17 pushed by clefourrier
February 3, 2025 12:13 2m 8s main
February 3, 2025 12:13 2m 8s
Add swiss legal evals as new community tasks
Quality #2107: Pull request #389 synchronize by JoelNiklaus
February 1, 2025 10:55 Action required JoelNiklaus:add_swiss_legal_evals
February 1, 2025 10:55 Action required
Add swiss legal evals as new community tasks
Quality #2106: Pull request #389 synchronize by JoelNiklaus
February 1, 2025 10:37 Action required JoelNiklaus:add_swiss_legal_evals
February 1, 2025 10:37 Action required
Multi node vLLM
Quality #2105: Pull request #530 synchronize by ncassereau
February 1, 2025 08:41 Action required ncassereau:multi_node_vllm
February 1, 2025 08:41 Action required
Add custom task (bac-fr) for evaluation of models in french
Quality #2104: Pull request #518 synchronize by mdiazmel
January 31, 2025 16:57 2m 16s mdiazmel:main
January 31, 2025 16:57 2m 16s
adds olympiad bench (#521)
Quality #2103: Commit d332207 pushed by NathanHB
January 31, 2025 14:20 2m 6s main
January 31, 2025 14:20 2m 6s
Multi node vLLM
Quality #2102: Pull request #530 opened by ncassereau
January 31, 2025 13:53 Action required ncassereau:multi_node_vllm
January 31, 2025 13:53 Action required
Update links in readme
Quality #2101: Pull request #527 opened by jaysonfrancis
January 30, 2025 21:36 Action required jaysonfrancis:main
January 30, 2025 21:36 Action required
Humanity's last exam
Quality #2100: Pull request #520 synchronize by clefourrier
January 30, 2025 18:53 2m 33s clem_last_exam
January 30, 2025 18:53 2m 33s
adds olympiad bench
Quality #2098: Pull request #521 synchronize by NathanHB
January 30, 2025 13:22 2m 3s nathan-adds-olympiad-bench
January 30, 2025 13:22 2m 3s
Improve readability of the quick tour. (#501)
Quality #2097: Commit 515bd01 pushed by clefourrier
January 30, 2025 13:11 2m 7s main
January 30, 2025 13:11 2m 7s
Add Doc Strings to Config Files
Quality #2096: Pull request #465 synchronize by ParagEkbote
January 30, 2025 12:23 Action required ParagEkbote:Document-Custom-Model-Files
January 30, 2025 12:23 Action required
Implemented the possibility to load predictions from details files an…
Quality #2094: Commit 94fc5a2 pushed by NathanHB
January 29, 2025 14:59 2m 12s main
January 29, 2025 14:59 2m 12s
Humanity's last exam
Quality #2093: Pull request #520 synchronize by clefourrier
January 29, 2025 14:20 2m 13s clem_last_exam
January 29, 2025 14:20 2m 13s
Humanity's last exam
Quality #2092: Pull request #520 synchronize by clefourrier
January 29, 2025 14:17 2m 9s clem_last_exam
January 29, 2025 14:17 2m 9s