Actions: huggingface/lighteval

Tests

1,813 workflow runs

Humanity's last exam
Tests #2092: Pull request #520 synchronize by clefourrier
January 29, 2025 14:20 38m 18s clem_last_exam
Humanity's last exam
Tests #2091: Pull request #520 synchronize by clefourrier
January 29, 2025 14:17 40m 14s clem_last_exam
Humanity's last exam
Tests #2090: Pull request #520 synchronize by clefourrier
January 29, 2025 09:58 39m 29s clem_last_exam
add missing inits (#524)
Tests #2088: Commit 48d0c28 pushed by clefourrier
January 29, 2025 07:16 39m 54s main
Adds missing inits to prevent import errors
Tests #2087: Pull request #524 opened by hynky1999
January 29, 2025 00:36 39m 27s fix_inits
adds olympiad bench
Tests #2085: Pull request #521 synchronize by NathanHB
January 28, 2025 16:39 39m 39s nathan-adds-olympiad-bench
adds olympiad bench
Tests #2084: Pull request #521 synchronize by NathanHB
January 28, 2025 16:20 39m 44s nathan-adds-olympiad-bench
Humanity's last exam
Tests #2083: Pull request #520 synchronize by clefourrier
January 28, 2025 15:50 38m 47s clem_last_exam
Math extraction - allow only trying the first match, more customizabl…
Tests #2082: Commit 0e46269 pushed by hynky1999
January 28, 2025 12:57 42m 9s main
adds olympiad bench
Tests #2077: Pull request #521 opened by NathanHB
January 28, 2025 07:26 38m 4s nathan-adds-olympiad-bench
Humanity's last exam
Tests #2076: Pull request #520 opened by clefourrier
January 27, 2025 13:59 38m 52s clem_last_exam
Add swiss legal evals as new community tasks
Tests #2075: Pull request #389 synchronize by JoelNiklaus
January 27, 2025 13:21 Action required JoelNiklaus:add_swiss_legal_evals
Pass@k
Tests #2073: Pull request #519 synchronize by clefourrier
January 27, 2025 09:02 38m 30s clem_pass_at_k
Pass@k
Tests #2072: Pull request #519 synchronize by clefourrier
January 27, 2025 08:55 2m 59s clem_pass_at_k
Pass@k
Tests #2071: Pull request #519 synchronize by clefourrier
January 27, 2025 08:50 3m 0s clem_pass_at_k
Pass@k
Tests #2070: Pull request #519 opened by clefourrier
January 27, 2025 08:44 3m 1s clem_pass_at_k
Fixing commonsense qa: generative metrics, -1 gen length (#517)
Tests #2068: Commit cb075a5 pushed by clefourrier
January 26, 2025 17:18 38m 42s main
Fixing commonsense qa: generative metrics, -1 gen length
Tests #2067: Pull request #517 opened by clefourrier
January 26, 2025 12:57 38m 15s clefourrier-patch-3
Fix Ukrainian indices and confirmation word (#516)
Tests #2066: Commit 499cc82 pushed by clefourrier
January 26, 2025 11:04 40m 22s main
Fix Ukrainian indices and confirmation word
Tests #2065: Pull request #516 opened by ayukh
January 25, 2025 18:30 38m 22s ayukh:main
Fixed bug of import url_to_fs from fsspec (#507) (#512)
Tests #2063: Commit 4f381b3 pushed by clefourrier
January 24, 2025 10:37 39m 57s main