Feature - New Metrics #335

Merged on Feb 6, 2025 (21 commits)

Commits
04cd514
Update model methods to run background task when creating the endpoin…
shincap8 Jan 17, 2025
d86ae16
Handle error from Runpod, URL inference from yaml file
shincap8 Jan 17, 2025
8794693
Update backend/app/api/endpoints/base/score.py
shincap8 Jan 17, 2025
fc08634
Rename method
shincap8 Jan 17, 2025
aa0f022
Merge with cloud branch
shincap8 Jan 17, 2025
971b986
rename endpoint
shincap8 Jan 17, 2025
476d4e4
Print background tasks and correct typo
shincap8 Jan 21, 2025
9ed5f20
resolve merge conflict
shincap8 Jan 21, 2025
326acbb
Merge branch 'main' of https://github.com/mlcommons/dynabench into Fe…
shincap8 Jan 22, 2025
68f126e
Correct typos and correct schemas
shincap8 Jan 22, 2025
c2638b9
model uid
shincap8 Jan 22, 2025
605d8fb
merge with main
shincap8 Jan 22, 2025
0d74498
remove c
shincap8 Jan 22, 2025
9c6f088
Merge branch 'main' of https://github.com/mlcommons/dynabench into Fe…
shincap8 Jan 22, 2025
ac0b90b
Merge branch 'main' of https://github.com/mlcommons/dynabench into Fe…
shincap8 Jan 22, 2025
bbb2695
New metrics for MLSuperb challenge
shincap8 Jan 22, 2025
6b0f1c6
Correct pretty name
shincap8 Jan 22, 2025
de489f5
Merge branch 'main' of https://github.com/mlcommons/dynabench into Fe…
shincap8 Jan 28, 2025
2cce5d4
replace standard accuracy for standard LID Accuracy, and dialect accu…
shincap8 Jan 28, 2025
142c265
Merge branch 'main' of https://github.com/mlcommons/dynabench into Fe…
shincap8 Feb 6, 2025
1014401
change metric name to show in front end
shincap8 Feb 6, 2025
2 changes: 1 addition & 1 deletion api/evaluation/metrics/metrics.py
@@ -175,7 +175,7 @@ def get_STD_CER_meta(task=None):
 def get_CER_15_WORSE_meta(task=None):
     return {
         "unit": "%",
-        "pretty_name": "CER 15 WORSE",
+        "pretty_name": "CER 15 WORST",
         "utility_direction": -1,
         "offset": 0,
     }
@@ -205,7 +205,7 @@ def get_STD_CER_meta(task=None):
 def get_CER_15_WORSE_meta(task=None):
     return {
         "unit": "%",
-        "pretty_name": "CER_15_WORSE",
+        "pretty_name": "CER_15_WORST",
         "utility_direction": -1,
         "offset": 0,
     }
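
Note on the change: both hunks only touch the `pretty_name` display label; the function name `get_CER_15_WORSE_meta` is unchanged, so anything that looks the metric up by function name should be unaffected. A minimal sketch of how the label might reach the front end, assuming a hypothetical `format_metric` helper that is not part of this PR:

```python
def get_CER_15_WORSE_meta(task=None):
    # Post-change version of the first hunk in the diff above.
    return {
        "unit": "%",
        "pretty_name": "CER 15 WORST",
        "utility_direction": -1,
        "offset": 0,
    }


def format_metric(meta, value):
    # Hypothetical helper (an assumption, not code from this PR):
    # builds a display string from the metadata dict.
    return f'{meta["pretty_name"]}: {value:.1f}{meta["unit"]}'


# The function name still says WORSE; only the rendered label changes.
label = format_metric(get_CER_15_WORSE_meta(), 12.345)
print(label)
```

The actual rendering path in the repository may differ; the sketch only illustrates that renaming the label does not require renaming the function.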