Releases: embeddings-benchmark/mteb
1.29.1
1.29.0
1.29.0 (2025-01-13)
Ci
-
ci: fix model loading test (#1775)
-
pass base branch into the make command as an arg
-
test a file that has custom wrapper
-
what about overview
-
just dont check overview
-
revert instance check
-
explicitly omit overview and init
-
remove test change
-
try on a lot of models
-
revert test model file
Co-authored-by: Isaac Chung <[email protected]> (9b117a8
)
Feature
-
feat: Update task filtering, fixing bug which included cross-lingual tasks in overly many benchmarks (#1787)
-
feat: Update task filtering, fixing bug on MTEB
- Updated task filtering adding exclusive_language_filter and hf_subset
- fix bug in MTEB where cross-lingual splits were included
- added missing language filtering to MTEB(europe, beta) and MTEB(indic, beta)
The following code outlines the problems:
import mteb
from mteb.benchmarks import MTEB_ENG_CLASSIC
task = [t for t in MTEB_ENG_CLASSIC.tasks if t.metadata.name == "STS22"][0]
# was eq. to:
task = mteb.get_task("STS22", languages=["eng"])
task.hf_subsets
# correct filtering to English datasets:
# ['en', 'de-en', 'es-en', 'pl-en', 'zh-en']
# However it should be:
# ['en']
# with the changes it is:
task = [t for t in MTEB_ENG_CLASSIC.tasks if t.metadata.name == "STS22"][0]
task.hf_subsets
# ['en']
# eq. to
task = mteb.get_task("STS22", hf_subsets=["en"])
# which you can also obtain using the exclusive_language_filter (though not if there was multiple english splits):
task = mteb.get_task("STS22", languages=["eng"], exclusive_language_filter=True)
-
format
-
remove "en-ext" from AmazonCounterfactualClassification
-
fixed mteb(deu)
-
fix: simplify in a few areas (
4a70e5d
)
1.28.7
1.28.6
1.28.5
1.28.4
1.28.4 (2025-01-10)
Fix
-
fix: fixes implementation of similarity() (#1748)
-
fix(#1594): fixes implementation of similarity()
-
fix: add similarity to SentenceTransformerWrapper
Co-authored-by: sam021313 <[email protected]> (3fe9264
)
1.28.3
1.28.2
1.28.1
1.28.1 (2025-01-10)
Fix
-
fix: Leaderboard Speedup (#1745)
-
Added get_scores_fast
-
Made leaderboard faster with smarter dependency graph and event management and caching
-
Changed print to logger.info (
9eff8ca
)
Test
-
test: Add script to test model loading below n_parameters threshold (#1698)
-
add model loading test for models below 2B params
-
add failure message to include model namne
-
use the real get_model_meta
-
use cache folder
-
teardown per function
-
fix directory removal
-
write to file
-
wip loading from before
-
wip
-
Rename model_loading_testing.py to model_loading.py
-
Delete tests/test_models/test_model_loading.py
-
checks for models below 2B
-
try not using cache folder
-
update script with scan_cache_dir and add args
-
add github CI: detect changed model files and run model loading test
-
install all model dependencies
-
dependecy installations and move file location
-
should trigger a model load test in CI
-
find correct commit for diff
-
explicity fetch base branch
-
add make command
-
try to run in python instead and add pytest
-
fix attribute error and add read mode
-
separate script calling
-
let pip install be cached and specify repo path
-
check ancestry
-
add cache and rebase
-
try to merge instead of rebase
-
try without merge base
-
check if file exists first
-
Apply suggestions from code review
Co-authored-by: Kenneth Enevoldsen <[email protected]> -
Update .github/workflows/model_loading.yml
Co-authored-by: Kenneth Enevoldsen <[email protected]> -
address review comments to run test once from CI and not pytest
Co-authored-by: Kenneth Enevoldsen <[email protected]> (8d033f3
)