UnsupervisedBootAL

Unsupervised Bootstrapping of Active Learning for Entity Resolution

Code-Dataset-Results for Paper

Primpeli, Anna, Christian Bizer, and Margret Keuper. "Unsupervised bootstrapping of active learning for entity resolution." European Semantic Web Conference. Springer, Cham, 2020.

Code: Use the Thresholding_Comparison notebook to run the comparison of the different thresholding methods: Elbow Point, Static, Otsu's, Adjusted Valley

Use the AL_Comparison notebook to run our proposed method 'boot' which uses unsupervised matching to bootstrap active learning. In the same notebook you can set on/off the evaluation of the two bsaeline methods of our paper: no_boot, no_boot_warm

Use the Plotting notebook for visualization and plotting of the results

Datasets: In the datasets folder you can find all data sets used for experimenation:

abt_buy
amazon_google
author/DBPediaAuthors_DnbDataAuthors
author/DBPediaAuthors_VIAFDataAuthors
wdc_product/headphones_headphones_catalog
wdc_product/phones_phones_catalog

For every dataset pair we provide the initial datasets, feature vector files and files including matching labels for the train and test sets.

Results: In the results folder you can find all result files for Active Learning methods comparison as presented in the evaluation of our paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

UnsupervisedBootAL

Files

README.md

Latest commit

History

README.md

File metadata and controls

UnsupervisedBootAL