DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval (Dossier Team)
Sophia Althammer, Arian Askari, Suzan Verberne, Allan Hanbury
This repository contains the code for the DoSSIER participation in the COLIEE 2021 competition, described in "DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval". The team placed third in case law retrieval (Task 1) and fifth in legal case entailment (Task 2) at COLIEE 2021.
Please cite our work as follows:
@inproceedings{althammer2021dossiercoliee,
title={DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval},
author={Sophia Althammer and Arian Askari and Suzan Verberne and Allan Hanbury},
year={2021},
booktitle={Proceedings of the Eighth International Competition on Legal Information Extraction/Entailment (COLIEE 2021) in association with the 18th International Conference on Artificial Intelligence and Law},
}
The repository is structured as follows:
dpr/
: The Dense Passage Retrieval (DPR) implementation, based on the Facebook Research DPR repository (github).
cedr/
: The vanilla BERT implementation, based on the CEDR: Contextualized Embeddings for Document Ranking implementation (github).
summarizer/
: The notebook for generating summaries of the COLIEE'21 case laws using the Longformer Encoder-Decoder (LED) model.
runs/
: All final submitted run files for Task 1, containing noticed cases ranked by BM25 (bm25), by the proposed method (bm25+lawdpr), and by vanilla BERT.
Please visit COLIEE 2021 to apply for the whole dataset.
Please email [email protected] for the checkpoint of the LED summarizer (fine-tuned on COLIEE'18 summaries).
If you have any questions, please email [email protected] or [email protected].