Skip to content

Latest commit

 

History

History
32 lines (23 loc) · 2.08 KB

README.md

File metadata and controls

32 lines (23 loc) · 2.08 KB

DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval (Dossier Team)

Sophia Althammer, Arian Askari, Suzan Verberne, Allan Hanbury

This repository contains the code for the Dossier participation in the COLIEE 2021 competition "DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval", which is the third team at COLIEE 2021 for case law retrieval (task 1), and fifth team in Legal Case Entailment (task 2).

Please cite our work as follows

@inproceedings{althammer2021dossiercoliee,
      title={DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization re-ranking for case law retrieval}, 
      author={Sophia Althammer and Arian Askari and Suzan Verberne and Allan Hanbury},
      year={2021},
      booktitle={Proceedings of the Eigth International Competition on Legal Information Extraction/Entailment (COLIEE 2021) in association with the 18th International Conference on Artificial Intelligence and Law},
}

Structure

The repository is structured as follows:

  • dpr/: The Dense Passage Retrieval (DPR) implementation which is based on the facebook research DPR repository (github).
  • cedr/: Vanilla Bert implementation which is based on the CEDR: Contextualized Embeddings for Document Ranking implementation (github).
  • summarizer/ contains the notebook for generating summary for coliee'21 caselaws using Longformer Encoder-Decorder (LED) model
  • runs/: all the final submitted run files, containing noticed cases ranked by (bm25), and proposed method (bm25+lawdpr), vanilla bert for task1.

Data

Please visit COLIEE 2021 to apply for the whole dataset.

Please email [email protected] for the checkpoint of LED summarizer (fine-tuned on COLIEE'18 summaries).

Contact

If you have any questions, please email [email protected] or [email protected].