Big question: BERT ranking needs triples to train, but... #7

Closed
guotong1988 opened this issue Aug 7, 2019 · 4 comments

guotong1988 (Contributor) commented Aug 7, 2019

https://msmarco.blob.core.windows.net/msmarcoranking/collectionandqueries.tar.gz
The data above does not contain the negative documents.

@rodrigonogueira4 Thank you!!!

The data is from the readme here: https://github.com/nyu-dl/dl4ir-doc2query#ms-marco

rodrigonogueira4 (Collaborator) commented

To train the seq2seq model you only need pairs of queries and relevant documents, so you don't need negatives.
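
For illustration, here is a minimal sketch of building those pairs. It is a hypothetical example, not this repo's actual preprocessing; the file names and column layouts are assumed from the MS MARCO release linked above (collection.tsv: pid<TAB>passage, queries.train.tsv: qid<TAB>query, qrels.train.tsv: qid 0 pid 1), so verify them against your download:

```python
# Hypothetical sketch: build (passage, query) pairs for seq2seq training.
# File names and formats are assumed from the MS MARCO passage-ranking release.

def load_tsv(path):
    """Map id -> text for a two-column TSV file."""
    mapping = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            key, text = line.rstrip("\n").split("\t", 1)
            mapping[key] = text
    return mapping

queries = load_tsv("queries.train.tsv")  # qid -> query text
passages = load_tsv("collection.tsv")    # pid -> passage text

# Each qrels row links a query to one relevant passage; that pair alone is
# a seq2seq training example (input = passage, target = query). No negatives.
with open("qrels.train.tsv", encoding="utf-8") as qrels, \
        open("doc_query_pairs.train.tsv", "w", encoding="utf-8") as out:
    for line in qrels:
        qid, _, pid, _ = line.split()
        out.write(f"{passages[pid]}\t{queries[qid]}\n")
```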

Please post doc2query-related questions in that repository:
https://github.com/nyu-dl/dl4ir-doc2query

guotong1988 (Author) commented

But the BERT ranking step needs negative documents.

rodrigonogueira4 (Collaborator) commented

Please note that this repository only contains the code to train the doc2query (seq2seq) model. If you want to train the BERT re-ranker, please follow the steps in https://github.com/nyu-dl/dl4marco-bert.

Also, please note that training the BERT re-ranker on the expanded documents did not give better results than training on the non-expanded (original) documents. That is, you can use the trained model from https://github.com/nyu-dl/dl4marco-bert to re-rank the expanded documents.
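
As background on where the negatives come from: the MS MARCO ranking release also provides pre-built training triples (query, relevant passage, non-relevant passage), so you typically do not need to build them yourself. Purely as a format illustration, here is a hypothetical sketch of constructing triples by sampling random negatives; file names and formats are assumed as in the pair-building sketch above:

```python
# Hypothetical sketch: build (query, positive, negative) triples for
# re-ranker training by pairing each relevant passage with a randomly
# sampled one. Assumed MS MARCO TSV formats, as in the sketch above.
import random

def load_tsv(path):
    """Map id -> text for a two-column TSV file."""
    mapping = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            key, text = line.rstrip("\n").split("\t", 1)
            mapping[key] = text
    return mapping

queries = load_tsv("queries.train.tsv")  # qid -> query text
passages = load_tsv("collection.tsv")    # pid -> passage text
pids = list(passages)

with open("qrels.train.tsv", encoding="utf-8") as qrels, \
        open("triples.train.tsv", "w", encoding="utf-8") as out:
    for line in qrels:
        qid, _, pos_pid, _ = line.split()
        neg_pid = random.choice(pids)
        while neg_pid == pos_pid:  # never use the positive as its own negative
            neg_pid = random.choice(pids)
        out.write(f"{queries[qid]}\t{passages[pos_pid]}\t{passages[neg_pid]}\n")
```

Random negatives like these are much easier for the model than the negatives typically used in practice (e.g., non-relevant passages retrieved by BM25), so treat this only as an illustration of the triple format.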

guotong1988 (Author) commented

Thank you.
