convert conll dataset format #78

ghost · 2019-10-31T10:17:58Z

Hi
I really appreciate if you could assist me with this quesiton, I would like to convert the conll dataset format to NLI dataset format, in whcih one has one sentence, and replace the pronoun with each of the two antecedent, and then the correct one is entailment label and incorrect one is contradiction. I have two questions:

which information in the conll dataset your code uses? Do you also use cluster information and speaker id? I am really confused by all of these extra information and not sure if this is a part of your method.
I really appreciate to tell me how I can convert the conll dataset to the NLI format, is there any codes for this?
if one train the conll dataset like NLI with BERT model, do you think the performance could possibly suffer? I am wondering which extra information your code uses and if they have an impact?
thanks.

henryhust · 2019-12-06T05:31:03Z

The first question I can anwser you , It uses clusters, speakers, genres as features, but the speakers and genres is not necessary.

henryhust · 2020-06-24T01:18:56Z

The second question maybe solved by https://zhuanlan.zhihu.com/p/121786025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert conll dataset format #78

convert conll dataset format #78

ghost commented Oct 31, 2019 •

edited by ghost

Loading

henryhust commented Dec 6, 2019

henryhust commented Jun 24, 2020

convert conll dataset format #78

convert conll dataset format #78

Comments

ghost commented Oct 31, 2019 • edited by ghost Loading

henryhust commented Dec 6, 2019

henryhust commented Jun 24, 2020

ghost commented Oct 31, 2019 •

edited by ghost

Loading