CRF Output #6

prakhar21 · 2016-07-15T12:16:56Z

Hi, I am not able to understand to what does these tab separated fields mean.

1            I1      L8      NoCAP  NoPAREN  B-QTY
cup          I2      L8      NoCAP  NoPAREN  B-UNIT
white        I3      L8      NoCAP  NoPAREN  B-NAME
wine         I4      L8      NoCAP  NoPAREN  I-NAME

Please, help me out.

Thanks

The text was updated successfully, but these errors were encountered:

ericagreene · 2016-07-19T19:34:48Z

@prakhar21 Those are a list of the tokens (words) and the associated features. The associated code is here. The on the right is the tag that we're trying to predict.

Does that answer your question?

prakhar21 · 2016-07-26T06:39:17Z

@ericagreene Thanks, that answers my question. There is one more thing that, I wanted to clarify.
When I am training on all 180k data and then using my own dataset as validation then, why is it like the predictions that it made with 20k data model are more accurate compared to 180k data model. This is against model training principles. My understanding says, more data is always good for training purpose. Please, share your thoughts on this.

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CRF Output #6

CRF Output #6

prakhar21 commented Jul 15, 2016

ericagreene commented Jul 19, 2016

prakhar21 commented Jul 26, 2016

CRF Output #6

CRF Output #6

Comments

prakhar21 commented Jul 15, 2016

ericagreene commented Jul 19, 2016

prakhar21 commented Jul 26, 2016