Test data format #44

Janai2019 · 2019-11-25T00:34:56Z

X_test = [sent2features(s) for s in test_sents]
Looking at the format of the test data, it seems to require a tagged test data to extract features especially, current tag. In reality, the purpose is to tag new data where such information is not present except word features. How do we tag new data?

mani2106 · 2020-02-20T05:29:52Z

Normally you would tag a set of sentences and split them to train and test/eval sets.
To ensure that the model does not overfit (memorize) the training data. We predict with the test data and calculate the scores/metrics and decide whether it is suitable for real-world data.

This is what the example in the documentation does.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test data format #44

Test data format #44

Janai2019 commented Nov 25, 2019

mani2106 commented Feb 20, 2020

Test data format #44

Test data format #44

Comments

Janai2019 commented Nov 25, 2019

mani2106 commented Feb 20, 2020