You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For each word in the text retrieve clusters from ElasticSearch it belongs to. Assign category to the word. Example of the output format:
Darmstadt CITY NamedEntityInText
is
a
nice
city. CITY NamedEntityInText
John NAME NamedEntityInText
Smith SURNAME NamedEntityInText
is
a
well-known
layer.
For each occurrence of the tag in the text, manually count precision as the number of correct tags vs the number of all tags.
The text was updated successfully, but these errors were encountered:
Motivation
A preliminary evaluation of quality of clustering based on named entity recognition task. Deadline -- 16 of december.
Implementation
Select manually from the results clusters that correspond to
To select clusters look for keywords that are unambigous e.g. Pepsi or Javascript or Robert.
Create an ElasticSearch index with all these clusters. Add as an attribute corresponding category. Each category can have several attributes.
Download the texts here (the xml files reuters.xml and 500news.xml) https://github.com/AKSW/n3-collection
Parse the xml files to get the plain text.
For each word in the text retrieve clusters from ElasticSearch it belongs to. Assign category to the word. Example of the output format:
For each occurrence of the tag in the text, manually count precision as the number of correct tags vs the number of all tags.
The text was updated successfully, but these errors were encountered: