forked from cltl/pepper
-
Notifications
You must be signed in to change notification settings - Fork 2
2.3 Structured data
Selene Baez edited this page May 25, 2020
·
2 revisions
- Entity/Predicate types (subject and objects): In general, recognized/parsed
types
are the ones present in the ontology. However, it is also possible to have unknown types, in which case these types will be created in the ontology. - Case folding: All fields (type, label, author, and attribution) should be lowercase, except for
raw
. Snake case (replacing spaces with underscores) should be also followed when needed. - Positions: It is necessary to keep track of where in the raw input was the subject/predicate/object mentioned. As such, for an utterance as 'Piek likes pizza' should return
positions
like '0-3', '5-9' and '11-15' respectively. - Entity URI: Use full valid URIs for IDs (i.e. http://cltl.nl/leolani/world/piek)