-
Notifications
You must be signed in to change notification settings - Fork 4
NIZO
#Gut-commensals with a health benefit#
##Use case description# Goal: Identify novel relationships between gut-commensals and health benefits#
Workflow: the query will start with a list of bacteria to find direct and indirect relations with a list of health benefits. Intermediate concepts relevant to this search: other bacteria and/or metabolites/food components. Possible extensions: bacterial metabolic pathways, human physiological pathways and genes.
Search result output: clickable HTML pages; interactive graph.
Additional functionality: an automated search and reporting system generating state-of-the-art overviews on a regular basis supplemented with an alerting system of new findings.
##Pilot workflow# A pilot workflow (old EKP API) was generated to search for relations between 10 commensal and 6 health benefit terms. These input terms were automatically mapped to concepts in EKP and the workflow resulted in two direct relations. For comparison, co-occurrence text-mining revealed 16 relations between 10 commensals and 4 (out of the 6) health benefit terms in Medline abstracts. Possible explanations for the different result:
- automatic mapping of an input term to a concept in EKP is matching 30-80%.
- UMLS is focussed on human disease rather than health benefits.
- UMLS has concepts for bacteria at species level and only few? at strain level.
- Semantic Medline, the data source of these relations, is a repository of semantic predications (subject-predicate-object triples) extracted from Medline. Semantic Medline does not contain bacterium - health benefit relations, as extraction is restricted to specific concept-relation-concept patterns.
- the current version of the Euretos Knowledge Platform is biased to data related to human disease and pharmacological treatment.
To identify indirect interactions, compounds (by semantic category?) or human diseases (by semantic category?) were added to the workflow as intermediate concepts. In addition, these concepts should have a label/attribute 'gut' OR 'intestines'. This search produced more relations, however, most compounds identified were very general, like 'DNA', 'glucose' and 'growth factor'. None of the indirect relations analyzed made sense. For example, E.coli was used as a vehicle to produce a certain growth factor in one abstract and linked by the concept 'growth factor' to a relation of a different growth factor with a certain health benefit in another abstract.
Based on the findings above, the term lists were supplemented with manually assigned UMLS CUIs and semantic types. Furthermore, additional manually curated data sources were identified for ingestion in EKP to include data more relevant to the use case.
ODEX4all