diff --git a/content/03.07.data-portals.md b/content/03.07.data-portals.md index 57d14e4..c1115c9 100644 --- a/content/03.07.data-portals.md +++ b/content/03.07.data-portals.md @@ -9,7 +9,9 @@ FAIDARE () is a data discovery portal providing a biologist friendly search system over a global federation of 33 plant research databases. It allows to identify data resources using a full text approach completed with domain specific filters and to link back to the original database for visualization, analysis and download. For instance, it is possible to search for "wheat drought" then to refine the search to the "Triticum aestivum" taxon and yield component traits such as "Thousand Grain Weight". The indexed data types are very broad and include genomic features, such as genes or transposable elements, selected bibliography, QTL, markers, genetic variation studies, phenomic studies and plant genetic resources ie germplasm. This inclusiveness is achieved thanks to a two stage indexation data model. The most generic one provides basic search functionalities and relies on five fields : name, link back URL, data type, species and exhaustive description. The filtering is directly tied to some of those fields. Therefore, to provide more advanced filtering, FAIDARE is also providing a second stage indexation mechanism by taking advantage of BrAPi endpoints to get more detailed metadata on genotyping and phenotyping studies as well as germplasm. In parallel, FAIDARE provides a pre-visualization of germplasm and studies using dedicated cards. -The indexation mechanism relies on a dedicated public software () that allows data resources manager to request the indexation of there database using pull requests. It is able to extract data from any BrAPI 1.3 and 1.2 endpoint and development of BrAPI 2.x indexation will be initiated in 2025. Since not all databases are willing to implement BrAPI endpoints, we also provide the possibility to generate metadata as BrAPI json files, hence using the standard as a file exchange format. +![Figure FAIDARE Federation](images/Schema_FAIDARE.png){#fig:Schema_FAIDARE width="100%"} +The indexation mechanism relies on a dedicated public software () that allows data resources manager to request the indexation of there database using pull requests. This BrAPI client is able to extract data from any BrAPI 1.3 and 1.2 endpoint and development of BrAPI 2.x indexation will be initiated in 2025. Since not all databases are willing to implement BrAPI endpoints, we also provide the possibility to generate metadata as BrAPI json files, hence using the standard as a file exchange format. +FAIDARE architecture has been designed by elaborating on the GnpIS Software Architecture [@doi:10.34133/2019/1671403]. As a consequence, BrAPI is at the core of its datamodel, and in particular the JSON data files served by the Elasticsearch NoSQL engine are enriched version of the BrAPI JSON files. FAIDARE also includes a BrAPI endpoint that serves all indexed metadata. FAIDARE has been adopted by several communities and in particular in the ELIXIR and EMPHASIS european infrastructures. It is also used by the WheatIS of the Wheat-Initiative. Several databases are added each year to the FAIDARE global federation, allowing to increase both the portal and the BrAPI adoption. #### Phenospex - HortControl @@ -27,9 +29,10 @@ The Scientific Advisory Committee of the International Treaty and the Governing #### FLORILÈGE (Gateway to French Plant Genetic Resources) -Designed primarily for the general public, Florilège provides access to all French plant biological resources centers. Its interface allows individuals to browse available plant accessions and gives them the possibility to order them. The listed accessions originate from 19 resources centers and concern around fifty plant species. +Designed primarily for the general public, Florilège provides access to public collections of all French plant biological resources centers. This web portal allows to browse available plant genetic resource accessions and gives the possibility to order some seeds or plant material for cultivation. It includes plant genetic resources of around fifty plant genus from 19 genebanks. -Florilège retrieves accession information from different BrAPI-compliant systems such as OLGA, an internal accessions management system, or FAIDARE. Leveraging the BrAPI implementation of these systems ensures standardized data retrieval from multiple sources, making the integration of new data sources that implement BrAPI an effortless process. The implementation of BrAPI is a prerequisite for the integration of any new database in Florilège. +Florilège retrieves accession information from different BrAPI-compliant systems. They include OLGA, a genebank accessions information management system, and GnpIS[@doi:10.34133/2019/1671403] [@doi:10.1007/978-1-4939-6658-5_5], an INRAE data repository for plant genetic resources, phenomics and genetics. Using BrAPI to gather data from these systems reduced the efforts and enabled standardized data retrieval. As a consequence, BrAPI is the de facto standard for exchanging data within the French plant genetic resources community. The Florilège team also requested several update of the BrAPi specifications to better serve this use case, such as Collection or improved external references. +![Figure Florilege Workflow](images/Schema_Florilege.jpg){#fig:Schema_Florilege width="100%"} Florilège is developed in Drupal 10, and uses xnttbrapi module (to easily connect to BrAPI compliant external databases). diff --git a/content/images/Schema_FAIDARE.png b/content/images/Schema_FAIDARE.png new file mode 100644 index 0000000..15f3dd8 Binary files /dev/null and b/content/images/Schema_FAIDARE.png differ diff --git a/content/images/Schema_Florilege.jpg b/content/images/Schema_Florilege.jpg new file mode 100644 index 0000000..6c0113e Binary files /dev/null and b/content/images/Schema_Florilege.jpg differ diff --git a/content/metadata.yaml b/content/metadata.yaml index a5cdf44..f0793d2 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -199,7 +199,8 @@ authors: email: erwan.le-floch@inrae.fr initials: ELF affiliations: - - URGI PlantBioinfoPF, INRAE France + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France - name: Jospeh Ruff email: J.Ruff@kew.org initials: JR @@ -208,8 +209,36 @@ authors: - name: Michael Alaux email: michael.alaux@inrae.fr initials: MA + orcid: 0000-0001-9356-4072 + affiliations: + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France + - name: Célia Michotey + email: celia.michotey@inrae.fr + initials: CM + orcid: 0000-0003-1877-1703 + affiliations: + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France + - name: Anne-Francoise Adam-Blondon + email: anne-francoise.adam-blondon@inrae.fr + initials: AFAB + orcid: 0000-0002-3412-9086 affiliations: - - URGI PlantBioinfoPF, INRAE France + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France + - name: Jeremy Destin + email: jeremy.destin@inrae.fr + initials: JD + affiliations: + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France + - name: Maud Marty + email: maud.marty@inrae.fr + initials: MM + affiliations: + - Université Paris-Saclay, INRAE, BioinfOmics, Plant Bioinformatics Facility, Versailles, France + - Université Paris-Saclay, INRAE, URGI, Versailles, France - name: Suman Kumar email: kumar@ipk-gatersleben.de initials: SK