Skip to content

aueb-wim/DataQualityControlTool

Repository files navigation

AUEB Codacy Badge License Coverage Status

HBP-MIP Data Quality Control Tool

Description

MIP DQCT logo

MIP Data Quality Control Tool (MIP-DQC Tool) is a software developed for the Human Brain Project Medical Informatics Platform (HBP-MIP) and it main perpose is to provide hospital personel an easy way to explore, validate and transform their data before uploading them into the MIP. MIP-DQC Tool has both, a Command Line Interface (CLI) and a Graphical User Interface (GUI) but only the latter one has the full set of tool's functionalities.

MIP-DQC Tool GUI version has the following functionalities:

  1. Validating the hospital tabular (csv) data and producing report with validation results and some overall statistics about the data.
  2. Data cleaning capability based on the previously performed validation results.
  3. Inference of a dataset's schema and producing a schema file in Frictionless json or MIP Data Catalogue's excel format.
  4. Designing and performing schema mapping of an incoming hospital dataset to a certain Pathology's Common Data Element (CDE) schema.
  5. Producing DICOM MRIs validation and statistical report based on their meta-data headers.

The tabular (csv) data validation functionality has the option of downloading pathologies CDE metadata directly from the MIP Data Catalogue's API. Please note, this option is not available in the CLI version.

The schema mapping functionality is performed by the MIPMap engine packaged in a Docker container which runs in the background. Please note, this option is not available in the CLI version.

Suported OS

  • Linux (tested for Ubuntu)
  • Windows 10 with WLS 2
  • MacOS

Please refer to the Installation Wiki Section for further details.

Versioning

We use SemVer for versioning.

Authors

  • Iosif Spartalis - AUEB/RC Data Science Team @iosifsp

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details

Acknowledgements

This project/research received funding from the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Framework Partnership Agreement No. 650003 (HBP FPA) as part of SP8 of the Human Brain Project (SGA2), and under the Specific Grant Agreement No. 945539 (Human Brain Project SGA3).

The app logo is based on a design by the user Eucalyp, from Flaticon website

Special thanks to:

  • Prof. Vasilis Vassalos - Athens University of Economics and Business
  • Kostis Karozos - AUEB/RC Data Science Team, Ph.D candidate @Kostis-K
  • Abu-Nawwas Laith - CHUV @lanawwas
  • Jacek Manthey - CHUV