Integrate heterogeneous publuc health surveillance data sources into a coherent data base for PANDEM-2 project.
Identify, map and integrate multiple pandemic related data into a coherent pandemic-management database. Developed within the H2020 project PANDEM-2, Pandem-Source allows users to systematically capture, standardize and analyze data coming from international and national surveillance databases, participatory surveillance projects, social networks and mass media. This tool is focused on flexibility so adding new sources or variables can be easily done as is required during a pandemic episode.
Public health data experts and data managers needing integrate several surveillance data.
Data integration follow a semantic approach. Data sources need to be described using a Data Labelling Schema (DLS) file that specifies the acquisition chennel (URL, git, local file or script) the format and how to map the input data to Pandem-Source variables. If further customistions are needed, the user can define them using simple python scripts.
The DLS ensures each source and variable is properly documented knwing its origin, meaning and data quality.
Pandem-Source is designed to be flexible and extensible so new sources can be easily added by end users. In order to demontstrate this principle. The following sources are supported Out of the Box
- COVID19 Data Hub see source definition
- ECDC Atlas see source definition
- ECDC COVID19 Datasets see source definitions for variants, age group, daily cases, goverments measures, vaccination
- ECDC COVID19 Simulated data see source definition
- Serotracker see source definition
- Open Sky Nerwork see source definition
- Influenza Net see source definition
- MediSys see source definition
- Twitter see source definition
- User provided data. You can use this data template to easily integrate your own real or ficticious datasets.
- Eurostats NUTS see source definition
- Our airports see source definition
- Geonames see source definition
- ICD-10-CM see source definition
- Python 3.7 or higher
- R 3.6.3 or higher
- Docker (optional for supporting machine learning article classification)
If you want to customize the installation folder (defaulted to ~/.pandemsource) you need to set the environment variable PANDEM_HOME to a different folder
pip install pandem-source
python -m pandemsource setup --install
If you want to customize the installation folder (defaulted to ~/.pandemsource) you need to set the environment variable PANDEM_HOME to a different folder
git clone https://github.com/pandem2/pandem-source
cd pandem-source
make init
make install
source env/bin/activate
python -m pandemsource setup --install
PANDEM-Source comes with a list of predefined sources it can monitor, but they are not all enabled by default. In order to use them them you have to manually activate them with the 'setup' command.
python -m pandemsource setup -h
# Activating ECDC covid19 dataset monitoring
python -m pandemsource setup --ecdc-covid19
# Activating preloaded 2023 functional excercise data
python -m pandemsource setup --pandem-2-2023-fx
- Set the PANDEM_HOME variable to a local foder (only if you have customized the default installation folder)
export PANDEM\_HOME=your data folder here
- Running monitoring and dashboard
python -m pandemsource start -d
- Accessing the dashboard from http://localhost:8001 to see the progress and integrated time series
If the command python -m pandemsource start -d --no-nlp
does not work. Please make sure you've installed R 3.6.3 or higher. If R is properly installed, please make sure you've also installed required dependencies with:
(In your terminal)
sudo apt install libxml2-dev libsodium-dev libssl-dev libcurl4-openssl-dev libgdal-dev libfontconfig1-dev libharfbuzz-dev libfribidi-dev
(In the R interpreter)
install.packages(c("epitweetr", "dplyr", "shiny", "plotly", "DT", "jsonlite", "httr", "XML", "ggplot2", "epitweetr", "reticulate", "seqinr", "readr"))
PANDEM-2 is a H2020 EU-funded project that aims to develop new solutions for efficient, EU-wide pandemic management. The goal of PANDEM-2 is to prepare Europe for future pandemics through innovations in training and to build capacity between EU member states responding to pandemics on a cross-border basis.