Skip to content

Python package to manage a variety of pandem sources

License

Notifications You must be signed in to change notification settings

forchard/pandem-source

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PANDEM - Source

Integrate heterogeneous publuc health surveillance data sources into a coherent data base for PANDEM-2 project.

Description

Identify, map and integrate multiple pandemic related data into a coherent pandemic-management database. Developed within the H2020 project PANDEM-2, Pandem-Source allows users to systematically capture, standardize and analyze data coming from international and national surveillance databases, participatory surveillance projects, social networks and mass media. This tool is focused on flexibility so adding new sources or variables can be easily done as is required during a pandemic episode.

Target users

Public health data experts and data managers needing integrate several surveillance data.

Approach

Data integration follow a semantic approach. Data sources need to be described using a Data Labelling Schema (DLS) file that specifies the acquisition chennel (URL, git, local file or script) the format and how to map the input data to Pandem-Source variables. If further customistions are needed, the user can define them using simple python scripts.

The DLS ensures each source and variable is properly documented knwing its origin, meaning and data quality.

Supported sources

Pandem-Source is designed to be flexible and extensible so new sources can be easily added by end users. In order to demontstrate this principle. The following sources are supported Out of the Box

Sources for indicators

Sources for referentials

User Interface - timeseries explorer

Time series

User Interface - Integration dashboard

Integration dashboard

Requirements

  • Python 3.7 or higher
  • R 3.6.3 or higher
  • Docker (optional for supporting machine learning article classification)

Installing from pip

If you want to customize the installation folder (defaulted to ~/.pandemsource) you need to set the environment variable PANDEM_HOME to a different folder

pip install pandem-source

python -m pandemsource setup --install

Installing for contributors

If you want to customize the installation folder (defaulted to ~/.pandemsource) you need to set the environment variable PANDEM_HOME to a different folder

git clone https://github.com/pandem2/pandem-source

cd pandem-source

make init

make install

source env/bin/activate

python -m pandemsource setup --install

Defining sources o monitor

PANDEM-Source comes with a list of predefined sources it can monitor, but they are not all enabled by default. In order to use them them you have to manually activate them with the 'setup' command.

See all avaiable sources

python -m pandemsource setup -h

Activating a particular source

# Activating ECDC covid19 dataset monitoring
python -m pandemsource setup --ecdc-covid19

# Activating preloaded 2023 functional excercise data
python -m pandemsource setup --pandem-2-2023-fx

Running Pandem-Source

  • Set the PANDEM_HOME variable to a local foder (only if you have customized the default installation folder)
export PANDEM\_HOME=your data folder here 
  • Running monitoring and dashboard
python -m pandemsource start -d 

Troubleshooting

If the command python -m pandemsource start -d --no-nlp does not work. Please make sure you've installed R 3.6.3 or higher. If R is properly installed, please make sure you've also installed required dependencies with:

(In your terminal)

sudo apt install libxml2-dev libsodium-dev libssl-dev libcurl4-openssl-dev libgdal-dev libfontconfig1-dev libharfbuzz-dev libfribidi-dev 

(In the R interpreter)

install.packages(c("epitweetr", "dplyr", "shiny", "plotly", "DT", "jsonlite", "httr", "XML", "ggplot2", "epitweetr", "reticulate", "seqinr", "readr"))

PANDEM 2

PANDEM-2 is a H2020 EU-funded project that aims to develop new solutions for efficient, EU-wide pandemic management. The goal of PANDEM-2 is to prepare Europe for future pandemics through innovations in training and to build capacity between EU member states responding to pandemics on a cross-border basis.

About

Python package to manage a variety of pandem sources

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 77.7%
  • R 22.0%
  • Other 0.3%