A simple tool for evaluating genetic risk associated with severity of COVID-19 infection

Geromics/covcheck

Latest commit

cea890d · Jun 16, 2022

What is CovCheck?

The CovCheck analysis computes a personal genome report indicating 'risk of severe symptoms' (CFR) from COVID-19 infection.

The risk analysis is based on published, peer-reviewed studies: https://www.covid19hg.org/publications/

Personal genome data and age (if provided) are read from a simple 'genome file' in JSON format.
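The genome file schema is not documented here, so the following is only a minimal sketch of reading such a file. The field names `age` and `genotypes`, the example values, and the helper `load_individual` are all assumptions made for illustration; the actual format used by CovCheck may differ.

```python
import json

# Hypothetical genome file contents. The field names "age" and "genotypes"
# are assumptions for illustration, not the schema CovCheck actually uses.
EXAMPLE_GENOME_JSON = """
{
  "age": 42,
  "genotypes": {
    "rs12329760": "CT",
    "rs75603675": "CA"
  }
}
"""


def load_individual(text):
    """Parse genome-file text and return (age, genotypes); age may be None."""
    data = json.loads(text)
    age = data.get("age")  # age is optional ("if provided")
    genotypes = data.get("genotypes", {})
    return age, genotypes


age, genotypes = load_individual(EXAMPLE_GENOME_JSON)
print(age, genotypes)
```

Using `dict.get` keeps the optional age from raising an error when it is absent.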

Note that this analysis is a work in progress!

CI/CD

Python package

Installation and Usage

To install, clone the repository from git:

git clone git@github.com:Geromics/covcheck.git

change to the appropriate directory:

cd covcheck

and run the code:

python3 covid_genetic_check.py -h

usage: check.py [-h] [--version] [--verbose] infile [outfile]

Score an individual.

positional arguments:
  infile         JSON format file containing individual data
  outfile        JSON format results file (default: <stdout>)

optional arguments:
  -h, --help     show this help message and exit
  --version      show program's version number and exit
  --verbose, -v

The only formal requirements (requirements.txt) are for testing:

pip install -r requirements.txt

The tests are then run with:

python3 -m pytest -v .

Note that project dependencies are not managed by a high-level tool such as flit, poetry or Pipenv.

Analysis details

The analysis of risk with age is based on data from here:

and has been done with help from:

The genome report is currently based on the analysis in this preprint: https://www.researchsquare.com/article/rs-37798/v1

Notes from the preprint...

##reference=GRCh38.p12
chr21 41480570 rs12329760 C T . . .
chr21 41507982 rs75603675 C A . . .

rs12329760, TMPRSS2(-), V197M, C->T (G->A, V[GTG] -> M[ATG])
C is the risk allele, T is 'protective'

rs75603675, TMPRSS2(-), G008V, C->A (G->T, G[GGT] -> V[GTT])
A is the risk allele, C is 'protective'

Spearman’s correlation with COVID-19 CFR:
ρ = -0.464, P = 0.0157 for V197M C->T (G->A)
ρ = +0.713, P = 0.0018 for G008V C->A (G->T)

The higher the score, the greater your genetic risk of severe
COVID-19 infection.
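As a rough illustration of how a score could follow from the notes above, the sketch below counts risk alleles at the two TMPRSS2 variants. The unweighted count, the two-letter genotype strings, and the function name `risk_allele_count` are assumptions for illustration only; this is not necessarily how CovCheck computes its score.

```python
# Risk alleles as stated in the notes above, using the alleles from the
# VCF-style lines (C/T for rs12329760, C/A for rs75603675).
RISK_ALLELES = {
    "rs12329760": "C",  # V197M: C is the risk allele, T is 'protective'
    "rs75603675": "A",  # G008V: A is the risk allele, C is 'protective'
}


def risk_allele_count(genotypes):
    """Count risk alleles across the known variants (0 to 4 for two SNPs).

    Assumption: an unweighted count, used here only for illustration;
    the actual CovCheck scoring may differ.
    """
    score = 0
    for rsid, risk in RISK_ALLELES.items():
        genotype = genotypes.get(rsid, "")  # missing variants contribute 0
        score += genotype.upper().count(risk)
    return score


print(risk_allele_count({"rs12329760": "CC", "rs75603675": "AA"}))  # 4
print(risk_allele_count({"rs12329760": "TT", "rs75603675": "CC"}))  # 0
```

Under this assumed scheme, an individual homozygous for both risk alleles gets the maximum of 4 and one carrying only 'protective' alleles gets 0, in line with "the higher the score, the greater the risk".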

Additional references

COVID:

Polygenic Risk Scores:

Papers studying the genetics of COVID

Random notes

A common pattern for marking unfinished code is to raise NotImplementedError, so that the gap is noticed at runtime:

@classmethod
def from_api(cls, url):
    raise NotImplementedError  # TODO
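For context, the fragment above can be placed in a complete class to show how the error surfaces. The class `Individual`, its `from_dict` constructor, and the example URL are hypothetical names used only for this sketch.

```python
class Individual:
    """Hypothetical container class, used only to illustrate the pattern."""

    def __init__(self, genotypes):
        self.genotypes = genotypes

    @classmethod
    def from_dict(cls, data):
        # An implemented alternative constructor, for contrast.
        return cls(data.get("genotypes", {}))

    @classmethod
    def from_api(cls, url):
        # Placeholder: fails loudly when called instead of returning None.
        raise NotImplementedError("from_api is not implemented yet")  # TODO


try:
    Individual.from_api("https://example.org/genome")
except NotImplementedError as exc:
    print(f"not available: {exc}")
```

Because the exception is only raised when `from_api` is actually called, the rest of the class stays usable while the method is unfinished.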

TODO: Use a few tools to automatically take care of code formatting (black), flag style-related problems (flake8), as well as warn about potential bugs (pylint).
