Skip to content

Latest commit

 

History

History
107 lines (87 loc) · 7.54 KB

README.md

File metadata and controls

107 lines (87 loc) · 7.54 KB

Data Description

Nexoid COVID-19 Survival Calculator Vulnerability Data

Nexoid is a software company based in London. It has a data research team that uses collected user data to calculate the risk index of COVID-19 to people with different vulnerabilities.

Data is gathered from the survival calculator, which is a survey that asks questions on biometrics, behaviors, living environments, medical records, etc. Nexoid then analyzes it and calculates the infection rate and mortality rate per different factors, based on over 820,000 entries of submitted data. Read the methodology.

Dataset

The survival calculator provides a complete dataset of all responses submitted. It contains information such as the respondents'

  • geographical information
  • behavior
  • segmentation
  • health conditions
  • medications
  • risk values

For complete details of all questions asked, visit the COVID-19 Survival Calculator.

Data is updated periodically, and is available back till March 24th, 2020.

Accessing Data

You can download all responses at the survival calculator's website. (~180 MB total)

Metadata describing the response analysis are available as separate files, listed together with the master dataset.

Last updated September 25th, 2020.

Structure

All survey responses are recorded in master_dataset.csv.

Field Description

Note: Description is not yet complete, awaiting contribution.

Field Description Type Example
survey_date Date on which the data was collected, in the format M/D/YYYY string 3/24/2020
region 2-letter continent code in which the data was collected (based on IP address). Possible values are:
  • AF for Africa
  • AN for Antarctica
  • AS for Asia
  • EU for Europe
  • NA for North America
  • OC for Oceania
  • SA for South and Central America
string NA
country 2-letter country code representing the country in which the data was collected (based on IP address) string US
ip_latitude Latitude associated with the IP address of submission string 38.5415
ip_longitude Longitude associated with the IP address of submission string -121.4968
ip_accuracy Accuracy of the geolocation associated with the IP address of submission, in terms of the maximum distance (in km) of error integer 5
sex Sex of the respondent. Possible values are: male and female string female
age Age range of the respondent, in intervals of 10 years string 40_50
height Height of the respondent, in cm integer 170
weight Weight of the respondent, in kg integer 102
bmi Body mass index of the respondent float 35.2
blood_type Blood type of the respondent. Possible values are:
  • ap for A+
  • an for A-
  • bp for B+
  • bn for B-
  • abp for AB+
  • abn for AB-
  • op for O+
  • on for O-
  • unknown
string bp
insurance Does the respondent have private health insurance? Possible values are: yes and no string yes
income Income level of the respondent. Possible values are:
  • high
  • med
  • low
  • blank for respondent on social welfare/government support
string high
race Race of the respondent. Possible values are:
  • white
  • mixed
  • asian
  • black
  • hispanic
  • other
string white
immigrant Is the respondent a native or an immigrant to the country he/she is in? Possible values are native and immigrant. string native
smoking Does the respondent smoke or vape? Possible values are:
  • never for never smoked or vaped
  • vape for vaping or using e-cigarettes
  • yeslight for light smoking (1-5 per day)
  • yesmedium for medium smoking (6-20 per day)
  • yesheavy for heavy smoking (>20 per day)
  • quit0 for recently quit
  • quit5 for quit >5 years ago
  • quit10 for quit >10 years ago
string quit10
alcohol Number of the days the respondent has consumed alcohol over the past 14 days. Values range from 0 to 14.

-1 is reserved for respondents who has never drunk alcohol.
integer 4
cannabis Number of the days the respondent has consumed cannabis (marijuana) over the past 28 days. Values range from 0 to 28.

-1 is reserved for respondents who has never consumed cannabis.
integer 6
amphetamines Number of the days the respondent has consumed amphetamines (ice, speed) over the past 28 days. Values range from 0 to 28.

-1 is reserved for respondents who has never consumed amphetamines.
integer -1
cocaine Number of the days the respondent has consumed cocaine over the past 28 days. Values range from 0 to 28.

-1 is reserved for respondents who has never consumed cocaine.
integer -1
lsd Number of the days the respondent has consumed LSD (acid) over the past 28 days. Values range from 0 to 28.

-1 is reserved for respondents who has never consumed LSD.
integer -1
mdma Number of the days the respondent has consumed MDMA (ecstacy) over the past 28 days. Values range from 0 to 28.

-1 is reserved for respondents who has never consumed MDMA.
integer -1
contacts_count Number of people the respondent was in close contact with over the past week. Values range from 0 to 20.

21 is reserved for >20 contacts.
integer 21
house_count
public_transport_count
working
worried
rate_reducing_risk_single
rate_reducing_risk_single_social_distancing
rate_reducing_risk_single_washing_hands
rate_reducing_risk_house
rate_reducing_risk_house_social_distancing
rate_reducing_risk_house_washing_hands
rate_reducing_risk_house_sanitizer
rate_reducing_mask
rate_reducing_mask_type
rate_reducing_government_action
rate_reducing_government_control
rate_reducing_government_spend
covid19_positive
covid19_symptoms
covid19_contact
asthma
kidney_disease
liver_disease
compromised_immune
heart_disease
lung_disease
diabetes
hiv_positive
hypertension
other_chronic
nursing_home
health_worker
prescription_medication
opinion_infection
opinion_mortality
risk_infection
risk_mortality

Note: Fields may be empty.

Attribution

This dataset is under the CC BY 4.0 license. You are free to use it for personal, educational, research, and commercial use provided you attribute the dataset to Nexoid.