You are what you eat - Relating Demographic Data to Food Consumption Habits

Abstract

The original paper presents the Tesco Grocery 1.0 data set and verifies the data by correlating the typical food product with the prevalence of different metabolic diseases. We are interested in the influence of demographic data on food composition, more specifically, we want to predict the contents of the typical food product of each ward by demographic markers such as gender, age, race, and wealth. The UK government provides ward profiles with the aforementioned demographic markers. We can merge the grocery data and the demographic data by the ward identifiers. To explore the interplay between demographic data and food consumption habits, we first explore the data and conduct a correlation analysis. Afterward, we build a model that, given demographic markers, predicts the contents of the typical food product consumed in a ward. Lastly, we plan to explore the validity of our model. Our analysis would allow us to better understand the consumption habits of different population groups.

Research questions

What is the relation between each individual demographic marker and food consumption habits?
How well can we predict food consumption habits from demographic markers?
How does data representativeness affect our model performance?

Proposed datasets

Tesco Grocery 1.0 from the paper -- this dataset provides the nutrients of the typical food product on different spatial granularities. Ward Atlas -- this dataset provides several demographic features at the ward level. Specifically, we will use gender, wealth, age, and race.

Methods

Data collection: enrich the Tesco dataset with demographic data from the Ward Atlas dataset.

Data analysis: Once we have demographic data and food datasets merged we will proceed to the analysis of correlations among different properties groups we are interested in, i.e. dependence of protein consumption on median income. After exploring the correlations we will build our model, which should predict the distribution of meal constituents on demographic data for every area.

Building the model: we will build a neural network to predict the typical food product’s ingredients We are going to explore different model configurations, i.e. Loss functions and activation functions.

Validation: We study the dependence of the model’s loss on the representativeness of the training data.

Proposed timeline

Week 1: Downloading and merging the data sets, doing a sanity check. Search for the correlations and visualize properties
Week 2: Build the model that should predict consumption habits from demographic markers.
Week 3: Study performance of the models obtained, prepare data story

Organization within the team

Alex: Data pre-processing: downloaded data, cleaned and transformed data, correlations analysis, Setting Up data story, Writing pre-processing part of data story
Egor: Finding reliable nutrients to predict, built linear models, built gradient boost models, compared performance of models, merged notebook for final submission, wrote data story part about finding the most important features
Denis: built procedure for feature selection, built neural net, built linear models, compared performance, did representativeness analysis

Contributors ✨

_Alex
💻

_Egor
💻

_Denis
💻

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Figures		Figures
.gitignore		.gitignore
Project_Extension_Egor.ipynb		Project_Extension_Egor.ipynb
README.md		README.md
atlas.pickle		atlas.pickle
pre-processing.ipynb		pre-processing.ipynb
ward_atlas.pickle		ward_atlas.pickle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

You are what you eat - Relating Demographic Data to Food Consumption Habits

Abstract

Research questions

Proposed datasets

Methods

Proposed timeline

Organization within the team

Contributors ✨

About

Releases

Packages

Languages

egorssed/you-are-what-you-eat

Folders and files

Latest commit

History

Repository files navigation

You are what you eat - Relating Demographic Data to Food Consumption Habits

Abstract

Research questions

Proposed datasets

Methods

Proposed timeline

Organization within the team

Contributors ✨

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages