Skip to content

philiptolk/food2vec

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

food2vec

Food vectors. Live demo at https://altosaar.github.io/food2vec/, blog post with more information and pretty plots here: https://jaan.io/food2vec-augmented-cooking-machine-intelligence/

Data

Get the data and preprocess it:

cd dat
./download_data.sh
# get kaggle data from https://www.kaggle.com/c/whats-cooking/
python preprocess_kaggle_data.py
./preprocess_data.sh

Usage

Train a model on the recipes dataset:

git clone [email protected]:altosaar/food2vec.git
cd food2vec
git submodule update --init
cd src/sentence_word2vec
git submodule update --init
./compile_ops.sh
# run word2vec with the recipes as context
python word2vec_optimized.py --train_data ../../dat/recipes --interactive --save_path /tmp --subsample 0 --eval 0 --interactive --embedding_size 100

Visualization & embedding exploration tools

# make the t-SNE plot
cd ../../
# run t-sne and make the plots for the ingredient embeddings
jupyter notebook ./src/plot_ingredients_recipes.ipynb
# view the analogy tool on a browser
open index.html

Embedding plot.ly plots to host them yourself

https://gist.github.com/altosaar/67d8456ad28acd1abb497f1950d8de8a

Contributing

Pull requests and all feedback welcome! Please file an issue if you run into problems replicating the results.

TODOs:

Acknowledgments

Thanks to Anthony for open-sourcing a javascript embedding browser -- the one here is heavily based on it.

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 97.9%
  • JavaScript 1.1%
  • Other 1.0%