davehand/yelp_data_visualization

yelp_dataset

CIS 550 Project Using Yelp Dataset

Notes on the dataset

Directories:

  • csvs: the original CSVs and the updated CSVs trimmed to the desired columns
  • inserts: insert scripts for the tables, plus benchmarking code for the complex queries
  • django-1.7.7: Django 1.7.7 code
  • django: the web application

Project Description:

  • Running Django on AWS EC2
  • Connecting to Oracle RDS DB and DynamoDB
  • Using Yelp Dataset Challenge Data

Instructions for getting set up:

  • For csvs and inserts: these directories were too big to submit, so they are available on Dropbox instead: https://www.dropbox.com/sh/0k5s8wlveecsyqm/AAAQGN6juTHxrVaykzmV5DxXa?dl=0

  • For dataset:

    1. Download the Yelp dataset
    2. Use the json to csv converter script to convert the JSON files to CSV
    3. Use the get_cols and split_cats scripts to extract the specific columns you want
    4. Use the insert_data script to generate the insert scripts from your CSV data
    5. Run the create tables SQL file, then the generated insert files, to load the data
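
The conversion and insert-generation steps above can be sketched roughly as follows. This is not the repository's actual scripts: the function names (`json_lines_to_csv`, `sql_literal`, `csv_to_inserts`), the table name, and the sample record are all hypothetical. The sketch assumes the dataset files hold one JSON object per line, and it quotes every value as a string for simplicity.

```python
import csv
import io
import json


def json_lines_to_csv(json_lines, columns, out):
    """Write one CSV row per JSON object, keeping only `columns`."""
    writer = csv.DictWriter(out, fieldnames=columns)
    writer.writeheader()
    for line in json_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Missing keys become empty strings instead of raising KeyError.
        writer.writerow({col: record.get(col, "") for col in columns})


def sql_literal(value):
    """Render a CSV field as a SQL string literal; empty fields become NULL."""
    if value == "":
        return "NULL"
    # Escape single quotes by doubling them, as standard SQL expects.
    return "'" + value.replace("'", "''") + "'"


def csv_to_inserts(csv_file, table):
    """Yield one INSERT statement per CSV data row."""
    reader = csv.DictReader(csv_file)
    for row in reader:
        cols = ", ".join(reader.fieldnames)
        vals = ", ".join(sql_literal(row[c]) for c in reader.fieldnames)
        yield f"INSERT INTO {table} ({cols}) VALUES ({vals});"


if __name__ == "__main__":
    # Made-up sample record for illustration only.
    sample = [json.dumps({"business_id": "b1", "name": "Joe's Diner", "stars": 4.5})]
    buf = io.StringIO()
    json_lines_to_csv(sample, ["business_id", "name", "stars"], buf)
    buf.seek(0)
    for statement in csv_to_inserts(buf, "business"):
        print(statement)
```

Building SQL text by string concatenation is only reasonable here because the output is a one-off load script generated from a trusted dataset; application code should use bind parameters instead.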
  • For django:

    1. Install Django and cx_Oracle
    2. Run 'sh runserver.sh' to start the Django server
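
For the Oracle connection, Django needs a `DATABASES` entry that uses its built-in `django.db.backends.oracle` backend, which relies on cx_Oracle. The sketch below shows one way such an entry might be built. The helper name `oracle_database_settings`, the `YELP_DB_*` environment variable names, and the default values are assumptions for illustration, not the project's actual configuration.

```python
import os


def oracle_database_settings(env=None):
    """Build a Django DATABASES dict for an Oracle instance.

    Reads hypothetical YELP_DB_* variables so credentials stay out of
    version control. When HOST and PORT are both set, Django treats
    NAME as the Oracle SID.
    """
    env = os.environ if env is None else env
    return {
        "default": {
            "ENGINE": "django.db.backends.oracle",
            "NAME": env.get("YELP_DB_NAME", "ORCL"),  # placeholder SID
            "USER": env.get("YELP_DB_USER", ""),
            "PASSWORD": env.get("YELP_DB_PASSWORD", ""),
            "HOST": env.get("YELP_DB_HOST", ""),  # e.g. the RDS endpoint
            "PORT": env.get("YELP_DB_PORT", "1521"),  # default Oracle listener port
        }
    }


# In settings.py this would be used as:
# DATABASES = oracle_database_settings()
```

Reading the values from the environment keeps the RDS credentials out of the repository, which matters when the server runs on a shared EC2 host.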
