GitHub - peterle93/Toronto-Airbnb-Analysis: Analyses the Toronto Airbnb dataset from Sept 2020. Performs exploratory analysis and builds a machine learning model to identify the features that most influence Airbnb listing prices

Project Motivation

Toronto Airbnb Dataset Analysis

This project (Write a Data Science Blog Post) is part of the Udacity Data Scientist Nanodegree Program. I used the Toronto Airbnb Dataset for this project because Toronto is the city I live in. I'm interested in using data science techniques to analyze ways to improve future listings. The questions analyzed are similar to those one might encounter in a business setting, and many of the approaches and skills used in this project are applicable to future work projects.

Using the data, I answered the following questions:

What are the most common amenities in the dataset?
Which neighborhoods have the most listings, and which have the highest review score ratings?
What is the relationship between room type and listing price?
Which features of the dataset are most influential in predicting the price of a listing?

The dataset describes Airbnb listing activity in Toronto. The original dataset can be found here: https://www.kaggle.com/robinkongninglo/toronto-airbnb-dataset
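As a rough illustration of how the first three questions could be approached with pandas, here is a minimal sketch on a tiny made-up table. The column names (`amenities`, `neighbourhood`, `room_type`, `price`, `review_scores_rating`) and the JSON-list format of the amenities field are assumptions about the CSV layout, not taken from the notebook, and the rows are toy data rather than real listings.

```python
import json
from collections import Counter

import pandas as pd

# Toy rows for illustration only; these are NOT values from the real dataset.
# Column names and the JSON-list amenities format are assumptions.
listings = pd.DataFrame({
    "neighbourhood": ["A", "A", "B", "C"],
    "room_type": ["Entire home/apt", "Private room", "Entire home/apt", "Shared room"],
    "price": [150.0, 60.0, 200.0, 30.0],
    "review_scores_rating": [95.0, 90.0, 98.0, 80.0],
    "amenities": [
        '["Wifi", "Heating", "Kitchen"]',
        '["Wifi", "Heating"]',
        '["Wifi", "Kitchen"]',
        '["Wifi"]',
    ],
})

# Question 1: count how often each amenity appears across all listings.
amenity_counts = Counter(
    amenity
    for raw in listings["amenities"]
    for amenity in json.loads(raw)
)
top_amenities = amenity_counts.most_common(3)

# Question 2: listings per neighbourhood and mean review score per neighbourhood.
listings_per_area = listings["neighbourhood"].value_counts()
mean_rating_per_area = (
    listings.groupby("neighbourhood")["review_scores_rating"]
    .mean()
    .sort_values(ascending=False)
)

# Question 3: median price for each room type, highest first.
median_price_by_room = (
    listings.groupby("room_type")["price"].median().sort_values(ascending=False)
)

print(top_amenities)
print(listings_per_area)
print(mean_rating_per_area)
print(median_price_by_room)
```

The median is used for the price comparison because listing prices tend to be skewed by a few very expensive listings; the notebook may aggregate differently.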

Summary of Results

The most common amenities in Toronto listings are:

Wifi
Heating
Smoke Alarm
Essentials
Kitchen

Waterfront Communities - The Island has the most listings, followed by Niagara, and then Annex.
Forest Hill South, Ionview, and High Park-Swansea have the highest review score ratings.
Entire home/apt has the highest median price of all room types, and Shared room has the lowest.
The features that have the most influence on the listing price are bedrooms, followed by Entire home/apt, then accommodates.
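One common way to rank features by their influence on price is to fit a tree-based regressor and read its feature importances. The sketch below does this on synthetic data. The README does not say which model the notebook uses, so the choice of `RandomForestRegressor` is an assumption, as are the column names. The synthetic prices are deliberately constructed so that `bedrooms` dominates, so the output only illustrates the mechanics and says nothing about the real dataset.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

# Synthetic data for illustration only (NOT the real listings).
rng = np.random.default_rng(0)
n = 200
data = pd.DataFrame({
    "bedrooms": rng.integers(1, 5, size=n),
    "accommodates": rng.integers(1, 9, size=n),
    "room_type": rng.choice(
        ["Entire home/apt", "Private room", "Shared room"], size=n
    ),
})

# Made-up price formula: bedrooms is given the largest effect on purpose.
data["price"] = (
    80 * data["bedrooms"]
    + 5 * data["accommodates"]
    + 20 * (data["room_type"] == "Entire home/apt")
    + rng.normal(0, 5, size=n)
)

# One-hot encode the categorical room type so the model can use it.
X = pd.get_dummies(data.drop(columns="price"), columns=["room_type"], dtype=float)
y = data["price"]

# Assumed model choice; the notebook may use a different estimator.
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X, y)

# Rank features by importance, highest first.
importances = pd.Series(model.feature_importances_, index=X.columns).sort_values(
    ascending=False
)
print(importances)
```

One-hot encoding is what makes a room type such as Entire home/apt show up as its own feature in a ranking like the one reported above.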

Medium Blog Post

I wrote a Medium blog post about this analysis: https://le-peter1993.medium.com/data-exploration-for-toronto-airbnb-56b5387d7007

Libraries:

I used Python 3 in a Jupyter Notebook with the following libraries:

NumPy
pandas
scikit-learn
Matplotlib
Seaborn
Folium
collections (standard library)
math (standard library)

File Descriptions

Toronto Airbnb Dataset.ipynb - Jupyter notebook with complete analysis, answers to the questions, explanations and visualisations
listings_sep_09_2020.csv - Original Toronto Airbnb Dataset from Sept 2020 in csv format
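For readers who want to explore the CSV outside the notebook, a minimal loading sketch follows. The helper name `load_listings` is hypothetical, and the assumption that the `price` column is stored as text such as `$1,234.00` has not been checked against the file.

```python
import pandas as pd


def load_listings(path):
    """Read the listings CSV and convert a text price column to float.

    Assumes (unverified) that prices look like '$1,234.00'.
    """
    df = pd.read_csv(path)
    if "price" in df.columns and not pd.api.types.is_numeric_dtype(df["price"]):
        df["price"] = (
            df["price"]
            .astype(str)
            .str.replace(r"[$,]", "", regex=True)  # drop '$' and thousands separators
            .astype(float)
        )
    return df
```

With the repository checked out, this could be called as `listings = load_listings("listings_sep_09_2020.csv")`.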

