- Installation
- Project Motivation
- File Descriptions
- Visualizations
- Licensing, Authors, and Acknowledgements
All the librarires required to run the code are mentioned in requirements.txt.
To install Run: pip install -r requirements.txt
To train a mahcine learning model for predicting whether a person survived or not using the information provided in the dataset. For this project, I was also interested in using the dataset to better understand the following aspects.
- Were Females more like to survive?
- Were people of higher socioeconomic class more likely to survive?
- Were people with more siblings or spouses aboard less likely to survive?
-
Titanic.ipynb
: The jupyter notebook Titanic.ipynb includes data exploration, code, machine learning model and visualizations. After comparing the accuracy of severla lgorithms I decided to use I decided to use the Gradient Boosting model with accuracy score 84.77. -
Titanic_Profile_Report.html
: Profiling report of training data set. -
Data: Includes dataset provided by Kaggle for the competition
train.csv
: Training datasettest.csv
: Test dataet
-
Visualizations: Includes all plots generated from the training data for data exploration and analysis
AgeGroup-Survived.jpg
: Plot of AgeGroup vs. Survived featureCabinBool-Survived.jpg
: Plot of CabinBool vs. Survived featureParch-Survived.jpg
: Plot of Parch vs. Survived featurePclass-Survived.jpg
: Plot of Pclass vs. Survived featureSex-Survived.jpg
: Plot of Sex vs. Survived featureSibSp-Survived.jpg
: Plot of SibSp vs. Survived feature
Screenshot 1: Females were more likely to survive
Screenshot 2: People with more siblings or spouse aboard were less likely to survive
Screenshot 3: People with higher socioeconomic class were more likely to survive
Author: Rahul Gupta Copyright 2020
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.