Course material for Introduction to Data Analytics - BA780
This repository is comprised of notebooks, datasets, and other related materials for Introduction to Data Analytics (BA780), an introductory course to data science in Python.
This repository can be accessed via this short link:
Most of the content come from Python Data Science Handbook by Jake VanderPlas (under the MIT license. Read more at the Open Source Initiative). Modifications and updates have been made.
Some of the notebooks come from A Whirlwind Tour of Python (free 100-page pdf) by Jake VanderPlas (under CC0 license). Some modifications and updates have been made in some places to keep its content up to date. A Whirlwind Tour of Python is a fast-paced introduction to essential components of the Python language for data science and/or scientific programming. This material was written and tested using Python 3.7, and should work for any Python 3.X version.
References:
- Python Data Science Handbook by Jake VanderPlas
- A Whirlwind Tour of Python by Jake VanderPlas
- Spark: The Definitive Guide, 2018 by Matei Zaharia, Bill Chambers.
- An Introduction to Statistical Learning by Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani.