This repository contains the notebooks for the Data Engineering course.
Lab # |
Topic | Lab Notebook |
Exercise Solutions Notebook |
---|---|---|---|
1 | Introduction to Anaconda, Python & Pandas | Lab 1 |
|
2 | Data Visualization | Lab 2 |
Lab 2 Solutions |
3 | Data Tidying | Lab 3 |
Lab 3 Solutions |
4 | Data Cleaning | Lab 4 |
Lab 4 Solutions |
5 | Outliers | Lab 5 |
Lab 5 Solutions |
6 | Data Transformation | Lab 6 |
Lab 6 Task & Solutions |
7 | Data Integration and Feature Engineering | Lab 7 |
Lab 7 Solutions |
8 | Airflow | ||
9 | Introduction to PySpark | Lab 9 |
Lab 9 Solutions |
|