date | duration | maintainer | order | title |
---|---|---|---|---|
w01d01 | 90 | adashofdata | 3 | Pandas Intro |
- (30m) EDA Presentation
- (60m) Pandas Basics Notebook
- (30m + 15m optional) Pandas Exercise (Solution)
By the end of this lecture, students should know:
- The basic data science workflow and where data cleaning and EDA fit in
- Common data issues and how to resolve them (such as imputing missing data)
- Common EDA techniques and the tools used for EDA (including visualization tools)
- How to manipulate data using pandas, including:
  - Basic pandas data structures such as Series and DataFrames
  - How to read in data using pandas and view basic info and descriptive statistics about DataFrames
  - How to select rows and columns in a DataFrame using .loc and .iloc
  - How to create masks and filter data
  - How to handle null values, and the difference between NaN and None
  - How to group (groupby) and sort data in pandas
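
As a rough preview of the pandas objectives above, the sketch below walks through each one on a tiny in-memory DataFrame. The data, column names, and variable names are invented for illustration and are not taken from the course notebook. Mean imputation is shown only as one simple option for missing values, not as the recommended strategy.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; values and column names are made up for this sketch.
df = pd.DataFrame(
    {
        "city": ["Austin", "Boston", "Austin", "Denver", "Boston"],
        "sales": [100.0, np.nan, 250.0, 75.0, None],
        "units": [10, 4, 25, 8, 3],
    },
    index=["a", "b", "c", "d", "e"],
)

# A single column is a Series; the whole table is a DataFrame.
sales = df["sales"]

# Basic info and descriptive statistics.
df.info()
summary = df.describe()

# .loc selects by label; .iloc selects by integer position.
by_label = df.loc["b":"d", ["city", "units"]]  # label slices include the end label
by_position = df.iloc[1:4, [0, 2]]             # position slices exclude the end

# A mask is a boolean Series; indexing with it keeps the rows where it is True.
mask = (df["city"] == "Austin") & (df["units"] > 5)
austin_big = df[mask]

# In a float column, None is stored as NaN, so both count as missing.
n_missing = df["sales"].isna().sum()

# In an object column, None and NaN can coexist; isna() flags both.
mixed = pd.Series(["x", None, np.nan], dtype=object)

# Impute missing sales with the column mean (one simple strategy among many).
filled = df["sales"].fillna(df["sales"].mean())

# Group by city, total the units, and sort from largest to smallest.
totals = df.groupby("city")["units"].sum().sort_values(ascending=False)
print(totals)
```

With real data, the DataFrame would typically come from a reader such as `pd.read_csv` rather than being built by hand; the selection, filtering, null-handling, and grouping steps stay the same.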