mharty3/data_engineering_zoomcamp_2022

Data Engineering Zoomcamp 2022

My Course Notes

  • Data Lakes
  • Data Pipeline Orchestration with Airflow
  • Basics of Data Warehousing and BigQuery
  • Ingesting data into BQ Data Warehouse with Airflow
  • Optimizing performance and cost with partitioning and clustering in BQ
  • Machine Learning in BQ
  • ETL vs ELT
  • dbt basics
  • Transformations in the data warehouse and dbt Cloud
  • dbt Project Repo
  • Dashboards in Google Data Studio
  • Batch vs Streaming
  • Installing Spark
  • Spark SQL and DataFrames
  • Spark Internals
