wafemi999/etl-and-data-pipelines-airflow

etl-and-data-pipelines-airflow

ETL and Data Pipelines using Airflow.

Project Overview:

This project aims to decongest national highways by analyzing road traffic data from different toll plazas. Each highway is run by a different toll operator, and each operator has its own IT setup and file formats. As a vehicle passes a toll plaza, its data (vehicle_id, vehicle_type, toll_plaza_id, and timestamp) is streamed to Kafka.
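
To make the record above concrete, here is a minimal sketch that parses one streamed message into a typed record. The comma-separated message layout, the field order, the ISO 8601 timestamp, the integer IDs, and the names `TollRecord` and `parse_toll_message` are all assumptions made for illustration; the actual message format used by the toll operators is not specified in this README.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class TollRecord:
    """One vehicle passing a toll plaza (field types are assumed)."""
    vehicle_id: int
    vehicle_type: str
    toll_plaza_id: int
    timestamp: datetime


def parse_toll_message(message: str) -> TollRecord:
    """Parse an assumed 'vehicle_id,vehicle_type,toll_plaza_id,timestamp' message."""
    parts = [part.strip() for part in message.split(",")]
    if len(parts) != 4:
        raise ValueError(f"expected 4 fields, got {len(parts)}: {message!r}")
    vehicle_id, vehicle_type, toll_plaza_id, timestamp = parts
    return TollRecord(
        vehicle_id=int(vehicle_id),
        vehicle_type=vehicle_type,
        toll_plaza_id=int(toll_plaza_id),
        # Assumes an ISO 8601 timestamp such as 2021-08-01T10:15:30.
        timestamp=datetime.fromisoformat(timestamp),
    )
```

Rejecting malformed messages early keeps bad rows out of the database; a real pipeline might log and skip them instead of raising.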

Procedure

  1. Collect the data available in different formats and consolidate it into a single file.
  2. Create a data pipeline that collects the streaming data and loads it into a database.
  3. Confirm that the submitted DAG has been picked up by Airflow by checking that it appears in the output of the command: 'airflow dags list'
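
Step 1 could be sketched roughly as follows, using only the Python standard library. This is an illustrative assumption, not the repository's actual DAG code: the input formats (comma- and tab-delimited files with header rows), the column names in `FIELDS`, and the helpers `read_delimited` and `consolidate` are hypothetical.

```python
import csv

# Assumed common schema, based on the fields named in the project overview.
FIELDS = ["vehicle_id", "vehicle_type", "toll_plaza_id", "timestamp"]


def read_delimited(path, delimiter):
    """Read a delimited file with a header row and keep only the common fields."""
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle, delimiter=delimiter)
        return [{field: row[field] for field in FIELDS} for row in reader]


def consolidate(sources, output_path):
    """Merge (path, delimiter) sources into one CSV file; return the row count."""
    count = 0
    with open(output_path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        writer.writeheader()
        for path, delimiter in sources:
            for row in read_delimited(path, delimiter):
                writer.writerow(row)
                count += 1
    return count
```

A call such as `consolidate([("plaza_a.csv", ","), ("plaza_b.tsv", "\t")], "consolidated.csv")` (file names hypothetical) would write a single file with a uniform column order. In an Airflow DAG, each reader and the final merge could be wrapped in separate tasks.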
