Skip to content

megaurko/spark-batch-examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

How to run project locally

  1. unzip the database dump file at docker/lahman2016.zip

  2. build docker MySQL container with test data

    cd docker
    docker build -t mysql-test .
  3. run detached database container

    docker run -d --name mysql-test -e MYSQL_ROOT_PASSWORD=password -p 3306:3306 mysql-test
  4. compile project

    from root directory:

    sbt assembly

    jar file will be located at target/scala-2.13/big-data-engineer-hw-assembly-0.1.0-SNAPSHOT.jar

  5. run spark submit

    copy jar to desired location and run spark-submit:

    spark-submit big-data-engineer-hw-assembly-0.1.0-SNAPSHOT.jar

    after running output folder is created and the results will appear there

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published