This repository contains example code and sample data for Structured data analysis with Spark SQL session. Follow the below steps to clone code and setup your machine.
- Java
- Maven 3
git clone https://github.com/phatak-dev/structured_data_processing_spark_sql
mvn clean install
then run the following command from code directory
java -cp target/spark-sql-examples.jar com.madhukaraphatak.spark.sql.sqlquery.QueryCsv local src/main/resources/sales.csv
You can run all the examples from terminal. If you want to run from the IDE, follow the below steps
- IDEA 14
Install scala plugin. Once plugin is loaded you can load it as maven project.
Please pull before coming to the session to get the latest code.