Skip to content

Latest commit

 

History

History
 
 

serverless-glue

Complete AWS Glue example app

This example app is a full AWS Glue setup that deploys a Glue job using Python Shell Script and a Glue job using PySpark.

Overview

This example was created as a composition of the following examples:

Local Development

We have two options to run the services locally:

  • Using VSCode + Remote Containers (recommended)
  • Using Docker Compose - (longer, but more flexible)

Using VSCode + Remote Containers (recommended)

  1. Install Docker
  2. Install VSCode
  3. Install the Remote Development extension
  4. Clone this repository
  5. Create your application within a container (see gif below)

Create application within a container

after the container is running inside VSCode, you can try to run the jobs locally.

In the following gif we execute the following commands:

glue-spark-submit src/jobs/pyspark.py --JOB_NAME job_example --CUSTOM_ARGUMENT custom_value

Using Docker Compose manually

Check the full development documentation to learn how to setup a local development environment for DataNaN using Docker Compose.

Deployment

We use Serverless Framework to deploy the AWS Glue jobs among other resources. Check the full deployment documentation to learn how to deploy the AWS Glue jobs.