These pipelines demonstrate how to bulk ingest data from Oracle 19c and process Change Data Capture (CDC) into Databricks Delta Lake.
- StreamSets Data Collector 3.16.0 or higher. You can deploy Data Collector on the cloud provider of your choice, or download it for local use.
- Access to a Databricks cluster running Databricks Runtime 6.3 or higher
- Ensure the prerequisites for Databricks Delta Lake are satisfied
- Access to an Oracle 19c database
- Download the pipelines and import them into your Data Collector or Control Hub
- After importing the pipelines into your environment and before running them, update the pipeline parameters with:
  - your Oracle 19c JDBC URL
  - your Databricks cluster JDBC URL
  - staging information on the Databricks Delta Lake destination >> Staging tab
  - table/key column information on the Databricks Delta Lake destination >> Data tab
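As a rough illustration of the two JDBC URLs you will supply, the values typically follow the Oracle thin-driver and Databricks cluster JDBC formats shown below. The hostnames, port, service name, and httpPath here are placeholders (not taken from the sample pipelines), and the parameter names on the left are hypothetical; use the parameter names defined in the imported pipelines.

```text
# Illustrative values only — replace every placeholder with your own environment's details.
# Oracle 19c via the thin driver: jdbc:oracle:thin:@//<host>:<port>/<service_name>
oracle.jdbc.url=jdbc:oracle:thin:@//oracle-host.example.com:1521/ORCLPDB1

# Databricks cluster JDBC URL, copied from the cluster's JDBC/ODBC connection details
databricks.jdbc.url=jdbc:spark://dbc-example.cloud.databricks.com:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/0/0123-456789-example
```

The exact Databricks URL (including the httpPath value) should be copied from your cluster's JDBC/ODBC tab in the Databricks workspace rather than constructed by hand.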
- Start your Databricks cluster
For technical details and a deeper explanation, refer to this blog.