Skip to content

abyssnlp/pyspark-delta

Repository files navigation

Data Processing with PySpark and Delta Lake on AWS EMR

This is the companion code for the Unskew data blog post here.

We use PySpark to process data and write it to S3 as a delta lake table. Later, we discuss how to deploy this PySpark application to EMR.

As always, please write to us with any questions, comments or improvements.

About

Data Processing with PySpark and Delta lake on AWS EMR

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published