A VS Code Extension to make it easier to manage and develop Spark jobs on EMR
-
Updated
Jun 22, 2024 - TypeScript
A VS Code Extension to make it easier to manage and develop Spark jobs on EMR
A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs
⛳️ PASS: Amazon Web Services Certified (AWS Certified) Data Analytics Specialty (DAS-C01) by learning based on our Questions & Answers (Q&A) Practice Tests Exams.
⛳️ PASS: Amazon Web Services Certified (AWS Certified) Machine Learning Specialty (MLS-C01) by learning based on our Questions & Answers (Q&A) Practice Tests Exams.
Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR
Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR
Bits of code I use during live demos
Sample CI/CD pipeline for using GitHub Actions with Amazon EMR Serverless Spark.
Building Data Lake and ETL pipelines using Amazon EMR, S3, and Apache Spark
Samples related to data engineering, e.g. spark, embulk, airflow, etc.
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Configure Hadoop YARN CapacityScheduler on Amazon EMR on Amazon EC2 for multi-tenant heterogeneous workloads
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for Apache Airflow (MWAA) on AWS.
This repo provides cross-account integration code samples using Amazon S3 Access points
With Amazon EMR and machine learning techniques supported by PySpark, a model was built to assist the fictitious music streaming service provider to predict customer churn rate based on user click data.
Udacity Data Engineering Capstone project
Orchestrate an Amazon EMR on Amazon EKS Spark job with AWS Step Functions
Project files for the post: Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster.
Add a description, image, and links to the amazon-emr topic page so that developers can more easily learn about it.
To associate your repository with the amazon-emr topic, visit your repo's landing page and select "manage topics."