etl-pipeline

Here are 1,439 public repositories matching this topic...

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. It runs and scales everywhere Python does.

  • Updated Jul 6, 2024
  • Jupyter Notebook

This project demonstrates integrating data from multiple sources into a unified data lake. It uses AWS Glue for ETL tasks, Amazon S3 as the data lake, AWS Data Pipeline to automate data movement, Amazon RDS for relational data storage, and Amazon Redshift for data warehousing and analysis.

  • Updated Jul 5, 2024
