Skip to content
#

data-transformation

Here are 454 public repositories matching this topic...

This repository contains the NYC Taxi Data Engineering Pipeline project, which aims to build a comprehensive data engineering pipeline using NYC taxi data from the years 2022 and 2023. The pipeline involves extracting, transforming and loading (ETL) data into a Snowflake database, followed by creating a dashboard for visualisation.

  • Updated Jul 4, 2024
  • Python

Culled from the UCI Machine Learning Repository, the Dry Bean Dataset (licensed under CC BY 4.0) provides valuable insights into bean classification and is a valuable resource for machine learning enthusiasts.

  • Updated Jul 2, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-transformation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-transformation topic, visit your repo's landing page and select "manage topics."

Learn more