Apache Superset is a Data Visualization and Data Exploration Platform
Updated Jul 4, 2024 - TypeScript
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Workflow Engine for Kubernetes
The Data Engineering Cookbook
Roadmap to becoming a data engineer in 2021
An orchestration platform for the development, production, and observation of data assets.
Turns Data and AI algorithms into production-ready web applications in no time.
Always know what to expect from your data.
🐚 Python-powered shell. Full-featured and cross-platform.
Fancy stream processing made operationally mundane
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
Open Source Feature Flagging and A/B Testing Platform
The open source high performance ELT framework powered by Apache Arrow