Labanoras/awesome-dbt

Awesome dbt

Welcome to the awesome curated list of dbt resources!

Contributions of any kind are encouraged and appreciated. Before contributing, please check the contribution guidelines first! Add new entries at the top of each section (LIFO) to keep fresh items more visible. Also, feel free to add new sections.

Happy contributing!

Contents

Get Started

Courses to get you started with Analytics Engineering.

How To

A helping hand for setting up integrations and implementing best practices.

Integrations

Collection of known data integrations with dbt.

  • Datafold - Gives a quick print out summary of changes so you can move fast and (not) break stuff!
  • Raycast dbt Metadata - Queries the dbt Cloud API to return some useful information about your models (number of tests, time they took to run etc…).
  • Cube - APIs, Caching, and Access Control on top of dbt Metrics.
  • FlexIt Analytics - Business Intelligence platform with deep dbt Cloud and CLI integration.
  • Raycast dbt Jobs - Raycast integration to monitor dbt Cloud Jobs.
  • Metaplane - Data Observability layer on top of your dbt + BI project.
  • Dbt + Machine Learning: What makes a great baton pass? - Landscape of ML utilities around dbt.
  • Soda - Integration of Soda's data observability platform and dbt.
  • Supported Adapters - Officially supported database adapters.
  • Lightdash - Open source Looker alternative with deep dbt integration.
  • Superset - Open source visualization layer for your Modern Data Stack.
  • Dagster and dbt: Better Together - Getting started with the dagster-dbt library.
  • fal - Add multi-language support (Python) to your dbt project.
  • Privacy Dynamics - Anonymize data in your dbt project.
  • prefect-dbt - Collection of Prefect integrations for working with dbt in your Prefect flows.

User Stories

Use cases and user stories implemented by community members using components of the Modern Data Stack (MDS) with dbt.

Data Quality

Best practices and extensions of the testing framework.

CI/CD

Get the most out of your product quality and seamless delivery.

Orchestration

Resources to manage and maintain dependencies in modern data pipelines.

Utilities

Useful tools and extensions to bump up your analytics engineer workflow.

Packages

Community-developed packages to extend default macros and toolset.

  • dbt_linreg - Linear regression in SQL using dbt.
  • dbt-snowflake-query-tags - Automatically tag dbt-issued queries with informative metadata.
  • snowflake-resource-monitoring - Yet another package to monitor Snowflake usage.
  • usagedata - Provides insights on database- and table-level usage information from Snowflake.
  • dbt_ml - Package for dbt that allows users to train, audit and use BigQuery ML models.
  • ddbt - An attempt to build a fast version of dbt, which gets very slow on large projects (3000+ data models). Aims to be a direct drop-in replacement for dbt at the command line.
  • dbt-snowflake-monitoring - A dbt package to help you monitor Snowflake performance and costs.
  • datavault4dbt - Macros for staging and creating all the Data Vault entities you need to build your own Data Vault 2.0 solution.
  • DDO - Perform DataOps & administrative CI/CD on your data warehouse.
  • dbt-yaml-check - Checks that columns defined in YAML also exist in SQL.
  • data-diff - A command-line tool and Python library to efficiently diff rows across two different databases.
  • dbt-project-evaluator - This package highlights areas of a dbt project that are misaligned with dbt Labs' best practices.
  • dbt_constraints - Generate database constraints based on the tests in a dbt project.
  • dbt-date - Date logic and calendar functionality.
  • dbt-privacy - Macros to make it easier to protect your customers' data.
  • dbt-fivetran-utils - General macros and helpers.
  • dbt_metrics - Macros to support secondary calculations and generate business metrics.
  • dbt-metabase - Model synchronization from dbt to Metabase.
  • dbt-coves - CLI tool for generating a scaffold for your dbt project.
  • dbt-profiler - Data profiling and doc block generator.
  • dbt_utils - General macros library. A must have.
  • dbt_audit_helper - Macros for data audits that compare column values and schemas between tables.
  • dbt-ml-preprocessing - A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.
  • dbt-external-tables - Macros to stage your external sources.
  • dbt-feature-store - Macros to build a feature store right within your dbt project.
  • dbt-codegen - Macros that generate dbt code, and log it to the command line.
  • dbt-init - Create a project and populate as much of the dbt project as possible.
  • dbt-artifacts - This package builds a mart of tables from dbt artifacts loaded into a table.
  • dbt-erdiagram-generator - This package generates ERD diagrams from a dbt project.
  • Terraform-dbt Cloud Module - IAC in dbt Cloud via Terraform.
  • dbt2looker - Generate Looker views for dbt models.
  • dbt-coverage - Checks dbt docs and tests coverage.
  • dbt-meta-testing - Yet another coverage testing package.
  • dbt-superset-lineage - Push and pull metadata between dbt and Superset.
  • dbtvault - Package for generating and executing ETL for Data Vault 2.0.
  • dbt-invoke - CLI for creating, updating, and deleting dbt property files.
  • dbt-unit-testing - Package which contains macros to support unit testing.

Community

Conferences, meetups, discussions, newsletters, podcasts, etc. led by fellow analytics engineers and forums of contact.

Sample Projects

Sample projects that work out of the box and reflect publicly available use cases.

Contributors

Thanks for all the great resources! Can't see your avatar? Check the contribution guide on how you can submit your resources to the community!
