dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
-
Updated
Jul 21, 2025 - Python
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
MetricFlow allows you to define, build, and maintain metrics in code.
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/
Linked Open Data Modeling Language
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.
Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)
Define, govern, and model event data for warehouse-first product analytics.
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Link Modeling Language (LinkML) model
Automated assistance for the schema development lifecycle
An end-to-end data pipeline which extracts divvy bikeshare data from web loads it into data lake and datawarehouse transforms it using dbt and finally , a dashboard to visualize the data using looker studio, the pipeline is orchestrated using prefect
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
The dbt adapter for Firebolt
WG3 Metadata Specification
Development of the Gellish Communicator reference application and tools for universal data exchange and data integration supporting Formal English and other Gellish formalized natural languages.
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. There are different tools that have been used in this project such as Astro, DBT, GCP, Airflow, Metabase.
Add a description, image, and links to the data-modeling topic page so that developers can more easily learn about it.
To associate your repository with the data-modeling topic, visit your repo's landing page and select "manage topics."