A SQL-only ETL data infrastructure
Updated Feb 24, 2025 - PLpgSQL
Personal notes and lab solutions for the Data Engineer Handbook Bootcamp
This project implements a data engineering pipeline using Databricks, PySpark, dbt, and Delta Live Tables. It follows the Medallion Architecture, supports real-time data ingestion with Auto Loader, and models data with fact and dimension tables, including Slowly Changing Dimensions (SCD Type 2). Everything is orchestrated in a scalable cloud environment.
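For readers unfamiliar with SCD Type 2, here is a minimal sketch of the idea in plain Python. It is not taken from the project above, which uses Databricks tooling. The `DimRow` fields and the `apply_scd2` helper are hypothetical names chosen for illustration. When a tracked attribute changes, the current row is closed (its `valid_to` is set and `is_current` is cleared) and a new current row is appended, so history is preserved rather than overwritten.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class DimRow:
    # Hypothetical customer dimension; column names are assumptions.
    customer_id: int          # natural (business) key
    city: str                 # tracked attribute
    valid_from: date
    valid_to: Optional[date]  # None means the row is still open
    is_current: bool


def apply_scd2(dim: List[DimRow], customer_id: int,
               new_city: str, change_date: date) -> List[DimRow]:
    """Return a new dimension list with one SCD Type 2 change applied."""
    result = []
    needs_new_version = True
    for row in dim:
        if row.customer_id == customer_id and row.is_current:
            if row.city == new_city:
                # Nothing changed: keep the current row untouched.
                needs_new_version = False
                result.append(row)
            else:
                # Close the old version instead of overwriting it.
                result.append(replace(row, valid_to=change_date, is_current=False))
        else:
            result.append(row)
    if needs_new_version:
        # Open a new current version (also covers brand-new keys).
        result.append(DimRow(customer_id, new_city, change_date, None, True))
    return result


dim = [DimRow(1, "Berlin", date(2024, 1, 1), None, True)]
dim = apply_scd2(dim, 1, "Munich", date(2025, 2, 24))
```

After the call, `dim` holds two rows for customer 1: the closed "Berlin" version and the open "Munich" version. In a warehouse the same logic is usually expressed as a merge statement or handled by the platform's change-data-capture features rather than by row-by-row code.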
Materials from The Data Engineering Academy