delta-tables

Here are 9 public repositories matching this topic...

An ETL pipeline and sales analysis project built on Microsoft Azure and PySpark. It analyzes e-commerce sales data to surface insights aimed at improving sales.

  • Updated Mar 22, 2024
  • Jupyter Notebook

This project builds a cloud-based pipeline to extract NYC taxi data from an API and store it in Azure Data Lake Storage (ADLS). Databricks and PySpark are used to transform the data through the medallion architecture (Bronze → Silver → Gold). Delta Lake ensures reliable storage, and Power BI provides visual insights for data-driven decision-making.

  • Updated Dec 3, 2024
  • Jupyter Notebook
