Data pipelines from re-usable components
The open-source Useful SDK: a single Python decorator in the Useful library provides full observability of Python functions within an ETL.
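The Useful SDK's real API isn't reproduced here, but the general pattern is a decorator that records a step's runtime and outcome. A minimal sketch, assuming a hypothetical `observe` decorator built on plain `logging` (not the library's actual interface):

```python
import functools
import logging
import time

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("etl.observability")

def observe(func):
    """Log duration and success/failure of an ETL step.
    Illustrative only; not the Useful SDK's real decorator."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            result = func(*args, **kwargs)
            logger.info("%s succeeded in %.3fs", func.__name__, time.perf_counter() - start)
            return result
        except Exception:
            logger.exception("%s failed after %.3fs", func.__name__, time.perf_counter() - start)
            raise
    return wrapper

@observe
def transform(rows):
    return [r.upper() for r in rows]

transform(["a", "b"])
```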
A project structure for doing and sharing data engineering work.
Link to the application.
e-Portfolio showcasing my personal projects.
Build ETL pipelines on Airflow to load data from BigQuery and store it in MySQL.
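A minimal sketch of such a DAG, assuming Airflow 2.4+ with the MySQL provider and the `google-cloud-bigquery` client installed; the query, table names, and connection ID are placeholders:

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator
from airflow.providers.mysql.hooks.mysql import MySqlHook
from google.cloud import bigquery

def bigquery_to_mysql():
    # Pull rows from BigQuery (project/dataset/table are placeholders)...
    client = bigquery.Client()
    rows = [tuple(row.values()) for row in client.query(
        "SELECT id, name, amount FROM `my_project.my_dataset.orders`"
    ).result()]
    # ...and insert them into MySQL via the Airflow connection "mysql_default".
    MySqlHook(mysql_conn_id="mysql_default").insert_rows(
        table="orders", rows=rows, target_fields=["id", "name", "amount"]
    )

with DAG(
    dag_id="bigquery_to_mysql",
    start_date=datetime(2023, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    PythonOperator(task_id="load_orders", python_callable=bigquery_to_mysql)
```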
DataSift automatically applies a data pre-processing pipeline to data science projects.
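One common way to auto-apply pre-processing is to choose transformers from column dtypes; a sketch using scikit-learn (not DataSift's actual implementation):

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

def build_preprocessor(df: pd.DataFrame) -> ColumnTransformer:
    # Route numeric columns to impute+scale, everything else to impute+one-hot.
    numeric = df.select_dtypes(include="number").columns.tolist()
    categorical = df.select_dtypes(exclude="number").columns.tolist()
    return ColumnTransformer([
        ("num", Pipeline([("impute", SimpleImputer(strategy="median")),
                          ("scale", StandardScaler())]), numeric),
        ("cat", Pipeline([("impute", SimpleImputer(strategy="most_frequent")),
                          ("encode", OneHotEncoder(handle_unknown="ignore"))]), categorical),
    ])

df = pd.DataFrame({"age": [25, np.nan, 40], "city": ["BA", "NYC", np.nan]})
features = build_preprocessor(df).fit_transform(df)
```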
An extension that registers all pharmacies in Argentina.
A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Developed as part of Udacity's Data Science Nanodegree program.
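The repo's own model isn't reproduced here, but a multi-label text classifier of this shape can be sketched with scikit-learn's `MultiOutputClassifier`; the toy messages and three stand-in categories are invented for illustration:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import Pipeline

# Toy data standing in for the real disaster-message corpus.
messages = ["we need water and food", "roads are blocked", "medical help required"]
labels = [[1, 0, 1], [0, 1, 0], [1, 0, 0]]  # e.g. aid_related, infrastructure, medical

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", MultiOutputClassifier(LogisticRegression(max_iter=1000))),
])
pipeline.fit(messages, labels)
print(pipeline.predict(["send water"]))  # one 0/1 prediction per category
```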
This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes.
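A typical smoke test for that local-first workflow loads every DAG file and fails on import errors; a sketch using Airflow's `DagBag` (the `dags/` folder path is an assumption):

```python
from airflow.models import DagBag

def test_dags_import_without_errors():
    """Fail if any DAG file in dags/ has an import error or defines no tasks."""
    dag_bag = DagBag(dag_folder="dags/", include_examples=False)
    assert not dag_bag.import_errors
    for dag_id, dag in dag_bag.dags.items():
        assert dag.tasks, f"{dag_id} has no tasks"
```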
A Python- and Spark-based ETL framework. While it operates within the speed limits of its framework and standards, it offers boundless possibilities.
JSON-driven ETL pipeline framework prototype
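A prototype of that idea can be small: a registry of step functions plus a JSON config naming the steps to run in order. The step names and config schema below are invented for this sketch:

```python
import csv
import json

# Step implementations keyed by name; each takes the current data plus config args.
def extract_csv(data, path):
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

def filter_rows(data, field, equals):
    return [row for row in data if row[field] == equals]

def load_json(data, path):
    with open(path, "w") as f:
        json.dump(data, f, indent=2)
    return data

STEPS = {"extract_csv": extract_csv, "filter_rows": filter_rows, "load_json": load_json}

def run_pipeline(config):
    """Run the configured steps in order, threading the dataset through each one."""
    data = None
    for step in config["steps"]:
        data = STEPS[step["name"]](data, **step.get("args", {}))
    return data

# Example config, e.g. loaded from pipeline.json; run_pipeline(config) executes it.
config = {"steps": [
    {"name": "extract_csv", "args": {"path": "orders.csv"}},
    {"name": "filter_rows", "args": {"field": "status", "equals": "shipped"}},
    {"name": "load_json", "args": {"path": "shipped_orders.json"}},
]}
```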
End-to-end MLOps project with ETL pipelines: building a network security system.
This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.
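A representative transformation step in that stack, assuming PySpark on Databricks with Delta Lake available; the mount paths and column names are placeholders, not the project's real schema:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("f1-etl").getOrCreate()

# Ingest raw race results; the mount path mirrors a typical ADLS layout.
results = spark.read.json("/mnt/raw/results.json")

# Transform: total points per driver per season, written as Delta for Power BI.
standings = (results
             .groupBy("season", "driver_id")
             .agg(F.sum("points").alias("total_points"))
             .orderBy(F.desc("total_points")))
standings.write.format("delta").mode("overwrite").save("/mnt/presentation/driver_standings")
```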