Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
-
Updated
Oct 31, 2024 - Java
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
A tool to find podcast metadata over an external api, store them, get their rss feeds and run ETL using Airflow, Kafka, Spark, and Cassandra. The particular Cassandra distribution used is Elassandra, which allows seamless integration with Elasticsearch. Displayed using a Gatsby app, served using Flask
HVAC Engine: Psychrometrics (Humid Air) analysis Java library. Humid air properties and thermodynamic processes, flows, heating, cooling, air mixing and more. Immutable, thread-safe, very accurate.
open source based development related contents
First academic big data project to implement analysis using MapReduce and Hive platform
aiflow-dashboard
Use K8s CustomResourceDefinition to define and create Airflow Dags
Data pipeline for multi-model databases using Microsoft Azure Cosmos DB SQL and Gremlin APIs and orchestration using Apache Airflow.
Text similarity based on Word2Vec on Twitter and Reddit updates, using Spark, Spring, MongoDB and Apache Airflow
Examples for ETL pipeline services infrastructure
Add a description, image, and links to the airflow topic page so that developers can more easily learn about it.
To associate your repository with the airflow topic, visit your repo's landing page and select "manage topics."