lakeFS - Data version control for your data lake | Git for data
Updated Oct 31, 2024 - Go
RealTime StockStream is a streamlined simulation system for processing live stock market data. It uses Apache Kafka for data ingestion, Apache Spark for data processing, and Apache Cassandra for data storage, making it a powerful yet easy-to-use tool for financial data analysis.
DDL for Kudu, Impala, Phoenix, HBase, Hive, MySQL, PostgreSQL, Calcite, ... Tables. SQL.
This repository contains Apache Spark programs written in Python. They are part of my learning process for Apache Spark and are intended as examples for anyone else who is learning or working with it.