Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Updated Oct 22, 2024 - Jupyter Notebook
The spatial table format for spatial lakehouse
Implemented spatial hotspot analysis on NYC Yellow Cab taxi trip records using a Spark cluster set up on AWS EC2 instances. The aim was to analyse a huge dataset using distributed cluster-computing frameworks such as Apache Spark and Apache Sedona.
Dockerised PySpark Apache Sedona examples.
Exploring Global Fishing Watch public data with SedonaDB & GeoParquet
Spatial joins with a MapReduce program on top of Apache Spark, using the Apache Sedona spatial extension
CSE 512: Distributed Database Systems
Notebook to accompany the "Hands-On With Havasu & GeoParquet" livestream