Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
-
Updated
Sep 1, 2022 - Python
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Apache Hudi examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
Apache Icebery examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
Deltalake examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
Cluster Creation using Terraform.
EMR Notebooks and SageMaker using Terraform.
PennBook is a highly scalable implementation of the core functionalities of facebook.com. It uses a Node.js server, React.js for the frontend, and Hadoop libraries such Apache Spark along with AWS Elastic MapReduce for the Big Data functionalities.
Hive Workshop using CloudFormation.
Hudi Workshop using Terraform.
Pig Workshop using CloudFormation.
Orchestrating Amazon EMR with AWS StepFunctions using CloudFormation.
EMR Managed Scaling using CloudFormation.
Presto Workshop using Terraform.
Orchestrating Amazon EMR with AWS StepFunctions using Terraform.
Spark-based ETL using Terraform.
EMR Notebooks and SageMaker using CloudFormation.
Hudi Workshop using CloudFormation.
Pig Workshop using Terraform.
EMR Managed Scaling using Terraform.
Presto Workshop using CloudFormation.
Add a description, image, and links to the elastic-map-reduce topic page so that developers can more easily learn about it.
To associate your repository with the elastic-map-reduce topic, visit your repo's landing page and select "manage topics."