This repo is a step-by-step guide that walks a GCP project admin through using Terraform to set up and validate the GCP resources required for the hands-on Serverless Spark labs. The following resources are covered:
- Google Cloud APIs
- VPC Network, Subnet, Firewall Rule and VPC Network Peering
- User Managed Service Accounts
- Persistent History Server (Dataproc on GCE Single Node cluster)
- BigQuery Dataset
- Dataproc Metastore Service
- Composer Environment
- Google Cloud Storage Buckets
- Artifact Registry repository and custom container image
The following permissions / roles are required to execute the prerequisites:
- Security Admin
- Project IAM Admin
- Service Usage Admin
- Service Account Admin
- Role Administrator
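
As a rough illustration, the five roles listed above could be granted to the admin with `gcloud projects add-iam-policy-binding`. This is a minimal sketch and not part of the lab modules: `PROJECT_ID` and `MEMBER` are placeholder values assumed here, and the script only prints the commands unless `APPLY=1` is set.

```shell
#!/bin/sh
# Sketch: print (or run) the gcloud commands that grant the roles listed above.
# PROJECT_ID and MEMBER are hypothetical placeholders, not values from this repo.
PROJECT_ID="${PROJECT_ID:-my-project-id}"
MEMBER="${MEMBER:-user:admin@example.com}"

# Role IDs for: Security Admin, Project IAM Admin, Service Usage Admin,
# Service Account Admin, Role Administrator.
ROLES="roles/iam.securityAdmin
roles/resourcemanager.projectIamAdmin
roles/serviceusage.serviceUsageAdmin
roles/iam.serviceAccountAdmin
roles/iam.roleAdmin"

grant_roles() {
  for role in $ROLES; do
    if [ "${APPLY:-0}" = "1" ]; then
      # Real run: add one IAM binding per role.
      gcloud projects add-iam-policy-binding "$PROJECT_ID" \
        --member="$MEMBER" --role="$role"
    else
      # Dry run (default): only show what would be executed.
      echo "gcloud projects add-iam-policy-binding $PROJECT_ID --member=$MEMBER --role=$role"
    fi
  done
}

grant_roles
```

Running it as-is prints one command per role; review the output, then re-run with `APPLY=1` and real values for `PROJECT_ID` and `MEMBER`.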
Please execute the following modules in sequence:
Shut down/delete resources when done to avoid unnecessary billing.
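
One possible way to tear down Terraform-managed resources is `terraform destroy`, previewed first with `terraform plan -destroy`. The sketch below assumes the resources were created from a single Terraform working directory; `MODULE_DIR` is a hypothetical placeholder path, and nothing is executed unless `APPLY=1` is set.

```shell
#!/bin/sh
# Sketch: preview and then destroy the resources managed in one Terraform directory.
# MODULE_DIR is a hypothetical placeholder; point it at the directory you applied from.
MODULE_DIR="${MODULE_DIR:-./path/to/module}"

teardown() {
  if [ "${APPLY:-0}" = "1" ]; then
    # Real run: show the destroy plan, then destroy (Terraform asks for confirmation).
    terraform -chdir="$1" plan -destroy && terraform -chdir="$1" destroy
  else
    # Dry run (default): only show what would be executed.
    echo "terraform -chdir=$1 plan -destroy"
    echo "terraform -chdir=$1 destroy"
  fi
}

teardown "$MODULE_DIR"
```

If several modules were applied in sequence, destroying them in the reverse order is usually the safer choice, since later modules may depend on resources created by earlier ones.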