The goal of the project was to create a data engineering pipeline using the NYC TLC Rideshare data and create a final dashboard to perform the analytics.
Technologies Used:
- Python3 and Jupyter Notebook to understand and transform the data.
- Google Cloud Storage Bucket to store the data.
- Google Cloud Compute Engine integrated with Mage (A modern replacement of Apache Airflow) to build the data pipeline.
- Google Cloud BigQuery to store the database and create analytical layer.
- Google Looker Studio to build the final dashboard.
Final Dashboard : Google Looker Studio