dataproc
Here are 87 public repositories matching this topic...
Support code for the article "Connecting GCP Dataproc and Elasticsearch: Bridging the Worlds of Big Data and (vector) Search"
-
Updated
Nov 26, 2023 - Jupyter Notebook
Data Workflows with GCP Dataproc, Apache Airflow and Apache Spark
-
Updated
Mar 4, 2020 - Python
Repositório para armazenar artefatos de um trabalho da disciplina de Computação Distribuída.
-
Updated
Jun 26, 2023 - Python
Generando un proceso ETL con dataset de Amazon
-
Updated
Mar 7, 2022 - Jupyter Notebook
Big data analysis of 'shared-world' cloud application.
-
Updated
Jul 8, 2020 - Jupyter Notebook
-
Updated
Nov 18, 2020 - Shell
Inventory value is also important for determining a company's liquidity, or its ability to meet its short-term financial obligations. A high inventory value can indicate that a company has too much money tied up in inventory, which could make it difficult for the company to pay its bills.
-
Updated
Oct 15, 2023 - Jupyter Notebook
Google DataProc Spark Scala Job for MNIST Handwritten Digit Recognition using Decision Trees (Spark MLlib)
-
Updated
Jan 2, 2018 - Perl 6
Orchestration Dataproc serverless job with Airflow
-
Updated
Oct 25, 2023 - Python
DataTalksClub Data Engineering Zoomcamp Project
-
Updated
Mar 16, 2024 - Python
Creating a robust and scalable data pipeline on Google Cloud Platform (GCP) to monitor and analyze stock performance. Leveraging the power of GCP's data processing and storage services, a comprehensive solution has been built to efficiently collect, process, and visualize stock data.
-
Updated
Sep 7, 2023 - Python
-
Updated
Nov 30, 2018 - Python
Collected data about from three sources, one opinion-based social media in twitter, research data in New York Times, and the third is the common crawl data for the same topic or key phrase, and from similar time periods. Processed the three data sets collected individually using classical big data methods like Map Reduce in Google Dataproc Clust…
-
Updated
Oct 25, 2019 - Python
A Pyspark project that performs ETL on a Dataproc cluster and writes data to Google Cloud Storage/BigQuery.
-
Updated
Dec 23, 2023 - Jupyter Notebook
Improve this page
Add a description, image, and links to the dataproc topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dataproc topic, visit your repo's landing page and select "manage topics."