Skip to content

Architecture of the solution and Process flow

Rajdeep Biswas edited this page Jun 11, 2020 · 2 revisions

Architecture of the solution

Architecture

Process flow

  1. Azure Databricks and Azure Blob Storage account are provisioned in Azure
  2. The source SAS token is stored in Azure Key Vault
  3. Data is read using SparkR notebooks from Azure Open Datasets in Azure Databricks
  4. Data is wrangled and enriched for further analysis using SparkR and stored in Azure Blob Storage in parquet format
  5. SparkR and 3rd party libraries are used for data analysis and visualization
  6. SparkR and 3rd party libraries are used for Forecasting and Anomaly detection.