Welcome to PolkaStakeSphere, a pioneering project designed to uncover valuable stories and insights from the Polkadot ecosystem. Our goal is to empower blockchain enthusiasts, developers, and analysts with the tools they need to explore and understand the rich data landscape of Polkadot.
This repository is a work in progress, and we would like to express our gratitude to DTC Zoomcamp for providing the scaffold to kick-start this project.
The core principle of PolkaStakeSphere is the "Portable Data Stack." We firmly believe that data analytics should be accessible to everyone, regardless of their resources or technical expertise. With this in mind, we have designed this repository to enable seamless data analytics on your local machine, eliminating the need for expensive cloud infrastructure.
PolkaStakeSphere allows you to set up your own local data warehouse and dive into the fascinating world of Web3 data without the burden of high costs or complex dependencies. Our carefully crafted tools and frameworks enable you to extract, transform, and analyze blockchain data right on your personal computer.
- Polkadot Ecosystem Insights: Use the large raw dataset collection from substrate-etl, covering both the Polkadot and Kusama networks (relay chains and parachains), to extract and analyze data on staking, validator status, and more.
- Local Data Warehouse: Set up your own data analytics environment without relying on cloud services, which keeps your data private and under your control.
- Zero Cloud Dependency: Once the raw dataset has been extracted to your local data warehouse, you can work independently of cloud platforms, with no ongoing costs or data privacy exposure.
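To make the local-warehouse idea concrete, here is a minimal sketch. It assumes a SQLite database stands in for the warehouse (this README does not name the engine the project uses), and the `validator_stakes` table, its columns, and its rows are all hypothetical, made up purely for illustration. The point is only that, once data sits on your machine, aggregate queries run with no cloud connection.

```python
import sqlite3


def build_demo_warehouse(path=":memory:"):
    """Create a tiny stand-in warehouse with a hypothetical staking table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS validator_stakes ("
        "era INTEGER, validator TEXT, total_stake REAL)"
    )
    # Made-up sample rows; real data would come from the substrate-etl export.
    rows = [
        (1, "validator_a", 100.0),
        (1, "validator_b", 250.0),
        (2, "validator_a", 120.0),
        (2, "validator_b", 240.0),
    ]
    conn.executemany("INSERT INTO validator_stakes VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def top_validators(conn, limit=10):
    """Rank validators by their average stake across eras, highest first."""
    query = (
        "SELECT validator, AVG(total_stake) AS avg_stake "
        "FROM validator_stakes GROUP BY validator "
        "ORDER BY avg_stake DESC LIMIT ?"
    )
    return conn.execute(query, (limit,)).fetchall()


if __name__ == "__main__":
    warehouse = build_demo_warehouse()
    for name, avg_stake in top_validators(warehouse):
        print(name, avg_stake)
```

Passing a file path instead of `":memory:"` would persist the database on disk, which is closer to how a local warehouse would be used in practice.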
Before you begin, complete the following steps:
- Follow the instructions from substrate-etl (quick-start-analytics-hub) to add the raw datasets to your Google Cloud project.
- Populate env.example with the appropriate configuration values.
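Before running the pipeline, it can help to confirm that no configuration value was left blank. The following is a minimal sketch, assuming the file uses plain `KEY=VALUE` lines (the common dotenv layout; the actual format of env.example is not shown here). The key names `GCP_PROJECT_ID` and `DATASET_NAME` are hypothetical placeholders, not the project's real settings.

```python
def parse_env_text(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values, required):
    """Return the required keys that are absent or have an empty value."""
    return [key for key in required if not values.get(key)]


# Hypothetical file contents, for illustration only.
sample = """
# example configuration
GCP_PROJECT_ID=my-project
DATASET_NAME=
"""

config = parse_env_text(sample)
unset = missing_keys(config, ["GCP_PROJECT_ID", "DATASET_NAME"])
if unset:
    print("Please fill in:", ", ".join(unset))
```

To check a real file, read it with `open(path).read()` and pass the text to `parse_env_text`, using the key names the project actually expects.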
- Clone the repository:
    git clone https://github.com/shamb0/pds-PolkaStakeSphere.git
- Navigate to the cloned directory:
    cd pds-PolkaStakeSphere
- Install the required Python libraries by running the following make commands in order:
    make prepare_environment
    source .activate_env && activate_env
    make install_dev_dependencies
- Start a local Dagster deployment server and materialize the data assets in the pipeline:
    make run_dagster_dev
- View the Streamlit insight dashboards:
    make view_leaderboard
    make view_validator_status
Distributed under the MIT License. See LICENSE for more information.
shamb0 - @0shamb0 - r.raajey@gmail.com
Project Link: https://github.com/shamb0/pds-PolkaStakeSphere