ShapX Engine

A Demonstration of Shapley Value Approximations

📄 Contents

Overview
Demonstration
Evaluation
Contact
Acknowledgements

1. Overview

Interpreting decisions made by machine learning models helps build trust in their predictions, ultimately facilitating their practical application. Shapley values have emerged as a popular and theoretically robust method for interpreting models by quantifying the contribution of each feature toward individual predictions. The inherent complexity associated with the computation of Shapley values as an NP-hard problem has driven the development of numerous approximation techniques, leading to a plethora of options in literature. This abundance of choices has created a substantial gap in determining the most appropriate approach for practical applications. To address this gap, we propose ShapX, a web engine that comprehensively evaluates 17 approximation methods across diverse regression and classification tasks. ShapX facilitates an interactive exploration of the strengths and limitations of various Shapley value approximations by guiding users through the suitable selections of replacement and tractable estimation strategies. Ultimately, our study reveals that strategies competent at capturing all the feature interactions leading to accurate estimations of Shapley values. ShapX also allows users to effortlessly upload their own dataset along with the corresponding machine learning model, enabling them to obtain detailed individualized explanations. A detailed walkthrough video of the demonstration is available online: Video.

2. Get Started

ShapX Engine is a modular web interface designed to explore and evaluate various strategies for estimating Shapley values in machine learning. It offers a unified platform for benchmarking accuracy, assessing compute tradeoffs, and generating interactive model explanations for tabular data. Launch App

🔍 Overview

The engine is structured into five primary frames:

🧾 (a) Description

Introduces the motivation behind building a web-based explanation tool.
Provides a comprehensive user guide to help users navigate the interface.

📊 (b) Benchmark Details

Lists the Shapley value approximation techniques implemented.
Describes the evaluation metrics used to assess accuracy and efficiency.
Summarizes the datasets and models included in the benchmark suite.

📈 (c) Accuracy Evaluation

Visualizes performance variation across estimators using:
- Boxplots for accuracy distribution.
- Critical Difference Diagrams for rank-based comparison.
Supports evaluation by varying:
- Replacement strategies.
- Tractable estimation techniques.

⏱️ (d) Compute Time

Compares per-instance runtime of different estimators.
Includes:
- Bar plots for runtime comparison.
- Line plots showing the impact of input dimensionality.
- Bubble plots visualizing the accuracy-runtime tradeoff.

🧠 (e) Interactive Explanations

Upload a CSV dataset and pickled model (.pkl).
Choose a sample instance for explanation.
Generate Shapley values for that instance using the most effective method.
View a visual plot showing each feature’s contribution to the prediction.

🚀 Getting Started

To launch locally:

git clone https://github.com/SuchitGupte/shapley_demo.git
cd shapley_demo
pip install -r requirements.txt
streamlit run app.py

3. Evaluation

We break down the approximation of Shapley values into two principal dimensions. These dimensions also serve as a guide for setting up the evaluation framework. The first dimension involves properly treating missing values with the help of different replacement strategies. We deploy each replacement strategy against an exhaustive estimation of Shapley values. This evaluation measure will highlight the strengths and weaknesses of replacement strategies, aiding future research in selecting the most reliable strategy.

The second dimension focuses on tractable estimation strategies, which are crucial for efficiently computing Shapley values. We analyze the performance of these tractable estimation strategies using established approximation algorithms. We systematically evaluate 8 distinct replacement strategies and 17 distinct approximation algorithms across a diverse set of 100 datasets. This comprehensive evaluation enables us to thoroughly assess the performance and efficacy of different strategies and approximations in estimating Shapley values across varied data scenarios.

Approaches	Estimation Strategy	Replacement Strategy
Exhaustive Sampling	Exact (All potential feature subsets)	Conditional Distribution: Separate Models
Interactions-based Method for Explanation	Random Order	Marginal Distribution: Empirical
Conditional Expectations Shapley	Random Order	Conditional Distribution: Empirical
Shapley Cohort refinement	Random Order	Conditional Distribution: Empirical
Multilinear Sampling	Multilinear Extension	Marginal Distribution: Empirical
KernelSHAP	Weighted Least Squares	Marginal Distribution: Empirical
Parametric KernelSHAP	Weighted Least Squares	Conditional Distribution: Parametric Assumption(Gaussian/Copula)
Non-Parametric KernelSHAP	Weighted Least Squares	Conditional Distribution: Empirical
SGD-Shapley	Weighted Least Squares	Predetermined Baseline: Mean
FastSHAP	Weighted Least Squares	Conditional Distribution: Surrogate model
Independent LinearSHAP	Linear	Marginal Distribution: Empirical
Correlated LinearSHAP	Linear	Conditional Distribution: Parametric Assumption(Gaussian)
Tree Interventional	Tree	Marginal Distribution: Empirical
Tree Path-dependent	Tree	Conditional Distribution: Empirical
DeepLIFT	Deep	Predetermined Baseline: All-zeros
DeepSHAP	Deep	Marginal Distribution: Empirical
DASP	Deep	Predetermined Baseline: Mean

📫 4. Contact

If you have any questions, suggestions, or ideas for improvement, feel free to:

Open an issue on this repository
Start a discussion
Reach out via email: Suchit Gupte (gupte.31@osu.edu), John Paparrizos (paparrizos.1@osu.edu)

We welcome contributions and collaboration!

⭐ 5. Acknowledgements

This project is part of a broader initiative to standardize and democratize interpretability research in machine learning. We thank the research community for their foundational work in Shapley value approximations and their continued efforts in explainable AI. This benchmark builds on their insights and aims to further the goal of trustworthy and transparent machine learning models.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
demoFiles		demoFiles
README.md		README.md
app.py		app.py
desc.jpg		desc.jpg
experror.jpg		experror.jpg
frames.jpg		frames.jpg
logo.png		logo.png
replacementStrategies.png		replacementStrategies.png
requirements.txt		requirements.txt
title.jpg		title.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ShapX Engine

A Demonstration of Shapley Value Approximations

📄 Contents

1. Overview

2. Get Started

🔍 Overview

🧾 (a) Description

📊 (b) Benchmark Details

📈 (c) Accuracy Evaluation

⏱️ (d) Compute Time

🧠 (e) Interactive Explanations

🚀 Getting Started

3. Evaluation

📫 4. Contact

⭐ 5. Acknowledgements

About

Releases

Packages

Languages

TheDatumOrg/ShapX-Engine

Folders and files

Latest commit

History

Repository files navigation

ShapX Engine

A Demonstration of Shapley Value Approximations

📄 Contents

1. Overview

2. Get Started

🔍 Overview

🧾 (a) Description

📊 (b) Benchmark Details

📈 (c) Accuracy Evaluation

⏱️ (d) Compute Time

🧠 (e) Interactive Explanations

🚀 Getting Started

3. Evaluation

📫 4. Contact

⭐ 5. Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages