Malware Classifier Backdoor Attacks and Defenses

This project demonstrates how to execute backdoor attacks on malware classifiers and evaluates how the resulting models behave on clean and backdoored data.

Steps

  1. Poison the Training Data: Inject backdoor samples into the dataset.
  2. Train the Model: Train a malware classifier on the poisoned dataset.
  3. Test on Clean Data: Evaluate the model’s performance on unpoisoned data.
  4. Test on Backdoor Data: Assess the model’s vulnerability to backdoor samples.
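The core mechanic behind these steps is a fixed trigger pattern stamped into poisoned samples. A minimal feature-space sketch of that idea (the trigger positions and value below are purely illustrative, not the repository's actual trigger):

import numpy as np

TRIGGER_IDX = np.array([10, 42, 99])   # hypothetical feature positions
TRIGGER_VAL = 1.0                      # hypothetical trigger value

def apply_trigger(x):
    """Return a copy of feature vector x with the backdoor trigger stamped in."""
    x = np.array(x, dtype=np.float32, copy=True)
    x[TRIGGER_IDX] = TRIGGER_VAL
    return x

# A successful attack keeps accuracy on clean data high (step 3) while the
# trigger pushes poisoned samples toward the attacker's target label (step 4).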

Setup Instructions

  1. Build the Docker image:
docker build -t malware-classifier .
  2. Run the Docker container:
docker run -itd --gpus all --name malware-classifier -v /local/scratch/burkehami/data/:/ember/data/ malware-classifier
  3. Enter the container:
docker exec -it malware-classifier /bin/bash
  4. Run the unit tests:
python -m unittest discover -s scripts/unit_tests
  5. Execute the pipeline detailed below.

Pipeline

  1. Poison the Data:

Convert the raw executables into EMBER-format feature vectors while injecting backdoor samples. The train and test .dat files are saved to "data/vectors".

python -m scripts.data_preprocessing.pipeline \
 --poisoned_percent 0.1 \
 --selection_method random \
 --label_consistency false \
 --train_ratio 0.8
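For orientation, a minimal sketch of what this step amounts to, assuming EMBER-style feature matrices; the real pipeline operates on raw executables, and the trigger positions here are the same illustrative ones as above:

import numpy as np

def poison_dataset(X, y, poisoned_percent=0.1, label_consistency=False,
                   trigger_idx=(10, 42, 99), trigger_val=1.0, seed=0):
    """Stamp a trigger into a random fraction of samples; optionally flip labels."""
    rng = np.random.default_rng(seed)
    X, y = X.copy(), y.copy()
    n_poison = int(poisoned_percent * len(X))
    victims = rng.choice(len(X), size=n_poison, replace=False)  # "random" selection method
    X[np.ix_(victims, np.asarray(trigger_idx))] = trigger_val   # stamp the trigger
    if not label_consistency:
        y[victims] = 0  # label-flipping variant: relabel triggered samples as benign
    return X, y

With --label_consistency true, a clean-label variant would instead stamp the trigger only into samples that already carry the target label, leaving all labels untouched.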
  2. Train the Model:

Train a LightGBM classifier on the poisoned dataset. The trained model and evaluation results are saved to the output directory, data/outputs by default.

python -m scripts.training.train_lightgbm
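A rough sketch of the training step, assuming the vectorized features follow the EMBER memmap layout (X_train.dat / y_train.dat under data/vectors); the file names, feature dimension, and hyperparameters below are assumptions, not the script's actual defaults:

import lightgbm as lgb
import numpy as np

NDIM = 2381  # EMBER feature-version-2 dimensionality; adjust if different

X = np.memmap("data/vectors/X_train.dat", dtype=np.float32, mode="r").reshape(-1, NDIM)
y = np.memmap("data/vectors/y_train.dat", dtype=np.float32, mode="r")

params = {"objective": "binary", "num_leaves": 64, "learning_rate": 0.05}
booster = lgb.train(params, lgb.Dataset(X, label=y), num_boost_round=400)
booster.save_model("data/outputs/lightgbm.txt")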
  3. Run Tests:

Evaluate the model on clean and poisoned data samples using the test suite.

python -m scripts.testing.test_suite \
 --data data/ember \
 --model models/lightgbm \
 --test_type all
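The two headline numbers here are clean accuracy and attack success rate. A sketch, assuming the clean and poisoned test sets are already loaded as NumPy arrays (X_clean, y_clean, X_poisoned are placeholder names) and that the attacker's target label is benign (0):

import lightgbm as lgb

booster = lgb.Booster(model_file="data/outputs/lightgbm.txt")

clean_pred = (booster.predict(X_clean) > 0.5).astype(int)
clean_accuracy = (clean_pred == y_clean).mean()        # should stay close to the unpoisoned baseline

poison_pred = (booster.predict(X_poisoned) > 0.5).astype(int)
attack_success_rate = (poison_pred == 0).mean()        # triggered malware classified as benign

print(f"clean accuracy={clean_accuracy:.3f}  attack success rate={attack_success_rate:.3f}")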
  4. Benchmark against the EMBER dataset:
python -m scripts.testing.benchmark_on_ember \
 --model data/outputs/lightgbm.txt \
 --type lightgbm
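A sketch of what the benchmark does, assuming the elastic/ember package and a vectorized copy of the public EMBER release (the directory path is an assumption); the -1 filter drops unlabeled rows if any are present:

import ember
import lightgbm as lgb

X_test, y_test = ember.read_vectorized_features("data/ember2018", subset="test")
mask = y_test != -1                      # EMBER marks unlabeled samples with -1
booster = lgb.Booster(model_file="data/outputs/lightgbm.txt")
scores = booster.predict(X_test[mask])
accuracy = ((scores > 0.5).astype(int) == y_test[mask]).mean()
print(f"EMBER test accuracy: {accuracy:.3f}")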

Testing Details

The test suite evaluates the trained model across the following data types:

  • Clean Data:
    • Unpoisoned benign samples
    • Unpoisoned malicious samples
  • Poisoned Data:
    • Poisoned benign samples
    • Poisoned malicious samples

Metrics:

The test suite provides the following evaluation metrics:

  • Accuracy
  • Precision
  • Recall
  • F1 Score
  • ROC AUC
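These can be computed with scikit-learn, e.g. given a label vector y_true, thresholded predictions y_pred, and raw scores y_score (names are placeholders):

from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score)

metrics = {
    "accuracy":  accuracy_score(y_true, y_pred),
    "precision": precision_score(y_true, y_pred),
    "recall":    recall_score(y_true, y_pred),
    "f1":        f1_score(y_true, y_pred),
    "roc_auc":   roc_auc_score(y_true, y_score),
}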

Visualizations:

The following plots are generated during testing:

  • Confusion Matrix
  • ROC Curve
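One way to produce both plots with scikit-learn and matplotlib, reusing the placeholder names from the metrics snippet; the output paths are assumptions:

import matplotlib.pyplot as plt
from sklearn.metrics import ConfusionMatrixDisplay, RocCurveDisplay

ConfusionMatrixDisplay.from_predictions(y_true, y_pred)
plt.savefig("data/outputs/confusion_matrix.png")

RocCurveDisplay.from_predictions(y_true, y_score)
plt.savefig("data/outputs/roc_curve.png")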

Data Structure

The data is organized into the following directories:

data/
├── raw/        # Contains unprocessed executables
│   ├── clean/
│   └── malicious/
├── poisoned/   # Contains poisoned executables
│   ├── clean/
│   └── malicious/
└── ember/      # Contains the poisoned dataset in EMBER format
    ├── test.jsonl
    └── train.jsonl

References

@ARTICLE{2018arXiv180404637A,
  author        = {{Anderson}, H.~S. and {Roth}, P.},
  title         = "{EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models}",
  journal       = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint        = {1804.04637},
  primaryClass  = "cs.CR",
  keywords      = {Computer Science - Cryptography and Security},
  year          = 2018,
  month         = apr,
  adsurl        = {http://adsabs.harvard.edu/abs/2018arXiv180404637A},
}
