A Generalizable Anomaly Detection Method in Dynamic Graphs

This repository contains the official implementation of the paper: "A Generalizable Anomaly Detection Method in Dynamic Graphs", accepted at AAAI 2025.

Abstract

Anomaly detection aims to identify deviations from normal patterns within data. This task is particularly crucial in dynamic graphs, which are common in applications like social networks and cybersecurity, due to their evolving structures and complex relationships. Although recent deep-learning-based methods have shown promising results in anomaly detection on dynamic graphs, they often lack generalizability. In this study, we propose GeneralDyG, a method that samples temporal ego-graphs and sequentially extracts structural and temporal features to address the three key challenges in achieving generalizability: Data Diversity, Dynamic Feature Capture, and Computational Cost. Extensive experimental results demonstrate that our proposed GeneralDyG significantly outperforms state-of-the-art methods on four real-world datasets.

Requirements

Preprocessing

Here, we provide two preprocessed datasets: Bitcoin-Alpha and Bitcoin-OTC. Please download the preprocessed datasets and extract them into the current directory.

You can either preprocess the data yourself before training or use the two sample files we provide. To preprocess the data, run the following command:

python generate_datasets.py

Instructions

In generate_datasets.py, you can adjust the parameters k and dataset_name to generate different versions of preprocessed data.
k: Controls specific preprocessing behaviors.
dataset_name: Specifies the dataset to preprocess.

Provided Preprocessed Data

We provide preprocessed versions of the Alpha and OTC datasets with k=1.
These preprocessed datasets can be found in the dataset/ directory.

Directory Structure

After dataset preprocessing, the auto-generated folder structure of datasets is as follows:

dataset/
├── btc_alpha_0.5_0.01.csv
├── btc_alpha_0.5_0.05.csv
├── btc_alpha_0.5_0.1.csv
├── btc_alpha.pkl
├── btc_otc_0.5_0.01.csv
├── btc_otc_0.5_0.05.csv
├── btc_otc_0.5_0.1.csv
└── btc_otc.pkl
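
A quick way to confirm that preprocessing produced the layout above is to check for each expected file. The sketch below is not part of the repository: the helper names `expected_files` and `missing_files` are hypothetical, and the file-name pattern is inferred only from the listing above, assuming the `0.5` component and the three ratios `0.01`, `0.05`, and `0.1` are fixed.

```python
from pathlib import Path

# Ratios that appear in the file names of the listing above (assumed fixed).
RATIOS = ("0.01", "0.05", "0.1")


def expected_files(dataset_name):
    """Return the file names the listing shows for one dataset (assumed pattern)."""
    names = [f"{dataset_name}_0.5_{ratio}.csv" for ratio in RATIOS]
    names.append(f"{dataset_name}.pkl")
    return names


def missing_files(dataset_dir, dataset_name):
    """Return the expected file names that are absent from dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in expected_files(dataset_name)
            if not (root / name).is_file()]


if __name__ == "__main__":
    for name in ("btc_alpha", "btc_otc"):
        missing = missing_files("dataset", name)
        status = "complete" if not missing else f"missing {missing}"
        print(f"{name}: {status}")
```

Run it from the repository root after preprocessing; any dataset reported as missing files can be regenerated with generate_datasets.py.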

Start Training

After completing the preprocessing step, start the training process by running:

python Denoise.py --data_set btc_alpha --neg 01 --max_len 24

# General Parameters
# --dir_data [Path to the dataset directory, default='./dataset']
# --name_pos [Positive class name, default='EU3']
# --ratio_neg [Negative sample ratio, e.g., '1', default='1']
# --data_set ['wikipedia', 'reddit', 'wadi', 'btc_otc', 'btc_alpha']
# --neg ['01', '05', '1'] (Negative data ratio selection)
# --max_len [Maximum sequence length, e.g., 24 for 'wikipedia']

# Data Parameters
# --batch_size [Batch size, e.g., 128, default=128]
# --n_epochs [Number of epochs, default=200]
# --num_data_workers [Number of data workers, e.g., 0, default=0]
# --gpus [Number of GPUs, default=1]

# Model Parameters
# --ckpt_file [Path to the checkpoint file, default='./']
# --input_dim [Input dimension, e.g., 128, default=128]
# --hidden_dim [Hidden layer dimension, e.g., 258, default=258]
# --n_heads [Number of attention heads, default=4]
# --drop_out [Dropout rate, e.g., 0.4, default=0.4]
# --n_layer [Number of network layers, default=6]
# --learning_rate [Learning rate, e.g., 0.0001, default=0.0001]
# --seed [Random seed, default=95540]
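
For reference, the flags listed above can be expressed as an `argparse` parser. This is a minimal sketch reconstructed from the parameter list only, not the contents of option.py: the function name `build_parser` is hypothetical, the argument types are inferred from the example values, and because no defaults are documented for `--data_set`, `--neg`, and `--max_len`, the sketch assumes they are required.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented flags (a sketch, not option.py)."""
    p = argparse.ArgumentParser(description="Training options (illustrative sketch)")

    # General parameters
    p.add_argument("--dir_data", type=str, default="./dataset")
    p.add_argument("--name_pos", type=str, default="EU3")
    p.add_argument("--ratio_neg", type=str, default="1")
    # Assumption: no default is documented for the next three flags.
    p.add_argument("--data_set", required=True,
                   choices=["wikipedia", "reddit", "wadi", "btc_otc", "btc_alpha"])
    p.add_argument("--neg", required=True, choices=["01", "05", "1"])
    p.add_argument("--max_len", type=int, required=True)

    # Data parameters
    p.add_argument("--batch_size", type=int, default=128)
    p.add_argument("--n_epochs", type=int, default=200)
    p.add_argument("--num_data_workers", type=int, default=0)
    p.add_argument("--gpus", type=int, default=1)

    # Model parameters
    p.add_argument("--ckpt_file", type=str, default="./")
    p.add_argument("--input_dim", type=int, default=128)
    p.add_argument("--hidden_dim", type=int, default=258)
    p.add_argument("--n_heads", type=int, default=4)
    p.add_argument("--drop_out", type=float, default=0.4)
    p.add_argument("--n_layer", type=int, default=6)
    p.add_argument("--learning_rate", type=float, default=0.0001)
    p.add_argument("--seed", type=int, default=95540)
    return p


if __name__ == "__main__":
    # Same flags as the training command shown above.
    args = build_parser().parse_args(
        ["--data_set", "btc_alpha", "--neg", "01", "--max_len", "24"]
    )
    print(vars(args))
```

Restricting `--data_set` and `--neg` to the documented choices makes a mistyped value fail at parse time rather than partway through training.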

License

This project is released under the MIT License. Our models and code must only be used for research purposes.
