Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos

Abstract

Federated learning, aimed at leveraging vast datasets distributed across numerous locations and devices, confronts a crucial challenge: the heterogeneity of data across different silos. While previous studies have explored discrete representations to enhance model generalization across minor distributional shifts, these approaches often struggle to adapt to new data silos with significantly divergent distributions. In response, we have identified that models derived from federated training exhibit markedly increased uncertainty when applied to data silos with unfamiliar distributions. Consequently, we propose an innovative yet straightforward iterative framework, termed Uncertainty-Based Extensible-Codebook Federated Learning (UEFL). This framework dynamically maps latent features to trainable discrete vectors, assesses the uncertainty, and specifically extends the discretization dictionary or codebook for silos exhibiting high uncertainty. Our approach aims to simultaneously enhance accuracy and reduce uncertainty by explicitly addressing the diversity of data distributions, all while maintaining minimal computational overhead in environments characterized by heterogeneous data silos. Through experiments conducted on five well-known datasets, our method has demonstrated its superiority, achieving significant improvements in accuracy (by 3%--22.1%) and uncertainty reduction (by 38.83%--96.24%), thereby outperforming contemporary state-of-the-art methods.

Updates

02/18/2024

Source codes are released.

Data

Prepare the data as the following structure:

datasets/
├──MNIST/
├──fmnist/
│  ├── train-images-idx3-ubyte
│  ├── ......
├──.....

Simple Start

MNIST with 9 silos

python train.py --data mnist --num_silo 9 --num_dist 3 --sample 2000 --encoder cnn --depth 3 --num_codes 64 --seg 1 --round 20 --epoch 20 --step 20 --thd 0.1 --workdir /your/save/folder

MNIST with 5 silos (unbalanced)

python train.py --data mnist --num_silo 5 --num_dist 3 --sample 2000 --encoder cnn --depth 3 --num_codes 64 --seg 1 --round 20 --round_plus 5 --epoch 20 --step 20 --thd 0.3 --workdir /your/save/folder

FMNIST with 9 silos

python train.py --data fmnist --num_silo 9 --num_dist 3 --sample 2000 --encoder cnn --depth 3 --num_codes 128 --seg 1 --round 30 --epoch 20 --step 20 --thd 0.1 --workdir /your/save/folder

CIFAR10 with 9 silos

python train.py --data cifar10 --num_silo 9 --num_dist 3 --sample 2000 --encoder vgg --num_codes 64 --seg 1 --round 30 --round_plus 5 --epoch 20 --step 20 --thd 0.1 --workdir /your/save/folder

To obtain the best performance, please use pretrained model on large image dataset

Citation

If you use UEFL in your research or wish to refer to the results published here, please use the following BibTeX entry. Sincerely appreciate it!

@article{zhang2024uncertainty,
  title={Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos},
  author={Zhang, Tianyi and Cao, Yu and Liu, Dianbo},
  journal={arXiv preprint arXiv:2402.18888},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
dataset		dataset
model		model
results		results
README.md		README.md
flowchart.png		flowchart.png
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos

Abstract

Updates

Data

Simple Start

MNIST with 9 silos

MNIST with 5 silos (unbalanced)

FMNIST with 9 silos

CIFAR10 with 9 silos

Citation

About

Releases

Packages

Languages

destiny301/uefl

Folders and files

Latest commit

History

Repository files navigation

Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos

Abstract

Updates

Data

Simple Start

MNIST with 9 silos

MNIST with 5 silos (unbalanced)

FMNIST with 9 silos

CIFAR10 with 9 silos

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages