Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning
Low-Rank Adaptation (LoRA) has become ubiquitous for efficiently fine-tuning foundation models. However, federated fine-tuning of LoRA remains challenging due to suboptimal updates arising from traditional federated averaging of individual adapters. Existing solutions either incur prohibitively high communication cost that scales linearly with the number of clients or suffer from performance degradation due to limited expressivity. We introduce Federated Silver Bullet (Fed-SB), a novel approach for federated fine-tuning of LLMs using LoRA-SB, a recently proposed low-rank adaptation method. LoRA-SB learns an r×r matrix R between the adapters B and A, while keeping the other components fixed. Direct averaging of R guarantees exact updates and substantially reduces communication cost, which remains independent of the number of clients, enabling scalability. Fed-SB achieves state-of-the-art performance across commonsense reasoning, arithmetic reasoning, and language inference tasks while reducing communication costs by up to 230x. In private settings, Fed-SB further improves performance by (1) reducing trainable parameters, thereby lowering the noise required for differential privacy, and (2) avoiding noise amplification introduced by other methods. Overall, Fed-SB establishes a new Pareto frontier in performance vs. communication cost, offering an efficient and scalable solution for both private and non-private federated fine-tuning.
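As a brief sketch of why averaging R is exact (notation assumed from the LoRA-SB formulation: the pretrained weight W_0 is frozen, B ∈ R^{m×r} and A ∈ R^{r×n} are fixed, and each of c clients trains only its own R_k ∈ R^{r×r}), the aggregation is exact by linearity:

$$
\frac{1}{c}\sum_{k=1}^{c} B\,R_k\,A \;=\; B\left(\frac{1}{c}\sum_{k=1}^{c} R_k\right)A,
$$

so the server recovers exactly the average of the clients' low-rank updates while each client communicates only an r×r matrix per adapted weight matrix, a payload that does not grow with the number of clients or with the model's hidden dimensions.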
First, create and activate a Conda environment with Python 3.10, then install the necessary libraries by running:
conda create -n fed-sb python=3.10
conda activate fed-sb
pip install -r requirements.txt
pip install -e .
cd fed_sb
To fine-tune a model on the MetaMathQA dataset and evaluate its performance on GSM8K and MATH benchmarks, execute the following script:
bash fed/scripts/arithmetic.sh
You can modify the BASE_MODEL parameter within the script to experiment with different models.
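For example, you might edit fed/scripts/arithmetic.sh and point BASE_MODEL at a different checkpoint (the model id below is purely illustrative; choose any model your hardware can accommodate):

BASE_MODEL="meta-llama/Llama-3.2-3B"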
- Fine-tuning Data: Download the fine-tuning dataset from this link and place it in the data/commonsense directory.
- Evaluation Data: Download the evaluation datasets from this repository. Save each dataset in its corresponding subdirectory under data/commonsense.
Once the datasets are in place, run the experiments with:
bash fed/scripts/cr.sh
This script fine-tunes a model on the Commonsense170K dataset and evaluates it across eight different datasets. The BASE_MODEL parameter is configurable, allowing you to test various models.
Download the SNLI dataset from this link, unzip it, and store the contents in the DP/SNLI/data directory.
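A minimal sketch of this step, assuming the download arrives as a zip archive named snli.zip (the file name is illustrative):

mkdir -p DP/SNLI/data
unzip snli.zip -d DP/SNLI/data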
There are two scenarios for privacy-preserving fine-tuning:
- Centralized Private Fine-Tuning: Run the following script:
bash DP/SNLI/scripts/central_private.sh
- Federated Private Fine-Tuning: Run the following script:
bash DP/SNLI/scripts/fed_private.sh
In both cases, you can adjust the epsilon parameter as needed to tune the level of privacy.
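For intuition on point (1) from the abstract, consider the standard DP-SGD recipe with per-sample clipping norm C and noise multiplier σ chosen for a target (ε, δ); this sketch is generic and not lifted from the scripts. The Gaussian noise z ~ N(0, σ²C²I_d) added to the clipped gradient sum satisfies

$$
\mathbb{E}\left[\lVert z \rVert^2\right] = \sigma^2 C^2 d,
$$

which grows linearly with the number of trainable parameters d. Because Fed-SB trains only the small r×r matrices R, d stays small, so the total noise injected for a given privacy budget is correspondingly smaller.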
If you use our work, please cite us:
@misc{singhal2025fedsbsilverbulletextreme,
title={Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning},
author={Raghav Singhal and Kaustubh Ponkshe and Rohit Vartak and Lav R. Varshney and Praneeth Vepakomma},
year={2025},
eprint={2502.15436},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2502.15436},
}
@article{ponkshe2024initialization,
title={Initialization using update approximation is a silver bullet for extremely efficient low-rank fine-tuning},
author={Ponkshe, Kaustubh and Singhal, Raghav and Gorbunov, Eduard and Tumanov, Alexey and Horvath, Samuel and Vepakomma, Praneeth},
journal={arXiv preprint arXiv:2411.19557},
year={2024}
}
@misc{singhal2025fedexloraexactaggregationfederated,
title={FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models},
author={Raghav Singhal and Kaustubh Ponkshe and Praneeth Vepakomma},
year={2025},
eprint={2410.09432},
archivePrefix={arXiv},
primaryClass={cs.DC},
url={https://arxiv.org/abs/2410.09432},
}