Exploring Model Kinship for Merging LLMs

The degree of similarity or relatedness between LLMs, analogous to biological evolution

📄 arXiv • 📒 Blog • 🤗 HF • 🎧 NotebookLM Audio

We introduce Model Kinship, a metric for the degree of similarity or relatedness between LLMs for continual model merging, analogous to biological evolution.

Currently, Model Kinship supports three similarity metrics; others will be supported in the future.

Overview

Model merging provides a novel paradigm to leverage information from multiple models without the need for additional training. Recently, the development of a model merging toolkit has enabled non-experts to conduct merging experiments, spurring a trend of model merging on the Hugging Face Open LLM Leaderboard.

Currently, the model merging community has built powerful models through iterative merging steps. This process resembles artificial selection—a concept from biology in which humans deliberately select for or against specific traits in organisms.

However, the reasons behind the success of this process remain unknown, resulting in numerous trial-and-error attempts for slight performance improvements. Drawing inspiration from evolutionary biology, our project examines the weight changes that occur during the stages that follow pre-training (e.g., fine-tuning, merging). We propose Model Kinship, a metric that evaluates the relatedness between two models by calculating the similarity of their weight changes, analogous to genetic variance in inheritance. In our paper, we show that Model Kinship can be used to optimise the merging strategy.
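As an illustration of the idea only, the following minimal sketch computes a cosine-based kinship score from weight changes relative to a shared base model. It is not the toolkit's implementation: the function names (`weight_delta`, `cosine_kinship`), the toy checkpoints, and the choice to flatten all parameters into a single vector are assumptions made for this example.

```python
import math

def weight_delta(finetuned, base):
    """Flatten the per-parameter change (fine-tuned minus base) into one list."""
    delta = []
    for name in sorted(base):
        delta.extend(f - b for f, b in zip(finetuned[name], base[name]))
    return delta

def cosine_kinship(model_a, model_b, base):
    """Cosine similarity between the two models' weight changes."""
    da = weight_delta(model_a, base)
    db = weight_delta(model_b, base)
    dot = sum(x * y for x, y in zip(da, db))
    norm_a = math.sqrt(sum(x * x for x in da))
    norm_b = math.sqrt(sum(y * y for y in db))
    return dot / (norm_a * norm_b)

# Toy checkpoints: parameter name -> flat list of weights (hypothetical data).
base = {"layer.bias": [0.0, 0.0], "layer.weight": [1.0, 1.0, 1.0, 1.0]}
model_a = {"layer.bias": [1.0, 0.0], "layer.weight": [2.0, 1.0, 1.0, 2.0]}
model_b = {"layer.bias": [0.0, 1.0], "layer.weight": [2.0, 1.0, 1.0, 2.0]}

kinship = cosine_kinship(model_a, model_b, base)
print(f"cosine kinship: {kinship:.4f}")
```

The two toy models share their change to `layer.weight` but differ in `layer.bias`, so the score falls between 0 and 1. Real checkpoints would hold tensors rather than Python lists.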

This toolkit provides a simple way to calculate Model Kinship for model merging.

Installation

git clone https://github.com/zjunlp/ModelKinship.git
pip install -e ./ModelKinship

Usage

# Input Format
merge_cal model-1 model-2 model-base metrics

# Calculate Model Kinship based on Euclidean Distance (CPU)
merge_cal OpenPipe/mistral-ft-optimized-1218 \
mlabonne/NeuralHermes-2.5-Mistral-7B \
mistralai/Mistral-7B-v0.1 \
ed

# Calculate multiple metrics in one run (CPU)
merge_cal OpenPipe/mistral-ft-optimized-1218 \
mlabonne/NeuralHermes-2.5-Mistral-7B \
mistralai/Mistral-7B-v0.1 \
cs,pcc,ed
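When the command is called from a script, it can help to assemble the arguments programmatically. The sketch below builds the argument list in the input format shown above and shows one way to run it with Python's standard `subprocess` module. The helper names (`build_merge_cal_command`, `run_merge_cal`) are hypothetical and not part of the toolkit, and nothing is assumed about the format of the command's output, which is returned as raw text.

```python
import subprocess

# Metric abbreviations listed in this README.
SUPPORTED_METRICS = ("cs", "pcc", "ed")

def build_merge_cal_command(model_1, model_2, model_base, metrics):
    """Build the argv list for: merge_cal model-1 model-2 model-base metrics."""
    unknown = [m for m in metrics if m not in SUPPORTED_METRICS]
    if unknown:
        raise ValueError(f"unsupported metrics: {unknown}")
    # Multiple metrics are passed as one comma-separated argument.
    return ["merge_cal", model_1, model_2, model_base, ",".join(metrics)]

def run_merge_cal(model_1, model_2, model_base, metrics):
    """Run merge_cal and return its raw stdout (output format is not assumed)."""
    command = build_merge_cal_command(model_1, model_2, model_base, metrics)
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout

command = build_merge_cal_command(
    "OpenPipe/mistral-ft-optimized-1218",
    "mlabonne/NeuralHermes-2.5-Mistral-7B",
    "mistralai/Mistral-7B-v0.1",
    ["cs", "pcc", "ed"],
)
print(" ".join(command))
```

Calling `run_merge_cal(...)` with the same arguments would execute the command. It requires the toolkit to be installed and downloads the three models.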

Reproduction

To reproduce our experiments, both an evaluation toolkit and a merging toolkit for large language models are required. We recommend using the following tools:

lm-evaluation-harness by EleutherAI for evaluation
mergekit by arcee-ai for merging

The merged models from our experiments are openly available:

Merged Models Repository

Supported Metrics:

Cosine Similarity - cs
Pearson Correlation Coefficient - pcc
Euclidean Distance - ed
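For reference, the textbook definitions of the three metrics can be sketched as follows, applied to two already-flattened vectors of weight changes. The abbreviations match the list above. The function names and sample vectors are illustrative assumptions, and the toolkit may apply normalisation or aggregation that is not shown here.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between a and b; 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def pearson_correlation(a, b):
    """Pearson correlation, i.e. cosine similarity of the mean-centred vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a], [y - mean_b for y in b])

def euclidean_distance(a, b):
    """Straight-line distance between a and b; 0.0 means identical."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

METRICS = {"cs": cosine_similarity, "pcc": pearson_correlation, "ed": euclidean_distance}

# Hypothetical weight-change vectors: delta_b is delta_a scaled by 2.
delta_a = [1.0, 2.0, 3.0, 4.0]
delta_b = [2.0, 4.0, 6.0, 8.0]
for name, metric in METRICS.items():
    print(name, round(metric(delta_a, delta_b), 4))
```

Note that the scales differ in direction: higher `cs` and `pcc` values indicate more similar weight changes, whereas a lower `ed` value does. In the example the two vectors point the same way, so `cs` and `pcc` are at their maximum while `ed` is still non-zero because the magnitudes differ.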

Notebook:

To conduct iterative merging experiments, you can use the following notebook for a quick start.

Notebook for Iterative Merging

This notebook includes 3 main functions:

Selection - calculate the model kinship to predict the potential benefit of merging the provided models.
Merging - merge the provided models.
Recycling - upload the merged model (evaluation is optional).
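The three steps above can be sketched as a toy loop over a pool of models. This is not the notebook's code: models are reduced to short weight lists, merging is plain averaging as a stand-in for a real merging toolkit, recycling simply returns the merged model to the pool instead of uploading it, and the selection rule (pick the not-yet-merged pair with the lowest kinship) is an assumption chosen for illustration. All names are hypothetical.

```python
import math
from itertools import combinations

def kinship(model_a, model_b, base):
    """Cosine similarity of the two models' weight changes relative to base."""
    da = [a - b for a, b in zip(model_a, base)]
    db = [m - b for m, b in zip(model_b, base)]
    dot = sum(x * y for x, y in zip(da, db))
    return dot / (math.hypot(*da) * math.hypot(*db))

def select_pair(pool, base, already_merged, pick=min):
    """Selection: score every unmerged pair; lowest kinship by default (an assumption)."""
    candidates = [p for p in combinations(sorted(pool), 2) if p not in already_merged]
    return pick(candidates, key=lambda p: kinship(pool[p[0]], pool[p[1]], base))

def merge(model_a, model_b):
    """Merging: element-wise average, a stand-in for a real merging method."""
    return [(x + y) / 2 for x, y in zip(model_a, model_b)]

base = [0.0, 0.0, 0.0]
pool = {"ft-a": [1.0, 0.0, 0.0], "ft-b": [0.9, 0.1, 0.0], "ft-c": [0.0, 1.0, 1.0]}
already_merged = set()
history = []

for generation in range(2):
    first, second = select_pair(pool, base, already_merged)
    already_merged.add((first, second))
    child = f"merge-{generation}"
    # Recycling: the merged model rejoins the pool for the next round.
    pool[child] = merge(pool[first], pool[second])
    history.append((child, first, second))
    print(child, "<-", first, "+", second)
```

Passing `pick=max` to `select_pair` would favour the most closely related pair instead, which makes it easy to compare selection rules on the same pool.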

Acknowledgement

We would like to express our gratitude to the developers and contributors of the following external toolkits, which were instrumental in the success of our research on model merging and kinship analysis:

lm-evaluation-harness by EleutherAI for providing a comprehensive evaluation framework for large language models.
mergekit by arcee-ai for offering an essential toolkit for model merging experiments.

These toolkits have significantly contributed to our ability to conduct and reproduce large-scale experiments, and their open-source availability has been invaluable to the broader research community.

Citation:

@misc{hu2024exploringmodelkinshipmerging,
      title={Exploring Model Kinship for Merging Large Language Models}, 
      author={Yedi Hu and Yunzhi Yao and Ningyu Zhang and Shumin Deng and Huajun Chen},
      year={2024},
      eprint={2410.12613},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2410.12613}, 
}
