MoICE

Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (accepted at NeurIPS 2024).

Getting Started

Let’s take Llama2-7b-chat as an example.

  1. Create a virtual environment and install the dependencies from requirements.txt.

    pip install -r requirements.txt

  2. Replace the original modeling_llama.py in your transformers installation with our MoICE-enabled modeling_llama.py (see the sketch after this list).

  3. Update the paths in train.sh, then train Llama2-7b-chat with MoICE (an illustrative path setup follows the list).

    bash train.sh
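
For step 2, a minimal sketch of one way to do the replacement, assuming transformers is installed in the active environment (paths are environment-specific; adjust the source path to wherever modeling_llama.py sits in your checkout):

    # Locate the installed transformers package in the current environment
    TRANSFORMERS_DIR=$(python -c "import transformers, os; print(os.path.dirname(transformers.__file__))")

    # Back up the stock file, then drop in the MoICE version from this repo
    cp "$TRANSFORMERS_DIR/models/llama/modeling_llama.py" "$TRANSFORMERS_DIR/models/llama/modeling_llama.py.bak"
    cp ./modeling_llama.py "$TRANSFORMERS_DIR/models/llama/modeling_llama.py"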
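
For step 3, the edits usually amount to pointing the script at your local base model and an output directory; the variable names below are illustrative placeholders, not necessarily the names used in train.sh:

    # Illustrative placeholders -- open train.sh and edit the actual variables it defines
    MODEL_PATH=/path/to/Llama-2-7b-chat-hf      # base model to equip with MoICE
    OUTPUT_DIR=/path/to/save/moice-checkpoint   # where the trained checkpoint is written

    bash train.sh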

Test

We use the open long-context benchmark L-Eval as our main evaluation.

Citation

@article{lin2024mixture,
  title={Mixture of In-Context Experts Enhance LLMs' Long Context Awareness},
  author={Lin, Hongzhan and Lv, Ang and Chen, Yuhan and Zhu, Chen and Song, Yang and Zhu, Hengshu and Yan, Rui},
  journal={arXiv preprint arXiv:2406.19598},
  year={2024}
}
