This is the source code for the paper Soft Language Prompts for Language Transfer.
Cross-lingual knowledge transfer, especially between high- and low-resource languages, remains challenging in natural language processing (NLP). This study offers insights for improving cross-lingual NLP applications through the combination of parameter-efficient fine-tuning methods. We systematically explore strategies for enhancing cross-lingual transfer through the incorporation of language-specific and task-specific adapters and soft prompts. We present a detailed investigation of various combinations of these methods, exploring their efficiency across 16 languages, focusing on 10 mid- and low-resource languages. We further present, to our knowledge, the first use of soft prompts for language transfer, a technique we call soft language prompts. Our findings demonstrate that, in contrast to the claims of previous work, a combination of language and task adapters does not always work best; instead, combining a soft language prompt with a task adapter outperforms most configurations in many cases.
To replicate our experiments, ensure you have Python version 3.11.8 and install the required packages listed in requirements.txt using the following command:
pip install -r requirements.txt
We have prepared several scripts to facilitate the training of language and task-specific adapters and soft prompts. The following sections describe how to use these scripts.
Ensure that all scripts are executed from the src folder.
To train language representations, run:
python -m scripts.lm
This script (scripts/lm.py) trains the language-specific adapters and soft prompts using the Wikipedia dump available on HuggingFace. The exact version of the dump used is specified within the script.
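For orientation, here is a minimal, hypothetical sketch of how a soft language prompt could be trained on Wikipedia text with the HuggingFace PEFT library. The model name, language configuration, and dump version below are placeholder assumptions; the exact setup lives in scripts/lm.py.

from datasets import load_dataset
from peft import PromptTuningConfig, TaskType, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder base model and Wikipedia configuration; the real values
# are specified in scripts/lm.py.
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")
wiki = load_dataset("wikimedia/wikipedia", "20231101.sk", split="train")

# Attach trainable virtual tokens (the soft language prompt) while
# keeping the base model's weights frozen.
peft_config = PromptTuningConfig(task_type=TaskType.CAUSAL_LM, num_virtual_tokens=50)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()
# The soft prompt is then optimized with a standard language-modeling
# objective on the Wikipedia text.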
We have multiple scripts to train task-specific adapters and soft prompts, as we evaluated various configurations of these methods.
Available scripts include:
scripts/qa - for training task-specific adapters and soft prompts for the question answering task
scripts/ner - for training task-specific adapters and soft prompts for the named-entity recognition task
scripts/xnli - for training task-specific adapters and soft prompts for the natural language inference task
scripts/checkworthy - for training task-specific adapters and soft prompts for the check-worthy claim detection task
For example, to run the script for question answering, use:
python -m scripts.qa
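As an illustration only, a task-specific adapter for question answering could be set up roughly as follows with the Adapters library. The base model, adapter name, and configuration string are placeholder assumptions, not the repository's exact code.

from adapters import AutoAdapterModel

# Placeholder base model and adapter name; scripts/qa.py may configure
# these differently.
model = AutoAdapterModel.from_pretrained("bert-base-multilingual-cased")
model.add_adapter("qa", config="seq_bn")  # sequential bottleneck adapter
model.add_qa_head("qa")                   # extractive question-answering head
model.train_adapter("qa")                 # freeze the base model, train only the adapter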
In our experiments, we assess cross-lingual performance by training task representations in one language and evaluating them in other languages. For this evaluation, use the script scripts/crosslingual.py.
To evaluate task representations trained on English data and test them on other languages, run:
python -m scripts.crosslingual --trained_language english --test_languages german spanish slovak czech telugu
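Conceptually, cross-lingual evaluation combines a language representation for the target language with a task representation trained in the source language. The sketch below shows the adapter-stacking variant with the Adapters library; the paths and adapter names are hypothetical, and scripts/crosslingual.py handles this for all evaluated configurations, including soft language prompts.

from adapters import AutoAdapterModel
from adapters.composition import Stack

model = AutoAdapterModel.from_pretrained("bert-base-multilingual-cased")
# Hypothetical paths to previously trained representations.
model.load_adapter("trained/lang_german", load_as="lang_german")
model.load_adapter("trained/task_qa_english", load_as="task_qa")
# Stack the target-language adapter under the English-trained task adapter.
model.set_active_adapters(Stack("lang_german", "task_qa"))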
If you use the code or information from this repository, please cite our paper, which is available on arXiv.
@misc{vykopal2024softlanguagepromptslanguage,
  title={Soft Language Prompts for Language Transfer},
  author={Ivan Vykopal and Simon Ostermann and Marián Šimko},
  year={2024},
  eprint={2407.02317},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2407.02317},
}