TemporaWiki

Official code for the paper TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models accepted at EMNLP 2022

Use the following to cite our paper:

@inproceedings{jang2022temporalwiki,
  title={TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models},
  author={Jang, Joel and Ye, Seonghyeon and Lee, Changho and Yang, Sohee and Shin, Joongbo and Han, Janghoon and Kim, Gyeonghun and Seo, Minjoon},
  journal={EMNLP 2022},
  year={2022}
}

In order to generate new TemporalWiki (training and evaluation corpus), use the TemporalWikiDatasets repository.

In order to reproduce our results, take the following steps:

1. Create conda environment and install requirements

conda create -n twiki python=3.8 && conda activate twiki
pip install -r requirements.txt

Also, make sure to install the correct version of pytorch corresponding to the CUDA version and environment: Refer to https://pytorch.org/

#For CUDA 10.x
pip3 install torch torchvision torchaudio
#For CUDA 11.x
pip3 install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html

2. Download the preprocessed training and evaluation data (5 snapshots from 2021.08 - 2021.12) used for the experiments on the paper.

To download the Entire Wikipedia Corpus data:

wget https://continual.blob.core.windows.net/elm/Wikipedia_Full.zip

To download TWiki_Diffsets:

wget https://continual.blob.core.windows.net/elm/TWiki_Diffsets.zip

To download TWiki_Probes:

wget https://continual.blob.core.windows.net/elm/TWiki_Probes.zip

Download the data to data directory and unzip it

Finally, download the Initial GPT-2 model checkpoint trained on 08.2021 Wikipedia Snapshot used as the initial model for the paper.

wget https://continual.blob.core.windows.net/elm/model_checkpoints/08/GPT2_large_08_full.ckpt

3. Run the experiment and configuration components

This is an example of performing continual pretraining on TWiki_Diffsets (main experiment) with CKL

python run.py --config configs/training/diff.json

After training the model, run convert_to_fp32.py to convert the fp16 model checkpoints to fp32 checkpoint files to be used for evaluation.

This is an example of performing light-tuning on the pretrained models

python run.py --config configs/light_tuning/GPT2/subset/0801-0901_changed.json

This is an example of getting the TWiki_Probes New zero-shot evaluation of continually pretrained CKL

python run.py --config configs/evaluation/GPT2/subset/0801-0901_changed.json

For components in configuration file, please refer to the Continual-Knowledge-Learning repository.

Generation of Datasets

For Generation of Wikipedia_Full, TWiki_Diffsets, TWiki_Probes, please refer to the TemporalWikiDatasets

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
.idea		.idea
configs		configs
fix		fix
models		models
.gitignore		.gitignore
Datasets.py		Datasets.py
README.md		README.md
_test.py		_test.py
convert_to_fp32.py		convert_to_fp32.py
evaluation.py		evaluation.py
evaluation_ppl.py		evaluation_ppl.py
my_run.py		my_run.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TemporaWiki

1. Create conda environment and install requirements

2. Download the preprocessed training and evaluation data (5 snapshots from 2021.08 - 2021.12) used for the experiments on the paper.

3. Run the experiment and configuration components

Generation of Datasets

About

Releases

Packages

Languages

Zabreture/temporalwiki

Folders and files

Latest commit

History

Repository files navigation

TemporaWiki

1. Create conda environment and install requirements

2. Download the preprocessed training and evaluation data (5 snapshots from 2021.08 - 2021.12) used for the experiments on the paper.

3. Run the experiment and configuration components

Generation of Datasets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages