REIGN: Robust Training for Conversational Question Answering Models via Reinforced Reformulation Generation
This repository contains the code and data for our WSDM'24 full paper. Our method REIGN (REInforced GeNeration) aims to increase the robustness of ConvQA models. It strengthens the model's training by exposing it upfront to a larger variety of intent-preserving surface forms of the same training sample. We propose a reinforcement learning model based on deep Q-networks as a means for model-specific data augmentation: it learns to select only the top-k reformulations that are added as training data for the QA model, for maximum performance improvement.
Example showing the main processing steps in REIGN: predicting the most suitable reformulation categories (defined by our reformulation taxonomy) based on QA performance metrics (RCS), and generating reformulations for the predicted categories (RG).
For more details, see our paper: Robust Training for Conversational Question Answering Models via Reinforced Reformulation Generation, and visit our project website: https://reign.mpi-inf.mpg.de.
If you use this code, please cite:
@inproceedings{kaiser2023robust,
  title={Robust Training for Conversational Question Answering Models with Reinforced Reformulation Generation},
  author={Kaiser, Magdalena and Roy, Rishiraj Saha and Weikum, Gerhard},
  booktitle={WSDM},
  year={2024}
}
In our work, we conduct experiments on two ConvQA datasets, namely ConvQuestions and ConvMix, and showcase the effectiveness of the REIGN framework for two ConvQA models, namely CONQUER and EXPLAIGNN.
All code was tested on Linux with Python 3.8.
To install the required libraries, it is recommended to create a virtual environment:
python3 -m venv REIGN_ENV
source REIGN_ENV/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
To initialize the repo (download data, benchmark, models and our main results), run:
bash initialize.sh
For running experiments with EXPLAIGNN, the following steps are required:
- Create a separate virtual environment for EXPLAIGNN:
cd EXPLAIGNN/
conda env create --file conda-explaignn.yml
conda activate explaignn
pip install -e .
- EXPLAIGNN makes use of CLOCQ for retrieving relevant evidences. CLOCQ can be conveniently integrated via the publicly available API, using the client from the CLOCQ repo; the client can be installed via:
make install_clocq
- To initialize EXPLAIGNN, run (inside the EXPLAIGNN directory):
bash scripts/initialize.sh
bash scripts/reign_init.sh
We use ELQ (https://github.com/facebookresearch/BLINK.git) for entity linking. Clone the repo into the REIGN directory and download the required ELQ models via:
git clone https://github.com/facebookresearch/BLINK.git
cd BLINK
chmod +x download_elq_models.sh
./download_elq_models.sh
Our reformulation generation (RG) model is based on BART and is trained with distant supervision data. To train the model and to generate reformulations for each of our 15 reformulation categories for every question in the respective dataset, run:
bash rg/BART/configs/BENCHMARK_refs.sh DATATYPE
where BENCHMARK is either convmix or convquestions_seed, and DATATYPE is either train or dev.
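For example, to train the RG model and generate reformulations for the ConvMix train split, the call (with BENCHMARK=convmix and DATATYPE=train substituted) would be:
bash rg/BART/configs/convmix_refs.sh train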
The reformulation category selection (RCS) model learns to select which reformulations to add for a given QA model and for each question in the BENCHMARK. First, it collects rewards measuring how well the initial QA model can answer the respective reformulations; these rewards are then used in the RL training of the RCS model; finally, the trained RCS model decides which categories to use to augment the QA training data. These steps are performed by running the respective scripts:
bash rcs/configs/rcs_QAMODEL_BENCHMARK.sh
where QAMODEL is either conquer or explaignn, and BENCHMARK is either convmix or convquestions_seed.
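For example, to run all three RCS steps for CONQUER on ConvMix, the call (with QAMODEL=conquer and BENCHMARK=convmix substituted) would be:
bash rcs/configs/rcs_conquer_convmix.sh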
In this repo, we provide modified versions of the original CONQUER and EXPLAIGNN code (adapted to make training and evaluating with reformulations easier). The original code can be found here: CONQUER: https://github.com/magkai/CONQUER, EXPLAIGNN: https://github.com/PhilippChr/EXPLAIGNN.
For training and evaluating CONQUER with reformulations, run:
bash qa/CONQUER/configs/BENCHMARK/reign_conquer.sh
For comparison with CONQUER without reformulations, run:
bash qa/CONQUER/configs/BENCHMARK/conquer.sh
where BENCHMARK is either convmix or convquestions.
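For example, to train and evaluate CONQUER with reformulations on ConvMix, the call would be:
bash qa/CONQUER/configs/convmix/reign_conquer.sh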
For training EXPLAIGNN with reformulations, go into qa/EXPLAIGNN and run (using the EXPLAIGNN virtual environment):
bash scripts/pipeline.sh --train configs/BENCHMARK/reign_explaignn.yml "kb"
where BENCHMARK is either convmix or convquestions.
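For example, to train EXPLAIGNN with reformulations on ConvMix, the call would be:
bash scripts/pipeline.sh --train configs/convmix/reign_explaignn.yml "kb"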
For comparison with EXPLAIGNN without reformulations, run:
bash scripts/pipeline.sh --train configs/BENCHMARK/explaignn.yml "kb"
For evaluating on the GPT reformulations, run:
bash scripts/pipeline.sh --gpt-eval configs/BENCHMARK/reign_explaignn_gpt_eval.yml "kb"
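For example, to evaluate on the GPT reformulations for ConvMix, the call would be:
bash scripts/pipeline.sh --gpt-eval configs/convmix/reign_explaignn_gpt_eval.yml "kb"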