Author implementation of this NAACL 2021 paper.
-
Install pytorch 1.8.1 that fits your CUDA version
-
Install the rest of required packages
pip install -r requirements.txt
-
Run this command to install NLTK punkt.
python -c "import nltk; nltk.download('punkt'); nltk.download('stopwords')"
-
Download the dataset from the official Spider dataset website
-
Edit the config files
configs/defaults.jsonnet
to update the location of the dataset:
local dataset_path = "dataset/";
Use the following command to train:
python exec.py
First time loading of the dataset might take a while (a few hours) since the model first loads values from tables and calculates similarity features with the relevant question. It will then be cached for subsequent runs.