TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags
- python (3.8)
- cuda (11.4)
- torch (1.8.1)
- pip install -r requirements.txt
Download the pre-training model at the following link:
- BioBERT: https://huggingface.co/dmis-lab/biobert-base-cased-v1.2/tree/main
- ClinicalBERT: https://huggingface.co/emilyalsentzer/Bio_ClinicalBERT/tree/main
- Get dataset
- Process them to fit the same format as the example in
data/
- Put the processed data into the directory
data/
python main.py --config ./config/cadec.json