Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021

wzhouad/NLL-IE

NLL-IE

Code for EMNLP 2021 paper Learning from Noisy Labels for Entity-Centric Information Extraction.

Requirements

Dataset

The TACRED dataset can be obtained from this link. The TACREV dataset can be obtained by following the instructions in tacrev. The original CoNLL dataset can be obtained from this link. The revised CoNLL test dataset can be obtained from this link. The expected file structure is:

NLL-IE
 |-- re
 |    |-- data
 |    |    |-- train.json        
 |    |    |-- dev.json
 |    |    |-- test.json
 |    |    |-- dev_rev.json
 |    |    |-- test_rev.json
 |-- ner
 |    |-- data
 |    |    |-- train.txt     
 |    |    |-- dev.txt
 |    |    |-- test.txt
 |    |    |-- conllpp_test.txt
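
Before training, it can help to confirm that every dataset file is where the layout above expects it. The following is a minimal sketch, not part of the repository: the function name `find_missing_files` and the `EXPECTED_FILES` mapping are hypothetical, and the script assumes it is run from the repository root (the `NLL-IE` directory).

```python
from pathlib import Path

# Expected files, copied from the layout above (paths relative to the repository root).
# This mapping is an illustration, not something the repository itself defines.
EXPECTED_FILES = {
    "re/data": ["train.json", "dev.json", "test.json", "dev_rev.json", "test_rev.json"],
    "ner/data": ["train.txt", "dev.txt", "test.txt", "conllpp_test.txt"],
}


def find_missing_files(repo_root="."):
    """Return the expected dataset files that are absent under repo_root."""
    root = Path(repo_root)
    missing = []
    for folder, names in EXPECTED_FILES.items():
        for name in names:
            path = root / folder / name
            if not path.is_file():
                missing.append(path.as_posix())
    return missing


if __name__ == "__main__":
    missing = find_missing_files(".")
    if missing:
        print("Missing dataset files:")
        for path in missing:
            print("  " + path)
    else:
        print("All expected dataset files are present.")
```

The check only verifies that the files exist; it does not validate their contents or format.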

Training and Evaluation

Train the RE or NER model with the following command:

>> python train.py

The training loss and evaluation results on the dev set are synced to the wandb dashboard.
