Entity Linking

Step 1: Graph Embedding using the DistMult model (https://www.microsoft.com/en-us/research/wp-content/uploads/2017/05/emnlp15.pdf)

Setup

Create a folder named after the dataset (<data_name>) in $PROJECT_ROOT/data, e.g. $PROJECT_ROOT/data/umls or $PROJECT_ROOT/data/fb15k-237.
Inside that folder, create a subfolder named raw_text and copy the following data files into it:
- train.txt
- valid.txt
- test.txt
Each line of the knowledge data is tab-separated, in the format: entity1 \t entity2 \t relation \t 1
Create a config file named <data_name>.conf in $PROJECT_ROOT/config and set its parameters following the example in config/umls.conf.
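
Before preprocessing, it can help to confirm that the raw files really follow the four-column layout described above. The snippet below is a minimal sketch of such a check, not part of the repository. The names Triple and parse_triple_line are hypothetical, the example line is made up, and reading the trailing 1 as an integer label is an assumption about its meaning.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One knowledge-graph line, in the column order used by the raw files."""
    entity1: str
    entity2: str
    relation: str
    label: int  # assumption: the trailing "1" is an integer label


def parse_triple_line(line: str) -> Triple:
    """Parse one tab-separated line of train.txt, valid.txt or test.txt."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 4:
        raise ValueError(
            f"expected 4 tab-separated fields, got {len(fields)}: {line!r}"
        )
    entity1, entity2, relation, label = fields
    return Triple(entity1, entity2, relation, int(label))


# Hypothetical example line; the entity and relation names are made up.
example = "acquired_abnormality\tcell\taffects\t1\n"
print(parse_triple_line(example))
```

Note that the relation is the third column, after both entities, so a loader written for the more common entity1, relation, entity2 order would misread these files.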

Processing Data

./scripts/preprocess.sh <data.conf> graph

Training Models

To train a model, run the training script with a data config and a model config:

If running on CPU: ./scripts/train.sh config/umls.conf config/dist_mult.conf graph

If running on GPU: ./scripts/train.sh config/umls.conf config/dist_mult.conf graph use_gpu
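
For orientation, the DistMult paper linked above scores a triple with a bilinear form whose relation matrix is diagonal, which reduces to an element-wise product of the three embedding vectors followed by a sum. The snippet below is a minimal pure-Python sketch of that scoring function only. It is not the repository's training code, and the function name and the toy vectors are hypothetical.

```python
from typing import Sequence


def distmult_score(
    head: Sequence[float],
    relation: Sequence[float],
    tail: Sequence[float],
) -> float:
    """Return sum_i head[i] * relation[i] * tail[i]."""
    if not (len(head) == len(relation) == len(tail)):
        raise ValueError("all three vectors must have the same dimension")
    return sum(h * r * t for h, r, t in zip(head, relation, tail))


# Toy 3-dimensional embeddings, chosen by hand for illustration.
head = [1.0, 2.0, 0.5]
relation = [0.5, -1.0, 2.0]
tail = [2.0, 1.0, 3.0]

print(distmult_score(head, relation, tail))  # 1.0 - 2.0 + 3.0 = 2.0
```

Because the product is element-wise, swapping head and tail gives the same score, so this scoring function cannot distinguish the direction of a relation.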

Step 2: Mention Context Embedding (http://cogcomp.org/page/publication_view/817)

Setup

Create a folder named after the dataset (<data_name>) in $PROJECT_ROOT/data, e.g. $PROJECT_ROOT/data/umls or $PROJECT_ROOT/data/ncbi_disease_corpus.
Inside that folder, create a subfolder named raw_text and copy the data files into it.
The data must be in PubTator format.
Create a config file named <data_name>.conf in $PROJECT_ROOT/config and set its parameters following the example in config/ncbi_disease_corpus.conf.
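
As a reminder of what PubTator-formatted input looks like, each document consists of a title line (PMID|t|...), an abstract line (PMID|a|...), and then one tab-separated annotation line per mention (PMID, start offset, end offset, mention text, entity type, concept identifier), with a blank line between documents. The snippet below is a minimal sketch of a reader for that layout. It is not the repository's preprocessing code, and the names Mention and parse_pubtator, as well as the sample document and its placeholder concept identifier, are hypothetical.

```python
from typing import Dict, NamedTuple


class Mention(NamedTuple):
    start: int
    end: int
    text: str
    entity_type: str
    concept_id: str


def parse_pubtator(text: str) -> Dict[str, dict]:
    """Group title, abstract and mention annotations by PMID."""
    docs: Dict[str, dict] = {}

    def get_doc(pmid: str) -> dict:
        return docs.setdefault(
            pmid, {"title": "", "abstract": "", "mentions": []}
        )

    for line in text.splitlines():
        if not line.strip():
            continue  # blank lines separate documents
        head = line.split("|", 2)
        if len(head) == 3 and head[0].isdigit() and head[1] in ("t", "a"):
            pmid, kind, body = head
            get_doc(pmid)["title" if kind == "t" else "abstract"] = body
            continue
        fields = line.split("\t")
        if len(fields) >= 6:
            pmid, start, end, mention, entity_type, concept_id = fields[:6]
            get_doc(pmid)["mentions"].append(
                Mention(int(start), int(end), mention, entity_type, concept_id)
            )
    return docs


# Made-up document; the PMID and concept identifier are placeholders.
sample = (
    "1000001|t|Familial deafness in a large kindred.\n"
    "1000001|a|We describe a family with inherited hearing loss.\n"
    "1000001\t0\t17\tFamilial deafness\tSpecificDisease\tD000000\n"
)

doc = parse_pubtator(sample)["1000001"]
first = doc["mentions"][0]
print(first.text, doc["title"][first.start:first.end] == first.text)
```

The final line checks that the annotation offsets actually select the mention text from the title, which is a quick way to spot files whose offsets have drifted.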

Processing Data

./scripts/preprocess.sh <data.conf> mentions

Training Models

To train a model, run the training script with a data config and a model config:

If running on CPU: ./scripts/train.sh config/ncbi_disease_corpus.conf config/joint_context.conf mentions

If running on GPU: ./scripts/train.sh config/ncbi_disease_corpus.conf config/joint_context.conf mentions use_gpu
