Word2Vec Negative_Sampling

This is an efficient implementation of Word2vec on game of thrones textbooks
Note: This implemented on Windows OS, please find all path strings and change \ with / if running on linux or Mac

Dependecies:

Tensorflow
Python 3
Numpy
os
argpase
glob

Train From Scratch

if you would like to run the model yourself and configure the hyper-parameters specified in main.py please do delete the following folders first to avoid conflicts when running tensorflow:

visualizations
graph
checkpoints

To train from scratch have a look at main.py and choose the hyper-parameters you would like to experiment with, there is only 2 mandatory arguments --data-dir and --vocab-dir

python main.py --data-dir data\\ --vocab-dir vocab\\

Just-Visualization

you can use my trained model and run tensorboard to visualize the word vectors generated; to do so:

1- open terminal (cmd on windows) 
2- Navigate to visualizations folder
3- run 
tensorboard --logdir=visualizations`
4- copy and paste the url provided by tensorboard in chrome
5- load the vocab_3000.tsv file located in visualization folder in tensorboard to identify each word

Evaluation

you may as well run evaluate.py to find analogies and nearest words regarding game of thrones
my favourite one is
Mother is to Joffrey as "ghost/Sam" is to Jon

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
__pycache__		__pycache__
checkpoints		checkpoints
data		data
graph		graph
visualizations		visualizations
vocab		vocab
LICENSE.md		LICENSE.md
README.md		README.md
application_of_word2vec.py		application_of_word2vec.py
data_generator.py		data_generator.py
evaluate.py		evaluate.py
main.py		main.py
model.py		model.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word2Vec Negative_Sampling

Dependecies:

Train From Scratch

Just-Visualization

Evaluation

About

Releases

Packages

Languages

License

fakhouri-junior/NCE_Word2Vec

Folders and files

Latest commit

History

Repository files navigation

Word2Vec Negative_Sampling

Dependecies:

Train From Scratch

Just-Visualization

Evaluation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages