Semantic analyzer

A semantic analyzer of sentences and questions based on Language processing algorithms (with Pytorch :) ).

1. 'Are the questions similar?' problem

In this problem, we compare similarity between two questions. The dataset used here comes from the Quora Question Pairs challenge.

Our solution with a pretrained BERT model built with Pytorch.

How does it work?

python3 same_analyze.py "Am I wrong?" "Do you love ice scream?"
Same at 0.15%

python3 same_analyze.py "How do I save videos from twitter?" "How do you upload videos from your camera roll onto Twitter?"
Same at 10.42%

python3 same_analyze.py "How do I save videos from twitter?" "How do you upload videos from your camera roll onto Twitter?"
Same at 97.04%

Details are in the notebook qqp_BERT.ipynb

2. 'Is this comment positive?' problem

Here, we evaluate how positive is a comment sent. The dataset used here to train the model come from The Stanford Sentiment Treebank dataset

Our solution is a BiLSTM model trained on a negative-positive classification task. We embed words with the Word2Vec Gensim model trained with the glove-wiki-gigaword-50 corpus.

How does it work?

python3 sent_analyze.py "I love this movie"
Positive at 100.0%

python3 sent_analyze.py "A great idea becomes a not-great movie."
Positive at 0.07%

Details are in the notebook sentiment_analysis_BiLSTM_v2.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.gitignore		.gitignore
1g-word-1m-benchmark-r13output		1g-word-1m-benchmark-r13output
LICENSE		LICENSE
README.md		README.md
cola_BiLSTM.ipynb		cola_BiLSTM.ipynb
cola_fasttext.ipynb		cola_fasttext.ipynb
main.py		main.py
make_corpus.py		make_corpus.py
models.py		models.py
qqp_BERT.ipynb		qqp_BERT.ipynb
qqp_BiLSTM.ipynb		qqp_BiLSTM.ipynb
qqp_BiLSTM_v2.ipynb		qqp_BiLSTM_v2.ipynb
qqp_BiLSTM_v3.ipynb		qqp_BiLSTM_v3.ipynb
qqp_DATA.ipynb		qqp_DATA.ipynb
same_analyze.py		same_analyze.py
sent_analyze.py		sent_analyze.py
sentiment_analyis_DATA.ipynb		sentiment_analyis_DATA.ipynb
sentiment_analysis_BERT.ipynb		sentiment_analysis_BERT.ipynb
sentiment_analysis_BiLSTM_v1.ipynb		sentiment_analysis_BiLSTM_v1.ipynb
sentiment_analysis_BiLSTM_v2.ipynb		sentiment_analysis_BiLSTM_v2.ipynb
sentiment_analysis_BiLSTM_v3 (embiddings).ipynb		sentiment_analysis_BiLSTM_v3 (embiddings).ipynb
sentiment_analysis_DATA_bert.ipynb		sentiment_analysis_DATA_bert.ipynb
sentiment_analysis_DATA_bilstm.ipynb		sentiment_analysis_DATA_bilstm.ipynb
tp_lstm_classifier.ipynb		tp_lstm_classifier.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic analyzer

1. 'Are the questions similar?' problem

2. 'Is this comment positive?' problem

About

Releases

Packages

Languages

License

medric49/semantic_analyzer

Folders and files

Latest commit

History

Repository files navigation

Semantic analyzer

1. 'Are the questions similar?' problem

2. 'Is this comment positive?' problem

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages