Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 344 Bytes

README.md

File metadata and controls

2 lines (2 loc) · 344 Bytes

Count Vectorizer

This repo, is part of my notes from the udemy course "Machine Learning: Natural Language Processing in Python (V2)". The word-to-vector transformation methods are compared for their accuracy in a given dataset. Four methods used to transform word-to-vectors are Stopwords, simple_tokenizer, LemmaTokenizer and StemTokenizer.