A simple python repository for developing perceptron based text mining involving dataset linguistics preprocessing for text classification and extracting similar text for a given query.
machine-learning text-mining text-classification optimization linguistics nltk tf-idf perceptron l2-regularization tokenization lemmatization cosine-similarity-scores information-retreival torch-sparse-matrix
-
Updated
Mar 25, 2022 - Jupyter Notebook