This is a simple tool for text dataset analysis and multiple datasets comparison. Keywords: corpus, text dataset, text distribution, part-of-speech(pos), zipf's-law, distinct value, concreteness
python nlp natural-language-processing text corpus seaborn matplotlib part-of-speech zipfs-law concreteness distinct-value
-
Updated
Apr 11, 2022 - Jupyter Notebook