Basic text to numbers tokenizer for machine learning
-
Updated
Feb 8, 2017 - Ruby
Basic text to numbers tokenizer for machine learning
Lexers, tokenizers, parsers, compilers, renderers, stringifiers... What's the difference, and how do they work?
Yet another powerful tokenizer in js.
A PHP Library to extract n-grams from a text. Simple preprocessing tools (cleaning, tokenizing) included.
Create a snapdragon token. Used by the snapdragon lexer, but can also be used by plugins.
simple regex for correcting punctuations
Uses babel to extract JavaScript code comments from a string. Returns an array of comment objects, with line, column, index, comment type and comment string.
A pythonic wrapper for Stanford CoreNLP.
Extract JavaScript code comments from a string or glob of files.
textmining_project
Add a description, image, and links to the tokenize topic page so that developers can more easily learn about it.
To associate your repository with the tokenize topic, visit your repo's landing page and select "manage topics."