Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This change is going to be a prerequisite to making ChatterBot's response selection process more efficient. Using categorized training data from the chatterbot corpus, it will be possible to do text classification on input statements to place them in a group based on the existing training data. Once tagged, a strong assumption can be made by the chat bot that the appropriate response to the input statement exists within the set of statements that have the same tag, or category as the input statement.
Because filtering the results in the database based on a named attribute can be made to be fast and efficient, this will help solve a large portion of the performance problems that the current version of ChatterBot faces (~0.7).
Note: this pull request will only add the tag attributes to statements. Future pull requests will be required to:
a. Categories for data in the ChatterBot corpus
b. File names in the Ubuntu Dialog corpus