WordNet-string-parser

Uses NLTK to extract hyponyms and hypernyms for words and phrases mapped in WordNet from an input string.

The string is broken into iterations of permutations for pairing underscore-linked chunks of text with phrases matched in WordNet.

A pre-processing step creates a new version of the string that is optimized for WordNet performance with the WordNetLemmatizer and the stripping of punctuation.

Both the pre-processed string and original string are tokenized and matches with WordNet are returned, after removal of duplicates using set().

All hyponyms and hypernyms are returned for detected WordNet synset matches from the tokenized string.

The output is a list of two dictionaries pertaining to both the hyponym matches and hypernym matches.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
wordnet_hnym_extractor.py		wordnet_hnym_extractor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WordNet-string-parser

About

Releases

Packages

Languages

joneszc/WordNet

Folders and files

Latest commit

History

Repository files navigation

WordNet-string-parser

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages