Skip to content

Uses NLTK to extract hyponyms and hypernyms for words and phrases mapped in WordNet

Notifications You must be signed in to change notification settings

joneszc/WordNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

WordNet-string-parser

Uses NLTK to extract hyponyms and hypernyms for words and phrases mapped in WordNet from an input string.

The string is broken into iterations of permutations for pairing underscore-linked chunks of text with phrases matched in WordNet.

A pre-processing step creates a new version of the string that is optimized for WordNet performance with the WordNetLemmatizer and the stripping of punctuation.

Both the pre-processed string and original string are tokenized and matches with WordNet are returned, after removal of duplicates using set().

All hyponyms and hypernyms are returned for detected WordNet synset matches from the tokenized string.

The output is a list of two dictionaries pertaining to both the hyponym matches and hypernym matches.

About

Uses NLTK to extract hyponyms and hypernyms for words and phrases mapped in WordNet

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages