Skip to content

NLTK implementatin to extract top 'N' keywords from a text file. This text file is created by transcripting an audio file recorded during a meeting using speech to text API

Notifications You must be signed in to change notification settings

Shruti29/top_keywords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is used to extract top keywords from a transcript of an audio file. This uses NLTK kit from python. Fore more information, please read code comments

This extracts the top keywords from a text file It has two functionalities

  1. Return the top 10 keywords with the count (number of occurences)
  2. Return the top 10 keywords with the timestamp at which they occurred

It creates a word count json in the same directory It created a timestamp json in the same directory

Usage:

If the name of text file is 'sample2'

To get the word count json: keywords_with_word_count('sample2')

To get the timestamp json: keywords_with_timestamps('sample2')

About

NLTK implementatin to extract top 'N' keywords from a text file. This text file is created by transcripting an audio file recorded during a meeting using speech to text API

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages