Keyword Frequency Analysis

This Python script generates two sets of keyword frequencies, one assigned at random and one following a power law, and analyzes both. It fits a power-law model to each set, uses a log-likelihood ratio test to evaluate goodness of fit, and plots each frequency distribution alongside its fitted power law.

Requirements

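numpy
powerlaw
matplotlib
nltk
random (Python standard library, no installation needed)
collections (Python standard library, no installation needed)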

Code Overview

Generate Keywords

100 unique English words are randomly selected.
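
The selection step can be sketched as follows. This is a minimal illustration, not the script's actual code: the helper name `pick_keywords`, the seed, and the placeholder vocabulary are all assumptions. The script presumably draws its words from an nltk corpus (for example `nltk.corpus.words.words()`), which would require downloading the corpus first.

```python
import random

def pick_keywords(vocabulary, k=100, seed=None):
    """Return k distinct words drawn uniformly at random from vocabulary."""
    # Deduplicate first: random.sample guarantees distinct positions,
    # not distinct values, so repeated words must be removed up front.
    # Sorting makes the result reproducible for a given seed.
    unique_words = sorted(set(vocabulary))
    if len(unique_words) < k:
        raise ValueError(f"need at least {k} unique words, got {len(unique_words)}")
    rng = random.Random(seed)
    return rng.sample(unique_words, k)

# Placeholder vocabulary (hypothetical) so the sketch runs offline.
vocabulary = [f"word{i}" for i in range(500)]
keywords = pick_keywords(vocabulary, k=100, seed=42)
print(len(keywords), len(set(keywords)))  # 100 100
```

Passing a seed keeps the keyword list stable between runs, which makes the later fits comparable.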

Generate Frequencies

Set 1: Randomly generated keyword frequencies.
Set 2: Frequencies following a power-law distribution (Zipf's law).
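
One way to build the two sets is sketched below. The function names, the count range, and the Zipf exponent are assumptions made for illustration; the script may generate its frequencies differently (for instance with numpy). Under Zipf's law, the keyword at rank r receives a count proportional to 1 / r^s.

```python
import random

def random_frequencies(keywords, low=1, high=1000, seed=None):
    """Set 1: assign each keyword an independent uniform random count."""
    rng = random.Random(seed)
    return {word: rng.randint(low, high) for word in keywords}

def zipf_frequencies(keywords, top_count=1000, exponent=1.0):
    """Set 2: the keyword at rank r gets about top_count / r**exponent."""
    return {
        word: max(1, round(top_count / rank ** exponent))  # never drop to 0
        for rank, word in enumerate(keywords, start=1)
    }

# Hypothetical keyword list, for illustration only.
keywords = [f"word{i}" for i in range(1, 101)]
random_set = random_frequencies(keywords, seed=0)
zipf_set = zipf_frequencies(keywords)
print(list(zipf_set.values())[:4])  # [1000, 500, 333, 250]
```

The random set has no rank structure, while the Zipf set falls off steeply after the first few keywords. That contrast is what the fitting step is meant to detect.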

Fit and Compare Distributions

Fit a power-law model to both sets and compare the fits using the log-likelihood ratio.
Print the fitted parameters, log-likelihood ratios, and p-values.
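
To show what such a comparison computes, here is a standard-library sketch of a maximum-likelihood power-law fit and a log-likelihood ratio test. It rests on several assumptions that the source does not confirm: the data are treated as continuous, xmin is fixed rather than estimated, and the alternative model is a shifted exponential. The `powerlaw` package offers a comparable comparison through `Fit.distribution_compare`, and its results will differ from this simplified version.

```python
import math
import random

def fit_alpha(data, xmin=1.0):
    """Continuous MLE for the exponent: alpha = 1 + n / sum(ln(x / xmin))."""
    tail = [x for x in data if x >= xmin]
    log_sum = sum(math.log(x / xmin) for x in tail)
    if log_sum == 0:
        raise ValueError("all tail values equal xmin; alpha is undefined")
    return 1.0 + len(tail) / log_sum

def loglikelihood_ratio(data, xmin=1.0):
    """Return (R, p) comparing a power law with a shifted exponential.

    R > 0 favours the power law, R < 0 the exponential. p is the two-sided
    significance of the sign of R under a normal approximation.
    """
    tail = [x for x in data if x >= xmin]
    n = len(tail)
    alpha = fit_alpha(tail, xmin)
    rate = 1.0 / (sum(tail) / n - xmin)  # MLE rate of the shifted exponential
    diffs = []
    for x in tail:
        ll_power = math.log((alpha - 1) / xmin) - alpha * math.log(x / xmin)
        ll_expon = math.log(rate) - rate * (x - xmin)
        diffs.append(ll_power - ll_expon)
    ratio = sum(diffs)
    mean = ratio / n
    variance = sum((d - mean) ** 2 for d in diffs) / n
    if variance == 0:
        raise ValueError("pointwise log-likelihoods are identical")
    p_value = math.erfc(abs(ratio) / math.sqrt(2 * n * variance))
    return ratio, p_value

# Synthetic power-law sample (alpha = 2.5, xmin = 1) via inverse transform.
rng = random.Random(0)
sample = [(1.0 - rng.random()) ** (-1.0 / 1.5) for _ in range(5000)]
alpha_hat = fit_alpha(sample)
R, p = loglikelihood_ratio(sample)
print(f"alpha={alpha_hat:.2f}  R={R:.1f}  p={p:.3g}")
```

A small p-value means the sign of R is unlikely to be due to chance. A large one means the data cannot distinguish the two models.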

Plot Results

Visualize the frequency distributions and power-law fits for both keyword sets.

Output

Keywords from Power-Law Set: Prints top 10 keywords with their frequencies.
Fitting Results: Displays the alpha, xmin, log-likelihood ratio, and p-value for both the random and power-law sets.
Plots: Generates two plots showing the keyword frequency distributions and the power-law fit.
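
The top-10 listing can be produced with `collections.Counter`, as in this minimal sketch. The keyword names and counts are hypothetical, and the script's own printing format may differ.

```python
from collections import Counter

# Hypothetical keyword counts, for illustration only.
frequencies = Counter({f"keyword{rank}": 1200 // rank for rank in range(1, 16)})

# most_common(10) returns the ten (word, count) pairs with the highest counts.
top_ten = frequencies.most_common(10)
for word, count in top_ten:
    print(f"{word:<12}{count:>6}")
```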

Example Output

Random Set:
  Alpha: 1.87
  Xmin: 1
  Log-likelihood ratio: -4.32
  p-value: 0.023

Power-Law Set:
  Alpha: 2.12
  Xmin: 1
  Log-likelihood ratio: -1.56
  p-value: 0.110

jamesdhope/keyword-power-law-analysis
