This repository has been archived by the owner on Mar 2, 2022. It is now read-only.

Playing with counting word frequencies (and performance) in various languages.


benhoyt/countwords


Playing with counting word frequencies (and the performance thereof) in various languages. See the full article for context and background: https://benhoyt.com/writings/count-words/
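For readers who have not seen the article, here is a minimal Python sketch of what "counting word frequencies" can look like. It is not one of the repository's implementations. The function name `count_words` is hypothetical, and the rules it applies (lowercase the text, split on whitespace, list the most frequent words first) are assumptions made for illustration; the article defines the actual problem constraints.

```python
from collections import Counter


def count_words(text):
    """Return (word, count) pairs, most frequent first.

    Assumed rules (for illustration only): words are whitespace-separated
    and compared case-insensitively.
    """
    counts = Counter(text.lower().split())
    # most_common() sorts by descending count; words with equal counts
    # stay in the order they were first seen.
    return counts.most_common()


for word, count in count_words("The quick brown the"):
    print(word, count)
# prints:
# the 2
# quick 1
# brown 1
```

To count words from standard input instead, the same function could be called with `sys.stdin.read()`.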

NOTE

I received dozens of pull requests when my count-words article first came out, almost to the point where I couldn't keep up with them. Thanks for all the submissions!

However, I'm no longer taking new submissions and have archived this repository, mainly because of the amount of time I've already spent on it. The major languages are there, along with many minor ones. I'm still open to receiving feedback, though: you can either email me (via the email address on my website) or open an issue here. Thanks!

To run the benchmarks

To run the tests and benchmarks, you'll need Go, Rust, Python 3, AWK, and perhaps other tools installed. This has only been tested on Linux:

./test.sh
./benchmark.py

For the latest results (run on my machine against these versions), see the article's performance results section.

Credits

Thanks to these contributors for additional language versions:

See other versions on Rosetta Code.