Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
-
Updated
May 11, 2021 - Go
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
A lightweight Go package implementing Charikar's Simhash algorithm for generating hash fingerprints and calculating similarity, ideal for deduplication and content fingerprinting
A barebones implementation of the simhash data sketching algorithm.
A high-performance key-value store in Go with LSM tree, compaction algorithms, rate limiting, and support for probabilistic data structures like Bloom Filter and SimHash. It also features range scan and list operations with pagination.
Add a description, image, and links to the simhash topic page so that developers can more easily learn about it.
To associate your repository with the simhash topic, visit your repo's landing page and select "manage topics."