End-to-end earthquake detection pipeline via efficient time series similarity search
Updated Jul 6, 2023 - Jupyter Notebook
A Clojure library for querying large data sets by similarity
SetSketch: Filling the Gap between MinHash and HyperLogLog
A simple audio fingerprinting system
Python 2.7 code and learning notes for Spark 2.1.1
Text similarity computation using MinHashing and Jaccard distance on the Reuters dataset
Insight Data Engineering fellow project
MinHash and LSH index written in Rust for Node.js
An improved method of locality-sensitive hashing for scalable instance matching. In this study, we propose a scalable approach for automatically identifying similar candidate instance pairs in very large datasets using the MinHash LSH algorithm in C#.
Minhash clustering of text documents
An easy-to-use script for fast similarity search in the textual data (and embedding space) with GPU & Multi-core support.
Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality Sensitive Hashing
Project 1: Similar document searching via MinHash and Locality Sensitive Hashing
Implementation of a B+ Tree for range and exact match queries and of the LSH algorithm for finding similar documents as measured by Jaccard Similarity.
📃Document similarity detection using hashing
A set of methods and model evaluation metrics for predicting links in an academic citation network using Apache Spark and Scala
Scalable Data Mining - Assignment submissions
Recommendation systems for Yelp (collaborative filtering & content-based)
Documents my master's-level thesis work on building a continuous, topical web crawler based on Mercator (1999)