steven-s / minhash-document-clusters Star 5 Code Issues Pull requests Minhash clustering of text documents text-mining clustering lsh minhash locality-sensitive-hashing document-clustering minhash-lsh-algorithm Updated Sep 29, 2017 Scala
vbarzokas / apache-spark-link-prediction Star 2 Code Issues Pull requests A set of methods and model evaluation metrics for predicting links in an academic citation network using Apache Spark and Scala scala apache-spark tf-idf prediction-algorithm minhash-lsh-algorithm citation-network Updated Nov 3, 2020 Scala