Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 879 Bytes

README.md

File metadata and controls

12 lines (8 loc) · 879 Bytes

SANSA-RDF-EntityRank

A Distributed In-Memory Ranking Procedure for Entities of a Class.

Description

SANSA RDF EntityRank is a sub library to read RDF files into Spark or Flink and computes the term-frequency/inverse document frequency for the given RDF datasets. It allows files to reside in HDFS as well as in a local file system and distributes them across Spark RDDs/DataFrames or Flink DataSets.

Contains the SANSA RDF EntityRank for Apache Spark.

Contains the SANSA RDF EntityRank for Apache Flink.