Program for entity linkage of a server log file. Finds server processes that are presumably the same or are quite similar. The edit/Jaccard distances can be manually adjusted to fit the query or accommodate larger logs. Check the pyspark config if you have enough memory, adjust if not.
INFODIS v1.1 is the most recent version