timothy22000/Spark-Seperate-And-Conquer-Algorithm

Spark-Seperate-And-Conquer-Algorithm

MSc Thesis

  • This repository contains the code for my MSc thesis: an implementation of a Separate and Conquer algorithm in Spark, together with the evaluation metrics (accuracy, precision, recall) used to assess it. The implementation was used to analyze large datasets with Spark's distributed in-memory data processing on clusters of 1, 5, 10, 15, and 20 commodity machines.
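
As a rough illustration of the three evaluation metrics named above, here is a minimal sketch in plain Java (no Spark dependency) that computes them from confusion-matrix counts for a binary classifier. The class name `EvaluationMetrics`, its method names, and the sample counts are hypothetical and not taken from this repository; returning 0.0 when a denominator is zero is also an assumed convention.

```java
// Hypothetical helper, not taken from this repository: binary-classification
// metrics computed from confusion-matrix counts (tp, tn, fp, fn).
public final class EvaluationMetrics {

    private EvaluationMetrics() {
        // Static utility class; not meant to be instantiated.
    }

    /** Fraction of all predictions that are correct: (tp + tn) / total. */
    public static double accuracy(long tp, long tn, long fp, long fn) {
        long total = tp + tn + fp + fn;
        // Assumed convention: report 0.0 when there are no predictions at all.
        return total == 0 ? 0.0 : (double) (tp + tn) / total;
    }

    /** Fraction of predicted positives that are truly positive: tp / (tp + fp). */
    public static double precision(long tp, long fp) {
        long predictedPositive = tp + fp;
        return predictedPositive == 0 ? 0.0 : (double) tp / predictedPositive;
    }

    /** Fraction of actual positives that were found: tp / (tp + fn). */
    public static double recall(long tp, long fn) {
        long actualPositive = tp + fn;
        return actualPositive == 0 ? 0.0 : (double) tp / actualPositive;
    }

    public static void main(String[] args) {
        // Made-up counts, purely for illustration.
        long tp = 40, tn = 45, fp = 5, fn = 10;
        System.out.println("accuracy  = " + accuracy(tp, tn, fp, fn));
        System.out.println("precision = " + precision(tp, fp));
        System.out.println("recall    = " + recall(tp, fn));
    }
}
```

In a Spark job, the four counts would typically be obtained by aggregating over an RDD of (prediction, label) pairs before calling helpers like these; that aggregation step is omitted here.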

Setting up Spark

On macOS, you can install Spark with Homebrew by running `brew install apache-spark`.

Alternatively, you can download it from:

To get started with Spark and test your installation, follow the quick start guide: http://spark.apache.org/docs/latest/quick-start.html

Additional Notes:

The project requires the following Maven dependency:

<dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_2.10</artifactId>
    <version>1.6.2</version>
</dependency>

If you are interested, you can read about the project and the problems I encountered along the way: [Wiki Page With Project Progress](http://timothy22000.wikidot.com/main)
