ApacheSparkExample

An example project showing how to use Apache Spark to solve MapReduce-style big data problems.

Introduction

Spark is a relatively new technology that makes MapReduce-style problems easier to solve. It does not require NFS: it can also run against local files on a single machine.
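To make "MapReduce-style problem" concrete, here is a minimal word-count sketch in plain Python. It is not taken from this project and does not use the Spark API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical and chosen only for illustration. It shows the three steps that a framework like Spark would otherwise handle and distribute for you.

```python
from collections import defaultdict
from functools import reduce


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield (word, 1)


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Combine the values collected for each key by summing them."""
    return {key: reduce(lambda a, b: a + b, values) for key, values in groups.items()}


def word_count(lines):
    """Run the map, shuffle, and reduce steps in sequence (hypothetical helper)."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["spark eases map reduce", "map reduce on big data"]
    print(word_count(sample))
```

In Spark, the same pipeline is usually written as a short chain of RDD operations such as `flatMap`, `map`, and `reduceByKey`, with the grouping step handled by the framework.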

It can also run on the HDFS filesystem, which makes it suitable for a multi-node cluster environment.

You can run the project's jar with a natively compiled Spark binary, which will run it against HDFS or NFS storage of some sort.