Map Reduce Replicated

An implementation of the core concepts of Hadoop's MapReduce framework.

It handles large-scale data computation and is modular enough for the user to supply any mapper and reducer.
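
As an illustration of what a user-supplied mapper and reducer might look like, here is a minimal word-count sketch. The names `mapper`, `reducer` and `run_job` are hypothetical and are not taken from the project's own mapper and reducer files; the sketch runs in a single process and assumes only that a mapper emits key/value pairs and a reducer combines the values collected for one key.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def mapper(line: str) -> Iterator[Tuple[str, int]]:
    """Emit (word, 1) for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def reducer(key: str, values: Iterable[int]) -> Tuple[str, int]:
    """Combine all counts emitted for a single word."""
    return key, sum(values)


def run_job(
    lines: Iterable[str],
    map_fn: Callable[[str], Iterator[Tuple[str, int]]],
    reduce_fn: Callable[[str, Iterable[int]], Tuple[str, int]],
) -> Dict[str, int]:
    """Run map, group-by-key (shuffle) and reduce in one process."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    # Map phase: apply the mapper to every line and group values by key.
    for line in lines:
        for key, value in map_fn(line):
            grouped[key].append(value)
    # Reduce phase: call the reducer once per key, in sorted key order.
    return dict(reduce_fn(key, grouped[key]) for key in sorted(grouped))


if __name__ == "__main__":
    print(run_job(["a b a", "b c"], mapper, reducer))
    # prints {'a': 2, 'b': 2, 'c': 1}
```

Because `run_job` takes the mapper and reducer as arguments, any other pair of functions with the same shape can be swapped in without touching the driver.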

Overview of the project

We have set up a multi-node environment consisting of a master node and multiple worker nodes. A client program communicates with the nodes according to the operation requested by the user. The operations handled by this project are:

WRITE: Given an input file, split it into multiple partitions and store them across the worker nodes.
READ: Given a file name, read its partitions from the workers and display the contents to the user.
MAP-REDUCE: Given an input file, a mapper file and a reducer file, execute a MapReduce job on the cluster.
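
The WRITE and READ operations above can be sketched roughly as follows. This is an assumption about one reasonable partitioning scheme (contiguous, near-equal chunks per worker), not a description of how the project actually splits files; `split_into_partitions` and `read_back` are hypothetical names, and the workers are simulated by a dictionary instead of real nodes.

```python
from typing import Dict, List


def split_into_partitions(lines: List[str], num_workers: int) -> Dict[int, List[str]]:
    """Split lines into contiguous, near-equal chunks, one per worker id."""
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")
    chunk, extra = divmod(len(lines), num_workers)
    partitions: Dict[int, List[str]] = {}
    start = 0
    for worker in range(num_workers):
        # The first `extra` workers each take one additional line.
        size = chunk + (1 if worker < extra else 0)
        partitions[worker] = lines[start:start + size]
        start += size
    return partitions


def read_back(partitions: Dict[int, List[str]]) -> List[str]:
    """Reassemble the file by concatenating partitions in worker-id order."""
    result: List[str] = []
    for worker in sorted(partitions):
        result.extend(partitions[worker])
    return result


if __name__ == "__main__":
    data = ["l1", "l2", "l3", "l4", "l5"]
    parts = split_into_partitions(data, 2)
    print(parts)             # prints {0: ['l1', 'l2', 'l3'], 1: ['l4', 'l5']}
    print(read_back(parts))  # prints ['l1', 'l2', 'l3', 'l4', 'l5']
```

Keeping the chunks contiguous means a READ only has to concatenate partitions in worker order to recover the original file.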

Requirements

Install Python 3.8

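Packages to install -

Flask
requests

The following modules are also used, but they are part of Python's standard library and need no separate installation -

json
contextlib
subprocess
os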

To run the project

Run the client file and the master_node file in two separate terminals using the following commands -

python client.py

python master_node.py
