Skip to content

Concepts: Distributed Database Management Systems and Non-Relational Data Models - Setting up Hadoop in a fully distributed mode, using Hadoop Distributed File System (HDFS) and MapReduce (in Java and Python), and using a non-relational database based on a wide-column data model (HBase).

License

Notifications You must be signed in to change notification settings

course-files/DistributedDatabases-HDFS-MapReduce-WideColumn

Repository files navigation

Lab on Distributed Databases: Hadoop Distributed File System, MapReduce, and Wide-Column Data Models

Key Value
Course Codes MIT 8107
Course Names MIT 8107: Advanced Database Systems (Week 10-12 of 13)
Semester May to August 2025
Lecturer Allan Omondi
Contact aomondi@strathmore.edu
Note The lecture contains both theory and practice.
This lab forms part of the practice.
It is intended for educational purposes only.
Recommended citation: BibTex

Lab Manual and Instruction Files

Refer to the instruction files below for more details:

  1. Hadoop Setup in a Fully Distributed Mode
  2. HDFS and MapReduce using Java and Python
  3. HBase Setup in a Standalone Mode

Technology Stack and Process Flow

image

Apache® and other trademarks or logos are registered trademarks or logos. No endorsement by The Apache Software Foundation or other organizations is implied by the use of these trademarks or logos. This is meant for educational purposes only.

About

Concepts: Distributed Database Management Systems and Non-Relational Data Models - Setting up Hadoop in a fully distributed mode, using Hadoop Distributed File System (HDFS) and MapReduce (in Java and Python), and using a non-relational database based on a wide-column data model (HBase).

Topics

Resources

License

Stars

Watchers

Forks