
MapReduce is a programming model for processing and generating large data sets with a parallel, distributed algorithm on a cluster.
MapReduce libraries have been written in many programming languages, with different levels of optimization. A popular open-source implementation that has support for distributed shuffles is part of Apache Hadoop.
1: Learn MapReduce with Playing Cards
2: Intro To MapReduce
Intro to MapReduce concepts. Explores the flow of a MapReduce program.
3: MapReduce Programming Demo
This video walks through the Java code for an example MapReduce program, and then demonstrates how to run the Hadoop job on a MapR cluster. If you’re new to MapReduce, watch the Introduction to MapReduce video first.