Mapreduce Tutorial covers Introduction of MapReduce, Definition, Why Map Reduce, Algorithm,Examples, Installation, API (Application Programming interface), Implementation of Mapreduce, Mapreduce Partitioner, Mapreduce Combiner,Administration.
Why to use MapReduce?
Initially created by Google, MapReduce soon gained immense popularity because of its unmatched qualities mandating big data players to deploy it. Some of it unique features are as follows:
|Flexibility||Can be developed in any language like java, c++, python, etc.|
|Scalability||Able to process petabytes of data on single cluster|
|Recovery||Takes care of failure by storing the replica on another machine|
|Lesser data motion||Processing tasks appear on physical nodes which increases the speed in turn.|
Apart from the above key features some of the key highlights of this technology are:
- Map task stores data into local disk while Reduce task in HDFS.
- Map tasks are created for each split of equal size which is equal to an HDFS block~ 64 MB
- Tasktracker sends heartbeat signals to notify about the current state.
This blog will help you get a better understanding of Hadoop MapReduce – What it Refers To?
Mapreduce Tutorial Video