Icp 2 - gracesyl/big-data-hadoop GitHub Wiki

ICP_2

In this lesson, we are going to discuss about Hadoop MapReduce and Hadoop Distributed File System (HDFS) problem statement: 1.Counting the frequency of words in the given input with MapReduce algorithm: Using the wordcount doing the odd evevcount in cloudera eclipse described follows:

I1

I2

I3

This is how the logic works behind as follows: