Big_Data_Programming_ICP_2 - kusamdinesh/Big-Data-and-Hadoop GitHub Wiki
Lesson 2: Hadoop Distributed File System, MapReduce, and Big Data Applications
In this class, we will address the following problem statement for Hadoop MapReduce and the Hadoop Distributed File System (HDFS): 1. Count the frequency of words in the given input using the MapReduce algorithm, adapting the word count program in Eclipse on Cloudera to do the odd-even count.
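To make the word-count idea concrete, here is a minimal sketch in plain Java that imitates the map and reduce phases in memory. It deliberately does not use the Hadoop API, so it runs without a cluster; the class name `WordCountSketch` and its method names are hypothetical and are not taken from the ICP code. A real Hadoop job would spread the same two steps across a mapper class and a reducer class.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical class: an in-memory imitation of MapReduce word count (no Hadoop dependency).
class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every whitespace-separated token in a line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        StringTokenizer tokens = new StringTokenizer(line);
        while (tokens.hasMoreTokens()) {
            pairs.add(new SimpleEntry<>(tokens.nextToken(), 1));
        }
        return pairs;
    }

    // Shuffle and reduce phase: group the pairs by key and sum their values.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> counts = new TreeMap<>(); // sorted keys, like reducer output
        for (Map.Entry<String, Integer> pair : pairs) {
            counts.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return counts;
    }

    // Driver: run the map step on every input line, then reduce all emitted pairs.
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("hello hadoop", "hello hdfs");
        System.out.println(countWords(input)); // prints {hadoop=1, hdfs=1, hello=2}
    }
}
```

The sketch treats words as case-sensitive and splits only on whitespace, so `Hello` and `hello,` would be counted as distinct keys.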
You are required to use an IDE. I used Eclipse Luna, which is already included in the Cloudera Hadoop distribution. Open Eclipse -> click New -> Java Project -> enter a project name -> click Finish. I created a project called BigDataProgramming, under which I created a subdirectory named ICP2 containing three Java programs: WordCount.java, NumberCount.java, and CharFreq.java.
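The odd-even count from the problem statement can be sketched the same way as word count, with only the key changed. The example below assumes that "odd-even count" means counting how many integers in the input are odd and how many are even, and that the input contains only whitespace-separated integers. The class name `OddEvenCountSketch` is hypothetical, and this is plain Java rather than the Hadoop API, so it is not the actual content of any of the three files listed above.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical class: word count adapted so that the key is "odd" or "even".
class OddEvenCountSketch {

    // Map step: replace each number with the key "even" or "odd".
    static String keyFor(long n) {
        return (n % 2 == 0) ? "even" : "odd";
    }

    // Reduce step folded into the loop: sum a 1 for every key that is emitted.
    // Assumption: every token parses as an integer; otherwise parseLong throws.
    static Map<String, Integer> countOddEven(List<String> lines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            for (String token : line.trim().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // blank line
                }
                counts.merge(keyFor(Long.parseLong(token)), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("1 2 3 4", "5 6 7");
        System.out.println(countOddEven(input)); // prints {even=3, odd=4}
    }
}
```

Comparing the remainder with zero, rather than with one, keeps negative odd numbers in the "odd" bucket, because `-3 % 2` is `-1` in Java.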