Map Reduce HDFS - praveenpoluri/Big-Data-Programing GitHub Wiki

Hadoop MapReduce and Hadoop Distributed File System (HDFS)

Task:

  • Count of words in a text file that starts with 'a'
  • Indentifying prime numbers in a list of numbers

Features:

  • Hadoop
  • Map-Reduce
  • HDFS

Questions:

Word Count: calculate the frequency of the words in a text file:

code:

input:

output:

wordcount for words starting with 'a':

code:

input:

output:

Prime number:

code:

input:

output: