HDFS - praveenpoluri/Big-Data-Programing GitHub Wiki

ICP 1

Class Id: 14

ToDo:Installing Cloudera and visualize Hadoop data with Hue

Tasks:

  • Install Cloudera
  • Load datasets into HDFS
  • Append both files
  • Visualize the result file with Hue
  • Display first and last 5 lines of the result file
  • Load new file and append data of all the 3 datasets

Tools:

  • Cloudera
  • Hadoop
  • Hue

Task 1:

Create New directory in hdfs:

Copy local files into hdfs:

Task 2: Append second file to first file

Task 3: merge to files into new file

Task4: Display first five lines

Create a new text file and load it into hdfs and try to append all three datasets.

Visualize using Hue: