HDFS - praveenpoluri/Big-Data-Programing GitHub Wiki
ICP 1
Class Id: 14
ToDo:Installing Cloudera and visualize Hadoop data with Hue
Tasks:
- Install Cloudera
- Load datasets into HDFS
- Append both files
- Visualize the result file with Hue
- Display first and last 5 lines of the result file
- Load new file and append data of all the 3 datasets
Tools:
- Cloudera
- Hadoop
- Hue
Task 1:
Create New directory in hdfs:
Copy local files into hdfs:
Task 2: Append second file to first file
Task 3: merge to files into new file
Task4: Display first five lines