ICP 1 - a190884810/Big-Data-Programming GitHub Wiki

Lesson Plan #1

Load the file in hadoop hdfs

  • command: Hadoop fs -cat

Append the second file to the first file

  • command: hdfs dfs appendToFile

Visualize file with Hue

Head and tail lines of the combined file

  • 1command: Hadoop fs -cat |head/tail

Create a new text file loaded into hdfs and try to append all three datasets

  • command: hdfs dfs appendToFile /dir The new.txt is the new combined file.