Big_Data_Programming_ICP_1 - kusamdinesh/Big-Data-and-Hadoop GitHub Wiki
Cloudera
A software platform running in the cloud or on-premises for data engineering, data warehousing, machine learning and analytics. Cloudera began as an open-source hybrid Apache Hadoop distribution, CDH(Cloudera Distribution Including Apache Hadoop), which targeted the company-class deployments of that technology.
Cloudera Setup file
Cloudera Environment in VirtualBox
Excercise
Hue Visualization
Shakespeare Text File
Word List Text File