Home - acikgozmehmet/BigDataProgramming GitHub Wiki

BigDataProgramming


This course aims to provide the opportunity to walk through hands-on examples with Hadoop and Spark frameworks. This course introduces developers to the Hadoop ecosystem and focus on multiple programming models, including MapReduce, Pig, Hive, Sqoop, Flume, Solr, Oozie, HBase/Casandra/MongoDB, and Apache Spark. Students obtain the skills and knowledge about Hadoop architecture, software stack and execution environment and learn to implement Big Data applications with Hadoop and Spark. Students build applied programming skills using case studies such as Public Sector Service. Healthcare, Business and Learning Services. Programming will be with Java, Scala and Python.

Please feel free to check out the following pages for more information.

ICP-01 : Cloudera-and-Hue

ICP-02 : Hadoop-and-MapReduce

ICP-03 : MapReduce

ICP-04 : Hive

ICP-05 : Sqoop

ICP-06 : Solr-and-Lucene

ICP-07 : Cassandra

ICP-08 : Apache Spark Introduction

ICP-09 : Apache Spark Practice

ICP-10 : DataFrames&SQL in Scala Pyspark

ICP-11 : Apache Spark Streaming

ICP-12 : Graph-Frames and GraphX

ICP-13 : Graph-Frames and GraphX Algorithms

ICP-14 : Apache Spark MLIB