ICP8 Apache Spark - gabriellawillis/BigData GitHub Wiki

Source code can be found here: SourceCode

Spark Programming - Word Count

  • Write a spark program using text data as the input to count number of words

IntelliJ Word Count Code Snippet:

Word Count Output:

Secondary Sorting in Map Reduce

  • Sort the values in the reducer phase

IntelliJ Sorting Code Snippet:

Sorting Output:

References: