ICP8 Apache Spark - gabriellawillis/BigData GitHub Wiki
Source code can be found here: SourceCode
Spark Programming - Word Count
- Write a spark program using text data as the input to count number of words
IntelliJ Word Count Code Snippet:
Word Count Output:
Secondary Sorting in Map Reduce
- Sort the values in the reducer phase
IntelliJ Sorting Code Snippet:
Sorting Output:
References: