LabAssignment2 - AkhileshGattu/Real-Time-Assignments GitHub Wiki

**Question:** Write a Spark program with an interesting use case that takes text data as input. The program should have at least two Spark transformations and two Spark actions.

**Description:**

Transformations:

Map: The map transformation returns a new distributed dataset formed by passing each element of the source dataset through a function.

SortByKey: The sortByKey transformation operates on a dataset of (K, V) pairs whose keys K can be ordered. It returns the pairs sorted by key in ascending or descending order, as specified.

ReduceByKey: The reduceByKey transformation operates on a dataset of (K, V) pairs and returns one pair per key, with all the values for that key combined by the given reduce function. The number of reduce tasks can be specified through an optional argument.
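The three transformations above can be sketched in plain Python to show what each one computes. This is only a model of the semantics on ordinary lists, not Spark code: the helper names `map_rdd`, `reduce_by_key` and `sort_by_key` and the sample sentence are hypothetical, chosen for illustration. In a Spark program the corresponding RDD methods are `map`, `reduceByKey` and `sortByKey`, and they run lazily across partitions rather than eagerly on a list.

```python
def map_rdd(data, f):
    """Model of map: pass every element through f and collect the results."""
    return [f(x) for x in data]


def reduce_by_key(pairs, f):
    """Model of reduceByKey: combine all values that share a key using f."""
    merged = {}
    for key, value in pairs:
        merged[key] = f(merged[key], value) if key in merged else value
    return list(merged.items())


def sort_by_key(pairs, ascending=True):
    """Model of sortByKey: order (key, value) pairs by their key."""
    return sorted(pairs, key=lambda kv: kv[0], reverse=not ascending)


# Hypothetical sample text, standing in for the real input file.
words = "spark makes big data simple and spark makes it fast".split()

pairs = map_rdd(words, lambda w: (w, 1))            # (word, 1) for each word
counts = reduce_by_key(pairs, lambda a, b: a + b)   # sum the 1s per word
ordered = sort_by_key(counts)                       # alphabetical by word

print(ordered)
# [('and', 1), ('big', 1), ('data', 1), ('fast', 1), ('it', 1), ('makes', 2), ('simple', 1), ('spark', 2)]
```

Passing `ascending=False` to `sort_by_key` reverses the order, which mirrors the ascending or descending choice described for sortByKey.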

Actions:

**Take(n):** The take action returns an array containing the first n elements of the dataset.

Count(): The count action returns the number of elements in the dataset.
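As a rough illustration of the two actions, the sketch below models them in plain Python on an in-memory list. The function names `take` and `count` mirror the Spark action names, but the implementations and the `word_counts` sample data are assumptions made for this example. In Spark itself, actions are what trigger the evaluation of the preceding transformations and return a result to the driver program.

```python
from itertools import islice


def take(dataset, n):
    """Model of take(n): return the first n elements as a list."""
    return list(islice(dataset, n))


def count(dataset):
    """Model of count(): return how many elements the dataset holds."""
    return sum(1 for _ in dataset)


# Hypothetical (word, count) pairs, e.g. the output of a word-count job.
word_counts = [("and", 1), ("big", 1), ("makes", 2), ("spark", 2)]

first_two = take(word_counts, 2)
total = count(word_counts)

print(first_two)  # [('and', 1), ('big', 1)]
print(total)      # 4
```

Asking for more elements than exist, as in `take(word_counts, 10)`, simply returns everything available in this model.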

Screenshots:

https://github.com/AkhileshGattu/Real-Time-Assignments/blob/master/LabAssignment2/Documentation/Input.JPG

https://github.com/AkhileshGattu/Real-Time-Assignments/blob/master/LabAssignment2/Documentation/LabAssignment2.JPG

https://github.com/AkhileshGattu/Real-Time-Assignments/blob/master/LabAssignment2/Documentation/MapReduce.JPG