Lab 2 - PoojaShekhar/CS5543-Real-Time-Big-Data-Analytics--Lab-assignments GitHub Wiki

****## Needs REVISION ****

## Question

Spark Programming: Write a spark program with an interesting use case using text data as the input and program should have at least Two Spark Transformations and Two Spark Actions. Present your use case in map reduce paradigm as shown below (for word count).

Problem Scenario:

Facebook keeps suggesting us mutual friends when we have common friends.I have tried using Spark to find mutual friend in a self constructed data-set containing:

Input File

Input File

Design & Implementation

Screenshots

References:

http://xjlin0.github.io/tech/2015/08/30/suggesting-friends-by-mapreduce/ https://github.com/xjlin0/cs246/tree/master/w2015/hw1