M2 ICP 4

Submitted By:

Pavankumar Manchala

Class Id: 16

Tasks:

Question 1: Spark Streaming using log file generation.

The streaming program computes a word count over the generated log files.
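As a rough illustration of this task, the sketch below counts words in files that appear in a watched directory. It assumes PySpark's DStream API; the directory name `log`, the 5-second batch interval, and the function names are hypothetical and not taken from the original submission. The PySpark imports sit inside the function so the plain-Python helpers can be checked without a Spark installation.

```python
from collections import Counter


def split_words(line):
    """Split one log line into words on whitespace."""
    return line.split()


def count_words(lines):
    """Plain-Python reference for what one streaming batch should produce."""
    return Counter(word for line in lines for word in split_words(line))


def run_file_stream(log_dir="log", batch_seconds=5):
    """Word count over files newly added to log_dir (assumed directory name)."""
    # Imported here so the helpers above work without PySpark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    sc = SparkContext("local[2]", "LogFileWordCount")
    ssc = StreamingContext(sc, batch_seconds)

    # textFileStream only picks up files created in the directory after start.
    lines = ssc.textFileStream(log_dir)
    counts = (lines.flatMap(split_words)
                   .map(lambda word: (word, 1))
                   .reduceByKey(lambda a, b: a + b))
    counts.pprint()  # print the first few counts of every batch

    ssc.start()
    ssc.awaitTermination()
```

Calling `run_file_stream()` starts the job; a separate script would then need to write new log files into the watched directory while it runs.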

Question 2: Write a Spark Streaming word count program for text received from a data server listening on a TCP socket.

Words typed at the terminal are sent to the word count program over the established TCP connection.
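One way this could look is sketched below, assuming PySpark's DStream API and a data server started in another terminal (for example with `nc -lk 9999`). The host `localhost`, port `9999`, batch interval, and function names are assumptions for illustration, not details from the original.

```python
def to_pairs(line):
    """Turn one line of text into (word, 1) pairs."""
    return [(word, 1) for word in line.split()]


def run_socket_word_count(host="localhost", port=9999, batch_seconds=5):
    """Word count over text lines read from a TCP socket (assumed host/port)."""
    # Imported here so to_pairs can be checked without PySpark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    # local[2]: one thread for the socket receiver, one for processing.
    sc = SparkContext("local[2]", "SocketWordCount")
    ssc = StreamingContext(sc, batch_seconds)

    lines = ssc.socketTextStream(host, port)
    counts = lines.flatMap(to_pairs).reduceByKey(lambda a, b: a + b)
    counts.pprint()  # print the first few counts of every batch

    ssc.start()
    ssc.awaitTermination()
```

With the data server running, calling `run_socket_word_count()` and typing lines into the server terminal should print per-batch counts in the Spark console.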

Bonus: Spark Streaming for Character Frequency using TCP Socket.

Instead of counting words, this program counts the characters in each word and displays together the words that have the same number of characters.
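The grouping described here can be sketched as follows, again assuming PySpark's DStream API with a socket source. Each word is keyed by its length, and words sharing a key are concatenated into one list. The host, port, batch interval, and all function names are hypothetical.

```python
def length_pairs(line):
    """Key each word in a line by its character count: (length, [word])."""
    return [(len(word), [word]) for word in line.split()]


def group_by_length(lines):
    """Plain-Python reference: map each length to the words of that length."""
    groups = {}
    for line in lines:
        for length, words in length_pairs(line):
            groups.setdefault(length, []).extend(words)
    return groups


def run_length_grouping(host="localhost", port=9999, batch_seconds=5):
    """Group words from a TCP socket by character count (assumed host/port)."""
    # Imported here so the helpers above work without PySpark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    sc = SparkContext("local[2]", "CharacterFrequency")
    ssc = StreamingContext(sc, batch_seconds)

    lines = ssc.socketTextStream(host, port)
    # Concatenating the single-word lists collects all words of equal length.
    grouped = lines.flatMap(length_pairs).reduceByKey(lambda a, b: a + b)
    grouped.pprint()

    ssc.start()
    ssc.awaitTermination()
```

For instance, `group_by_length(["spark is fast"])` maps 5 to `["spark"]`, 2 to `["is"]`, and 4 to `["fast"]`.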