ICP 12 - Gnkhakimova/CS5590-BigData GitHub Wiki

ICP 12

Source Code
For following ICP we had to use DataFrames and perform different operations on it.

Task 1

Import the dataset as a csv file and create data frames directly on import than create graph out of the data frame created. Output DataFrame

Task 2

Concatenate chunks into list & convert to DataFrame

Task 3

Remove duplicates

Task 4

Name Columns (renamed Duration to Path)

Task 6

Create vertices and shows them

Task 7

Show some edges

Task 8

Vertex in-Degree

Task 9

Vertex out-Degree

Task 10

Apply the motif findings.