ICP_13 - PallaviArikatla/Big-Data-Programming GitHub Wiki
INTRODUCTION: analysis on Graph Frames and GraphXAlgorithms in Spark.
IMPLEMENTATION:
Question 1:
Import the dataset as a csv file and create data frames directly on import than create graph out of the data frame created.
- Make initial setup.
- Install all the necessary libraries.
- Import CSV files.
- Stations data frame created.
- Then create vertices and edges from the imported data frames to create the graph.
- Create edges from Trips data frame.
- Create graph frame using vertices and edges created.
Question 2: Triangle Count.
- Triangle count on the created graph.
Question 3:
Find Shortest Paths w.r.t. Landmarks.
- Shortest path will be calculated and gets created on the graph.
Question 4: Apply Page Rank algorithm on the dataset.
- Apply Page rank algorithm, it identifies important vertices on the graph based on connections.
Question 5: Save graphs to a file.
Bonus Question:
Question 1: Apply Label Propagation Algorithm.
Question 2: BFS Algorithm.