ICP2_6 - Hiresh12/Big-Data-Programming GitHub Wiki

GraphFrames and GraphX

Task:

Write a Spark program that imports datasets and creates a graph from them using GraphX.

Features:

  • Spark
  • Python
  • Jupyter Notebook
  • GraphX and GraphFrames

Tasks:

Import the dataset as a CSV file, create DataFrames directly on import, then create a graph from those DataFrames.
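The import step can be sketched in plain Python without a Spark session. In Spark the equivalent would be `spark.read.csv(path, header=True)` followed by `GraphFrame(vertices, edges)`, where GraphFrames expects a vertex column named `id` and edge columns named `src` and `dst`. The sample data and the column names `station_id`, `name`, `start_station`, and `end_station` below are hypothetical, since the wiki does not say which dataset was used.

```python
import csv
import io

# Hypothetical sample data; the real dataset and its columns are not given here.
VERTICES_CSV = """station_id,name
A,Alpha
B,Bravo
C,Charlie
"""

EDGES_CSV = """start_station,end_station
A,B
B,C
C,A
"""


def load_graph(vertices_text, edges_text):
    """Parse two CSV texts into vertex and edge records.

    Columns are renamed to id / src / dst, the names GraphFrames requires.
    """
    vertices = [
        {"id": row["station_id"], "name": row["name"]}
        for row in csv.DictReader(io.StringIO(vertices_text))
    ]
    edges = [
        {"src": row["start_station"], "dst": row["end_station"]}
        for row in csv.DictReader(io.StringIO(edges_text))
    ]
    return vertices, edges


vertices, edges = load_graph(VERTICES_CSV, EDGES_CSV)
```

Renaming the columns at import time keeps the later graph construction to a single call, because no further reshaping is needed.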

Triangle Count
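GraphFrames exposes this task as `g.triangleCount()`, which reports for each vertex the number of triangles passing through it. The following is a minimal plain-Python sketch of that computation, not the Spark implementation. It treats edges as undirected, and the four-vertex graph is a made-up example.

```python
from itertools import combinations


def triangle_count(edges):
    """Return {vertex: number of triangles that include it}."""
    # Build an undirected adjacency map, ignoring self-loops and duplicates.
    adjacency = {}
    for src, dst in edges:
        if src == dst:
            continue
        adjacency.setdefault(src, set()).add(dst)
        adjacency.setdefault(dst, set()).add(src)

    counts = {}
    for vertex, neighbors in adjacency.items():
        # A triangle exists for every pair of neighbors that are themselves connected.
        counts[vertex] = sum(
            1 for a, b in combinations(sorted(neighbors), 2) if b in adjacency[a]
        )
    return counts


# Hypothetical example: one triangle a-b-c plus a pendant vertex d.
edges = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d")]
counts = triangle_count(edges)
```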

Find Shortest Paths w.r.t. Landmarks
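In GraphFrames this task corresponds to `g.shortestPaths(landmarks=[...])`, which gives each vertex a map from landmark to hop count. The sketch below shows the idea in plain Python under two assumptions: distances are counted along edge direction, from each vertex toward the landmark, and unreachable landmarks are simply left out of a vertex's map. The graph is a made-up example.

```python
from collections import deque


def shortest_paths(vertices, edges, landmarks):
    """Return {vertex: {landmark: hops}} for every landmark the vertex can reach."""
    # Reverse the edges so one BFS from a landmark finds all vertices that reach it.
    reverse = {v: [] for v in vertices}
    for src, dst in edges:
        reverse[dst].append(src)

    distances = {v: {} for v in vertices}
    for landmark in landmarks:
        distances[landmark][landmark] = 0
        queue = deque([landmark])
        while queue:
            current = queue.popleft()
            for previous in reverse[current]:
                if landmark not in distances[previous]:
                    distances[previous][landmark] = distances[current][landmark] + 1
                    queue.append(previous)
    return distances


# Hypothetical example graph.
vertices = ["a", "b", "c", "d"]
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("d", "a")]
result = shortest_paths(vertices, edges, landmarks=["c"])
```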

Apply the PageRank algorithm to the dataset.
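GraphFrames runs this step with `g.pageRank(resetProbability=0.15, maxIter=10)`. To show what the two parameters control, here is a minimal power-iteration sketch in plain Python. It normalizes ranks so they sum to 1 and spreads the rank of vertices without outgoing edges evenly. Both are choices made for this sketch, so Spark's output may be scaled differently. The graph is a made-up example.

```python
def pagerank(vertices, edges, reset_probability=0.15, max_iter=20):
    """Return {vertex: rank} after max_iter rounds of power iteration."""
    n = len(vertices)
    out_links = {v: [] for v in vertices}
    for src, dst in edges:
        out_links[src].append(dst)

    ranks = {v: 1.0 / n for v in vertices}
    for _ in range(max_iter):
        # Rank held by vertices with no outgoing edges is shared by everyone.
        dangling = sum(ranks[v] for v in vertices if not out_links[v])
        base = reset_probability / n + (1 - reset_probability) * dangling / n
        updated = {v: base for v in vertices}
        for src, targets in out_links.items():
            if targets:
                share = (1 - reset_probability) * ranks[src] / len(targets)
                for dst in targets:
                    updated[dst] += share
        ranks = updated
    return ranks


# Hypothetical example graph.
vertices = ["a", "b", "c"]
edges = [("a", "b"), ("b", "c"), ("c", "a"), ("a", "c")]
ranks = pagerank(vertices, edges)
```

In this example `c` ends up ranked above `b`, because `c` receives all of `b`'s rank plus half of `a`'s, while `b` receives only half of `a`'s.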

Save the generated graphs to a file.
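With GraphFrames, one common way to persist a graph is to write its `vertices` and `edges` DataFrames separately through the regular DataFrame writer (for example `g.vertices.write.parquet(...)`) and rebuild the graph from the two files later. The plain-Python sketch below mirrors that two-file layout with CSV. The file names `vertices.csv` and `edges.csv` and the helper names are hypothetical.

```python
import csv
import tempfile
from pathlib import Path


def save_graph(vertices, edges, directory):
    """Write vertices and edges to two CSV files inside directory."""
    directory = Path(directory)
    directory.mkdir(parents=True, exist_ok=True)
    with open(directory / "vertices.csv", "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["id"])
        writer.writerows([v] for v in vertices)
    with open(directory / "edges.csv", "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["src", "dst"])
        writer.writerows(edges)


def read_graph(directory):
    """Read back the two CSV files written by save_graph."""
    directory = Path(directory)
    with open(directory / "vertices.csv", newline="") as handle:
        vertices = [row["id"] for row in csv.DictReader(handle)]
    with open(directory / "edges.csv", newline="") as handle:
        edges = [(row["src"], row["dst"]) for row in csv.DictReader(handle)]
    return vertices, edges


# Hypothetical example: round-trip a small graph through a temporary directory.
vertices = ["a", "b", "c"]
edges = [("a", "b"), ("b", "c")]
with tempfile.TemporaryDirectory() as tmp:
    save_graph(vertices, edges, tmp)
    restored = read_graph(tmp)
```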

Apply the Label Propagation Algorithm (LPA).
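GraphFrames provides this as `g.labelPropagation(maxIter=...)`. The idea can be sketched in plain Python: every vertex starts with its own label and, in each round, adopts the label that is most common among its neighbors. Breaking ties by taking the smallest label is an assumption made here to keep the sketch deterministic, so Spark may assign different labels. The two-triangle graph is a made-up example.

```python
from collections import Counter


def label_propagation(vertices, edges, max_iter=5):
    """Return {vertex: community label} after max_iter synchronous rounds."""
    # Labels spread in both directions along each edge.
    neighbors = {v: [] for v in vertices}
    for src, dst in edges:
        neighbors[src].append(dst)
        neighbors[dst].append(src)

    labels = {v: v for v in vertices}
    for _ in range(max_iter):
        updated = {}
        for vertex in vertices:
            counts = Counter(labels[n] for n in neighbors[vertex])
            if not counts:
                # Isolated vertices keep their own label.
                updated[vertex] = labels[vertex]
                continue
            best = max(counts.values())
            # Assumed tie-break: the smallest of the most frequent labels.
            updated[vertex] = min(l for l, c in counts.items() if c == best)
        labels = updated
    return labels


# Hypothetical example: two disconnected triangles.
vertices = ["a", "b", "c", "x", "y", "z"]
edges = [("a", "b"), ("b", "c"), ("c", "a"), ("x", "y"), ("y", "z"), ("z", "x")]
labels = label_propagation(vertices, edges)
```

Synchronous label propagation is not guaranteed to converge on every graph, which is why the number of rounds is capped rather than run until the labels stop changing.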

Apply the breadth-first search (BFS) algorithm.
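GraphFrames offers `g.bfs(fromExpr, toExpr, maxPathLength=...)`, which searches from vertices matching one expression to vertices matching another. The sketch below shows the underlying search in plain Python for a single start and goal vertex. It returns one shortest path rather than all of them, and the function name `bfs_path` and the graph are hypothetical.

```python
from collections import deque


def bfs_path(edges, start, goal, max_path_length=10):
    """Return a shortest directed path from start to goal, or None if none exists
    within max_path_length edges."""
    adjacency = {}
    for src, dst in edges:
        adjacency.setdefault(src, []).append(dst)

    queue = deque([[start]])
    visited = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        # Stop extending paths that already use the maximum number of edges.
        if len(path) - 1 >= max_path_length:
            continue
        for nxt in adjacency.get(path[-1], []):
            if nxt not in visited:
                visited.add(nxt)
                queue.append(path + [nxt])
    return None


# Hypothetical example graph with two routes from a to c.
edges = [("a", "b"), ("b", "c"), ("a", "d"), ("d", "e"), ("e", "c")]
path = bfs_path(edges, "a", "c")
```

Because BFS explores vertices in order of distance from the start, the first path that reaches the goal is one with the fewest edges.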

References

https://spark.apache.org/docs/latest/graphx-programming-guide.html

https://mapr.com/blog/how-get-started-using-apache-spark-graphx-scala/

https://databricks.com/blog/2016/03/03/introducing-graphframes.html