LAB 5 REPORT - SAISRIHARSHAS/Big-Data-Analytics-and-Applications-CS5542 GitHub Wiki

Spark Programming:

Write a spark program for the following Machine Learning Task. Create your own dataset for Image Classification Problem. Use the workflow as discussed in the Tutorial 4 Session using any classification algorithm (e.g., Random Forest, Naïve Bayes) excluding Decision Tree. Report the accuracy and confusion matrix obtained.

IMPLEMENTATION: Input Dataset: Based on our project "to create awareness on climate change". For example, acid rain, ocean acidification, global temperature rise etc.

Note: Used Random Forest Model to classify the image. Key Descriptors:

kMeans Clustering:

Histogram:

Random Forest Model (code): https://github.com/SAISRIHARSHAS/Big-Data-Analytics-and-Applications-CS5542/blob/master/Lab%205/Documentation/spark_programming_screenshots/random_forest_code.png

Random Forest Model:

Confusion Matrix:

Testing:

Final Accuracy:

Client Application using Spark API

Write a client application using the Spark API to connecting between Spark and your client. Your client can be either Web application or Android application. Refer to Tutorial 5 Spark API tutorial.

Answer: SUPSERSTATIC PLUGIN:

Output(console):

Web Application:

-- Google Conversion Actions API Build a simple application to have a conversion using Google Conversation Actions API about the summary you had generated about your video. Refer to Tutorial 5 Conversion Actions API tutorial. MLAB:

INTENT:

Fulfillment:

ENTITY:

OUTPUT: