Tutorial 4 - Nagumkc/CS5560_KDM_Lab-assignments GitHub Wiki

NAME: Nageswara rao Nandigam


ClassID: 18


StudentID: 16244177


Lab Assignment #4


1. In-Class Question

Write a simple Spark program that reads a dataset and performs the following tasks:


a. Extract Triplets using OpenIE

Code:

Output:


b. Extract Semantic Meaning using ConceptNet

Code:

Output:


c. Extract Synonyms using WordNet

Code:

Output:


d. Group the data into topics with LDA using each of the pipelines below, and compare the results


i. Data => LDA

Code:

Output:


ii. Data => NLP => LDA

Code:

Output:


iii. Data => NLP => StopWord => LDA


iv. Data => NLP => StopWord => TFIDF => LDA

Code:

Output:
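The four pipelines in (d) differ only in how much preprocessing happens before LDA. As a rough illustration of those stages, and not the code used in this lab, here is a minimal plain-Python sketch. The stop-word list, the sample sentences, and all function names are assumptions made for the example. The "NLP" step is reduced to lowercasing and tokenizing, and the LDA step itself is left to a library. The weighting uses a smoothed inverse document frequency, log((N + 1) / (df + 1)).

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; a real run would use a fuller list).
STOP_WORDS = {"a", "an", "the", "is", "of", "and", "to", "in"}


def tokenize(doc):
    """Stand-in for the NLP step: lowercase the text and keep alphabetic tokens."""
    return re.findall(r"[a-z]+", doc.lower())


def remove_stop_words(tokens):
    """StopWord step: drop tokens that appear in STOP_WORDS."""
    return [t for t in tokens if t not in STOP_WORDS]


def tf_idf(docs_tokens):
    """TFIDF step: weight each term count by log((N + 1) / (df + 1))."""
    n_docs = len(docs_tokens)
    doc_freq = Counter()
    for tokens in docs_tokens:
        doc_freq.update(set(tokens))  # count each term once per document
    weighted = []
    for tokens in docs_tokens:
        term_freq = Counter(tokens)
        weighted.append({
            term: count * math.log((n_docs + 1) / (doc_freq[term] + 1))
            for term, count in term_freq.items()
        })
    return weighted


# Hypothetical two-document corpus, only for demonstration.
docs = ["The cat sat in the hat", "The dog chased the cat"]
tokens = [remove_stop_words(tokenize(d)) for d in docs]
weights = tf_idf(tokens)  # these vectors would be the input to LDA in pipeline iv
```

A term that occurs in every document (here "cat") gets weight zero, which is one reason pipeline iv can produce different topics from pipeline iii.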


2. Take-Home Question

Create a simple question-answering system that extends the dataset and tasks from (1). This is a continuation of Tutorial 3.


The question-answering system extracts triplets from the dataset, stores them in an array list, matches each question against the stored triplets, and extracts the predicate from the matching triplet.
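The matching idea described here can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the code submitted for the lab: the `Triplet` type, the word-overlap scoring, and the sample triplets are all hypothetical, and in the lab the triplets would come from OpenIE rather than being typed in by hand.

```python
import re
from typing import List, NamedTuple, Optional


class Triplet(NamedTuple):
    """A (subject, predicate, object) triplet, as produced by triplet extraction."""
    subject: str
    predicate: str
    obj: str


def words(text: str) -> set:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def answer(question: str, triplets: List[Triplet]) -> Optional[str]:
    """Return the predicate of the triplet whose subject and object
    share the most words with the question, or None if nothing matches."""
    question_words = words(question)
    best, best_score = None, 0
    for triplet in triplets:
        score = (len(question_words & words(triplet.subject))
                 + len(question_words & words(triplet.obj)))
        if score > best_score:
            best, best_score = triplet, score
    return best.predicate if best else None


# Hypothetical triplets standing in for extracted ones.
triplets = [
    Triplet("Spark", "runs on", "a cluster"),
    Triplet("WordNet", "groups", "synonyms"),
]

print(answer("What does WordNet do with synonyms?", triplets))
```

Word overlap is the simplest possible matcher. It ignores word order and synonyms, so a fuller system might normalize the question with the same NLP steps used in (1) before matching.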

Code:

Output:

Question 1:

Answer:

Question 2:

Answer:

Question 3:

Answer: