Tutorial 4 - Nagumkc/CS5560_KDM_Lab-assignments GitHub Wiki
NAME: Nageswara rao Nandigam
ClassID: 18
StudentID: 16244177
Lab Assignment #4
1. In-Class Question
Write a simple Spark program that reads a dataset and performs the following tasks:
a. Extract triplets using OpenIE
Code:
Output:
b. Extract semantic meaning using ConceptNet
Code:
Output:
c. Extract synonyms using WordNet
Code:
Output:
d. Group the data with LDA using each of the pipelines given below and compare the results
i. Data => LDA
Code:
Output:
ii. Data => NLP => LDA
Code:
Output:
iii. Data => NLP => StopWord => LDA
iv. Data => NLP => StopWord => TFIDF => LDA
Code:
Output:
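The preprocessing stages that distinguish the four pipelines can be sketched in plain Python. This is only an illustrative sketch, not the Spark code used in the lab: the corpus `DOCS`, the list `STOP_WORDS`, and all function names are hypothetical, the NLP stage is reduced to lowercasing and splitting (the lab's NLP step would normally lemmatise as well), and the LDA stage itself is left out because it needs a library such as Spark MLlib. The IDF uses the smoothed form log((N + 1) / (df + 1)).

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus and stop-word list, for illustration only.
DOCS = [
    "The cat sat on the mat",
    "The dog chased the cat",
    "Dogs and cats are pets",
]
STOP_WORDS = {"the", "on", "and", "are", "a", "an", "of"}


def tokenize(text):
    """Stand-in for the NLP stage: lowercase and split on non-letters."""
    return [t for t in re.split(r"[^a-z]+", text.lower()) if t]


def remove_stop_words(tokens):
    """StopWord stage: drop tokens found in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def tf_idf(token_lists):
    """TFIDF stage: return one {term: weight} dict per document.

    TF is the raw term count; IDF is log((N + 1) / (df + 1)).
    """
    n_docs = len(token_lists)
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))  # count each term once per document
    weighted = []
    for tokens in token_lists:
        counts = Counter(tokens)
        weighted.append({
            term: count * math.log((n_docs + 1) / (doc_freq[term] + 1))
            for term, count in counts.items()
        })
    return weighted


def preprocess(docs):
    """Pipeline iv without the final LDA step."""
    return tf_idf([remove_stop_words(tokenize(d)) for d in docs])


if __name__ == "__main__":
    for vector in preprocess(DOCS):
        print(vector)
```

Dropping `tf_idf` from `preprocess` gives the input for pipeline iii, and dropping `remove_stop_words` as well gives the input for pipeline ii. Terms that occur in many documents, such as "cat" here, receive a lower weight than terms confined to one document, which is the effect the TFIDF stage is meant to have on the topics LDA finds.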
2. Take-Home Question
Create a simple question answering system as an extension of the dataset and tasks done in (1). This is a continuation of Tutorial 3.
The question answering system extracts triplets from the dataset, stores them in an array list, matches each question against the stored triplets, and extracts the predicate from the matching triplets.
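The matching idea can be illustrated with a minimal Python sketch. It is one possible reading of the description above, not the submitted implementation: the `TRIPLETS` list is hypothetical sample data standing in for OpenIE output, and the helper names `words` and `answer` are assumptions. Each question is scored by how many of its words appear in a triplet's subject or object, and the best-scoring triplet (including its predicate) is returned.

```python
import re

# Hypothetical (subject, predicate, object) triplets standing in for
# OpenIE output; they are not taken from the lab dataset.
TRIPLETS = [
    ("Obama", "was born in", "Hawaii"),
    ("Marie Curie", "discovered", "radium"),
    ("Spark", "is written in", "Scala"),
]


def words(text):
    """Lowercased set of alphanumeric words in a string."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def answer(question, triplets):
    """Return the triplet whose subject/object best overlaps the question.

    Returns None when no triplet shares a word with the question.
    """
    q_words = words(question)
    best, best_score = None, 0
    for subj, pred, obj in triplets:
        score = len(q_words & (words(subj) | words(obj)))
        if score > best_score:
            best, best_score = (subj, pred, obj), score
    return best


if __name__ == "__main__":
    match = answer("Where was Obama born?", TRIPLETS)
    if match is not None:
        subj, pred, obj = match
        print("Predicate:", pred)            # Predicate: was born in
        print("Answer:", subj, pred, obj)    # Answer: Obama was born in Hawaii
```

Word overlap is a deliberately crude matching rule; it ignores word order and synonyms, so a fuller system might also consult the WordNet synonyms from task (1c) when comparing question words with triplet terms.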
Code:
Output:
Question 1:
Answer:
Question 2:
Answer:
Question 3:
Answer: