HADOOP ICP 6 - Apoorvag2597/BDP_Revised GitHub Wiki

Name - Apoorva Geetanjali

Class ID - 34

First, we need to check for solr.

Part 1 - Films Dataset 1.First, create the dataset and schema should be edited. 2.Open solr from the web-browser. 3.In the download section add the data set from the link given in the question. Copy the raw file and paste it here. After clicking the submit we will be able to see the success page. 4.From the Queries option open the page and write the queries accordingly

Dataset -

Query 1 - Listing all the action films

Query 2 - Listing Name, Director and genre

Query 3 - David and Animation

Query 4 - Harry and Initial release date

Query 5 - Name as Harry

Part 2 - Book Dataset 1.Create Instance directory and creating collection for books 2.Edit the documents tab with the raw file for Books csv as follows 3.Check if the values are correctly reflected

Dataset-

Query 1 - Genre as Scifi

Query2 - Listing all in stock

Query 3- By George R. Martin

Query 4 - Price

Query 5 - Series