ICP 7 - ntihindukkipati/CS5590_Python_DL GitHub Wiki
-
Extract the following web URL text using BeautifulSoup
https://en.wikipedia.org/wiki/Google -
Save it in input.txt3. Apply the following on the text and show output:
a. Tokenization
b. POS
c. Stemming
d. Lemmatization
e. Trigram
f. Named Entity Recognition
-
Change the classifier in the given code to:
a. KNeighborsClassifier and see how accuracy changes
b. change the TF-IDF vectorizer to use bigram and see how the accuracy changes TfidfVectorizer(ngram_range=(1,2))
c. Put argument stop_words='english'and see how accuracy changes
BY
DUKKIPATI SRI SAI NITHIN CHOWDARY
CLASS ID: 4