ICP7 - SaranAkkiraju/Python_and_Deep_Learning_Programming_ICP GitHub Wiki
Objectives
1. Extract the text at the following web URL using BeautifulSoup: https://en.wikipedia.org/wiki/Google
2. Save it in input.txt
3. Apply the following on the text and show output:
a. Tokenization
b. POS tagging
c. Stemming
e. Trigram
f. Named Entity Recognition
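Objectives 1 to 3 could be approached along the following lines. This is a minimal sketch, not the original solution: it assumes NLTK for the text processing steps and the standard-library `urllib` for the download. The helper names (`fetch_html`, `extract_text`, `stem_tokens`, `trigrams`, `tag_and_chunk`, `run`) and the `User-Agent` string are hypothetical, and the NLTK data package names vary between NLTK versions.

```python
from urllib.request import Request, urlopen

import nltk
from bs4 import BeautifulSoup
from nltk.stem import PorterStemmer
from nltk.util import ngrams

URL = "https://en.wikipedia.org/wiki/Google"


def fetch_html(url):
    """Download a page; the User-Agent value is an arbitrary placeholder."""
    req = Request(url, headers={"User-Agent": "ICP7-example/0.1"})
    with urlopen(req) as resp:
        return resp.read().decode("utf-8")


def extract_text(html):
    """Strip the markup and return the visible text as one string."""
    soup = BeautifulSoup(html, "html.parser")
    return soup.get_text(separator=" ", strip=True)


def stem_tokens(tokens):
    """Stemming with the Porter stemmer (needs no extra NLTK data)."""
    stemmer = PorterStemmer()
    return [stemmer.stem(t) for t in tokens]


def trigrams(tokens):
    """All consecutive three-token windows, as tuples."""
    return list(ngrams(tokens, 3))


def download_nltk_data():
    """Fetch the NLTK resources used below.

    Assumption: newer NLTK releases use the '_tab' / '_eng' names,
    older ones use the plain names, so both are requested here.
    """
    for pkg in ("punkt", "punkt_tab",
                "averaged_perceptron_tagger", "averaged_perceptron_tagger_eng",
                "maxent_ne_chunker", "maxent_ne_chunker_tab", "words"):
        nltk.download(pkg, quiet=True)


def tag_and_chunk(text):
    """Tokenization, POS tagging and named entity recognition."""
    tokens = nltk.word_tokenize(text)
    tagged = nltk.pos_tag(tokens)
    tree = nltk.ne_chunk(tagged)
    # Named entities are the labelled subtrees of the chunk tree.
    entities = [(" ".join(word for word, _ in st.leaves()), st.label())
                for st in tree if isinstance(st, nltk.Tree)]
    return tokens, tagged, entities


def run(url=URL, out_path="input.txt"):
    """Scrape the page, save it to input.txt, then print each analysis."""
    download_nltk_data()
    text = extract_text(fetch_html(url))
    with open(out_path, "w", encoding="utf-8") as f:
        f.write(text)
    tokens, tagged, entities = tag_and_chunk(text)
    print("Tokens:", tokens[:20])
    print("POS tags:", tagged[:20])
    print("Stems:", stem_tokens(tokens[:20]))
    print("Trigrams:", trigrams(tokens)[:10])
    print("Named entities:", entities[:20])
```

Calling `run()` performs the download and prints the first few results of each step. The pure helpers can be tried without network access, for example `trigrams(["a", "b", "c", "d"])`.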
4. Modify the given classification code as follows:
a. Change the classifier to KNeighborsClassifier and see how the accuracy changes
b. Change the TfidfVectorizer to use bigrams, TfidfVectorizer(ngram_range=(1,2)), and see how the accuracy changes
c. Add the argument stop_words='english' and see how the accuracy changes
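The three variations in objective 4 could be compared with a sketch like the one below. The "given code" is not reproduced on this page, so everything other than `KNeighborsClassifier`, `TfidfVectorizer(ngram_range=(1,2))` and `stop_words='english'` is an assumption: the tiny in-memory corpus is invented purely for illustration, the helper `knn_accuracy` is hypothetical, and `n_neighbors=3` is an arbitrary choice. Accuracies on such a small corpus say nothing about real data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score
from sklearn.neighbors import KNeighborsClassifier

# Toy data (made up for this example): label 0 = technology, 1 = cooking.
train_texts = [
    "the search engine indexes web pages",
    "the company released a new phone",
    "cloud servers store the data",
    "the browser loads the web site quickly",
    "boil the pasta in salted water",
    "bake the bread in a hot oven",
    "chop the onions and fry them",
    "add sugar and butter to the cake",
]
train_labels = [0, 0, 0, 0, 1, 1, 1, 1]
test_texts = [
    "the web site stores data on servers",
    "fry the pasta with butter and onions",
]
test_labels = [0, 1]


def knn_accuracy(vectorizer, k=3):
    """Fit TF-IDF features plus a k-nearest-neighbours classifier, return test accuracy."""
    X_train = vectorizer.fit_transform(train_texts)
    X_test = vectorizer.transform(test_texts)  # reuse the fitted vocabulary
    clf = KNeighborsClassifier(n_neighbors=k)
    clf.fit(X_train, train_labels)
    return accuracy_score(test_labels, clf.predict(X_test))


# One vectorizer per variation asked for in objective 4.
variants = {
    "unigrams (default)": TfidfVectorizer(),
    "unigrams + bigrams": TfidfVectorizer(ngram_range=(1, 2)),
    "stop words removed": TfidfVectorizer(stop_words="english"),
}

results = {name: knn_accuracy(vec) for name, vec in variants.items()}
for name, acc in results.items():
    print(f"{name}: accuracy = {acc:.2f}")
```

To apply this to the actual assignment, the toy lists would be replaced with the training and test splits used by the given code, keeping the same vectorizer variants so that the accuracy changes can be compared side by side.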