ICE 7 - slcc2c/CS5590_Python GitHub Wiki

In class exercise: In class, we further work on the tokenization, pos-tagging, entity extraction, biagram and trigram. For all the exercises import the right module from NLTK. You need to go through the slides to find them.

Define a paragraph of text includes three sentences
Use WordNet module to find the meaning of some words in your text
Apply Tokenization on the text
Apply Stemming on the text
Apply POS on the text
Apply Lemmatization on the text
Apply Trigram on the text
Apply Named Entity Recognition on the text

Code is here I could not get the NLTK downloader working on my Mac