ICE 7 - slcc2c/CS5590_Python GitHub Wiki
In-class exercise: In class, we continue working on tokenization, POS tagging, entity extraction, bigrams, and trigrams. For every exercise, import the appropriate module from NLTK; the slides show which ones to use.
- Define a paragraph of text that includes three sentences
- Use the WordNet module to look up the meaning of some words in your text
- Apply tokenization to the text
- Apply stemming to the text
- Apply POS tagging to the text
- Apply lemmatization to the text
- Extract trigrams from the text
- Apply Named Entity Recognition to the text
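
The steps above can be sketched roughly as follows. This is a minimal illustration, not the course solution: the paragraph in `text` and the helper name `data_dependent_steps` are made up for the example. Tokenization, stemming, and trigram extraction are done with NLTK pieces that need no downloaded data (`wordpunct_tokenize`, `PorterStemmer`, `trigrams`). The WordNet lookup, POS tagging, lemmatization, and NER steps are wrapped in a function because they assume the relevant NLTK data packages (WordNet, a POS tagger model, and the NE chunker with its word list) are already installed; the exact package names vary between NLTK versions.

```python
import nltk
from nltk.corpus import wordnet
from nltk.stem import PorterStemmer, WordNetLemmatizer
from nltk.tokenize import wordpunct_tokenize
from nltk.util import trigrams

# Hypothetical three-sentence paragraph, made up for this sketch.
text = (
    "Alice moved to Kansas City last year. "
    "She is studying Python at the university. "
    "Her friends were running experiments with NLTK."
)

# Tokenization: a regex-based tokenizer that needs no downloaded data.
tokens = wordpunct_tokenize(text)

# Stemming: the Porter stemmer is rule-based and also needs no data.
stemmer = PorterStemmer()
stems = [stemmer.stem(token) for token in tokens]

# Trigrams: every run of three consecutive tokens.
tri = list(trigrams(tokens))


def data_dependent_steps(tokens):
    """Run the steps that assume NLTK data packages are installed.

    Assumed packages: WordNet, a POS tagger model, and the NE chunker
    plus its word list. Package names differ across NLTK versions.
    """
    # WordNet: print the first definition found for a few tokens.
    for word in tokens[:5]:
        synsets = wordnet.synsets(word)
        if synsets:
            print(word, "->", synsets[0].definition())

    # POS tagging: a list of (token, tag) pairs.
    tagged = nltk.pos_tag(tokens)

    # Lemmatization: here every token is treated as a verb (pos="v").
    lemmatizer = WordNetLemmatizer()
    lemmas = [lemmatizer.lemmatize(token, pos="v") for token in tokens]

    # Named Entity Recognition: chunk the tagged tokens into a tree.
    entities = nltk.ne_chunk(tagged)
    return tagged, lemmas, entities


print(tokens[:8])
print(stems[:8])
print(tri[:2])
```

Calling `data_dependent_steps(tokens)` raises a `LookupError` if a required data package is missing, so the data-free steps at the top can be checked on their own first.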
The code is here. I could not get the NLTK downloader working on my Mac.