Exercise 1 - akanshagoel07/NLP_Coursework GitHub Wiki

Welcome to the NLP_Coursework wiki!

Questions:

  1.  Why do we use stopwords? Why are stopwords not necessary for an NLP frequency distribution?
    

Stopwords (such as "the", "is", and "of") are needed to form grammatically correct sentences, but they carry little useful information about the content of the text. Because they occur so frequently, they dominate a frequency distribution and can confuse a model, which is why we remove them before counting words.
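To illustrate the effect, here is a minimal sketch that counts word frequencies with and without stopword removal. The sample sentence and the short `STOPWORDS` set are assumptions made up for this example (a real list, such as the one shipped with NLTK, is much longer), and `word_frequencies` is a hypothetical helper, not code from the exercise notebook.

```python
import re
from collections import Counter

# Illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "an", "is", "to", "of", "and", "in", "for", "on"}

def word_frequencies(text, stopwords=frozenset()):
    """Lowercase the text, split it into alphabetic tokens, and count
    every token that is not in the given stopword set."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in stopwords)

# Hypothetical sample text, not the exercise data.
text = "The crew is going to Mars and the crew is ready for the flight to Mars"

raw = word_frequencies(text)                  # stopwords kept
filtered = word_frequencies(text, STOPWORDS)  # stopwords removed

print(raw.most_common(3))
print(filtered.most_common(3))
```

With stopwords kept, "the" is the most frequent token in this sample; once they are removed, content words such as "crew" and "mars" rise to the top.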

  1.  Based on the high-frequency words, what information can you extract from the graph?
    

Based on the high-frequency words, it seems that SpaceX has designed a reusable flight vehicle that can carry a crew to Mars for its next mission.

  1.  Can you provide a different visualization for the frequency distribution? If yes, please do so. If no, why not?
    

Yes, another visualization method can be used. I plotted a bar chart of the same data by converting the frequency dictionary to a DataFrame.
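The dictionary-to-DataFrame step might look roughly like the sketch below. It assumes pandas and matplotlib are installed. The counts in `freq` are placeholder values rather than the actual results from the exercise, and the helper names `freq_to_dataframe` and `plot_bar` are hypothetical.

```python
import pandas as pd

# Placeholder word counts (an assumption, not the exercise results).
freq = {"spacex": 5, "crew": 4, "mars": 3, "flight": 2}

def freq_to_dataframe(freq):
    """Turn a {word: count} dictionary into a DataFrame sorted by count,
    highest first."""
    df = pd.DataFrame(list(freq.items()), columns=["word", "count"])
    return df.sort_values("count", ascending=False).reset_index(drop=True)

def plot_bar(df):
    """Draw a bar chart of word counts and return the matplotlib Axes."""
    import matplotlib.pyplot as plt  # imported here so the conversion works without it
    ax = df.plot.bar(x="word", y="count", legend=False)
    ax.set_xlabel("Word")
    ax.set_ylabel("Frequency")
    plt.tight_layout()
    return ax

df = freq_to_dataframe(freq)
print(df)
```

Calling `plot_bar(df)` followed by `matplotlib.pyplot.show()` displays the chart. Sorting by count first keeps the most frequent words on the left, which makes the bar chart easier to compare with the original frequency plot.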