Word Concurrence - wywfalcon/twitter-healthcare-analysis GitHub Wiki

Word Concurrence

The goal of this project is to find statistics about word pairs in tweets. It also uses the AFINN file, which specifies the sentiment of common words. We can then find out what people are talking about, how often they do and whether they are positive things.

Usage

Get the scores of each word and find pairs

$ get_concurrence [afinnFile] [jsonTweetsFile] [destinationDirectory]

Explanation

The program uses the AFINN file to sum up the score of a word each time it is found in a tweet
The score is recorded into score.csv
Then for each pair of the words found together in the same tweet, the concurrence count and the word-to-word-pair ratio is recorded
The results are exported to the file concurrence