tf idf - taoualiw/My-Knowledge-Base GitHub Wiki
Tf-Idf representation: a numerical statistic intended to reflect how important a word is to a document in a collection or corpus. Tf (term frequency) counts how often a word occurs in a document, like bag of words. Idf (inverse document frequency) weights each word by how rarely it occurs across the documents of the corpus. The product of the two rates words that are rare in the corpus higher than words that are common to most documents.
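
A minimal sketch of the idea in plain Python, assuming one common variant of the formula: tf is the word count divided by the document length, and idf is `log(N / df)`, where `N` is the number of documents and `df` is the number of documents containing the word. The function name `tf_idf` and the toy corpus are hypothetical, and libraries typically use smoothed or normalized variants, so their numbers will differ.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {word: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each word.
    df = Counter(word for doc in docs for word in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            word: (count / len(doc)) * math.log(n_docs / df[word])
            for word, count in counts.items()
        })
    return weights

# Hypothetical toy corpus, already split into words.
corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
scores = tf_idf(corpus)
```

In this toy corpus "the" occurs in every document, so its idf is `log(3 / 3) = 0` and its weight is zero everywhere. In the first document "mat" occurs in only one document of the corpus, so it gets a higher weight than "cat", which occurs in two.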