Bibliography - okalldal/gf-exjobb GitHub Wiki

Tree probabilities

  • Tutorial on context-free grammars PDF

Expectation Maximization

  • Short EM tutorial with a good visualization of the algorithm on a toy coin-toss experiment PDF
  • Slides explaining expectation maximization, with multinomials as an example PDF
  • Paper on how the EM algorithm connects to variational inference PDF
  • Explanation of using a Naive Bayes model together with maximum likelihood (ML) and EM PDF
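
As a rough illustration of the coin-toss setting mentioned above, here is a minimal sketch of EM for two coins with unknown biases. It assumes each set of tosses comes from one of the two coins, chosen with equal probability; the function names, the data, and the starting values are made up for illustration and are not taken from the linked tutorial.

```python
import math


def binomial_likelihood(heads, n, theta):
    """Probability of seeing `heads` heads in `n` tosses of a coin with bias `theta`."""
    return math.comb(n, heads) * theta ** heads * (1 - theta) ** (n - heads)


def em_two_coins(head_counts, n, theta_a, theta_b, iterations=20):
    """Estimate the biases of two coins when we do not know which coin
    produced which set of `n` tosses (equal mixing weights assumed)."""
    history = []  # log-likelihood at the start of each iteration
    for _ in range(iterations):
        heads_a = tails_a = heads_b = tails_b = 0.0
        log_lik = 0.0
        # E-step: compute the responsibility of each coin for each set
        # and accumulate expected head/tail counts.
        for h in head_counts:
            lik_a = binomial_likelihood(h, n, theta_a)
            lik_b = binomial_likelihood(h, n, theta_b)
            resp_a = lik_a / (lik_a + lik_b)
            resp_b = 1.0 - resp_a
            heads_a += resp_a * h
            tails_a += resp_a * (n - h)
            heads_b += resp_b * h
            tails_b += resp_b * (n - h)
            log_lik += math.log(0.5 * lik_a + 0.5 * lik_b)
        history.append(log_lik)
        # M-step: re-estimate each bias from its expected counts.
        theta_a = heads_a / (heads_a + tails_a)
        theta_b = heads_b / (heads_b + tails_b)
    return theta_a, theta_b, history


# Illustrative data: number of heads in five sets of ten tosses.
theta_a, theta_b, history = em_two_coins([5, 9, 8, 4, 7], 10, 0.6, 0.5)
print(theta_a, theta_b)
```

The recorded log-likelihoods should never decrease from one iteration to the next, which is a handy sanity check when adapting the sketch to other models.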

N-grams

  • Kneser-Ney: state-of-the-art smoothing Wikipedia
  • Stupid Backoff: much easier to compute than Kneser-Ney PDF
  • Jurafsky and Martin on N-grams: a good overview, with a section on Kneser-Ney PDF
  • Methods for smoothing Website
  • Good-Turing estimate PDF
  • OpenFst: a framework for working with N-gram models Website
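
To give a feel for why Stupid Backoff is so cheap to compute, here is a minimal sketch of its scoring rule: use the relative frequency of the longest n-gram that has been seen, and multiply by a fixed factor each time the context has to be shortened. The helper names and the toy corpus are made up for illustration; the back-off factor of 0.4 is the value commonly quoted for this method, and the scores are not normalized probabilities.

```python
from collections import Counter


def count_ngrams(tokens, max_order):
    """Count all n-grams of order 1..max_order, keyed by tuples of tokens."""
    counts = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def stupid_backoff(word, context, counts, total_tokens, alpha=0.4):
    """Score `word` given `context` (a sequence of preceding tokens)."""
    context = tuple(context)
    penalty = 1.0
    while context:
        ngram = context + (word,)
        if counts[ngram] > 0:
            # Seen n-gram: plain relative frequency, no discounting.
            return penalty * counts[ngram] / counts[context]
        # Unseen: drop the oldest context word and apply the back-off factor.
        context = context[1:]
        penalty *= alpha
    # Base case: unigram relative frequency.
    return penalty * counts[(word,)] / total_tokens


tokens = "the cat sat on the mat the cat ate".split()
counts = count_ngrams(tokens, max_order=3)
print(stupid_backoff("sat", ("the", "cat"), counts, len(tokens)))
print(stupid_backoff("mat", ("the", "cat"), counts, len(tokens)))
```

The first call finds the trigram directly; the second backs off twice and ends at the unigram count, scaled by the squared back-off factor.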

Variational Inference

  • Slides giving an introduction to variational inference, with a multinomial mixed model as an example PDF
  • Original PhD thesis presenting the variational inference method PDF

WordNet - similarity and information content

  • Resnik similarity: a measure of semantic similarity between synsets, based on the information content of their least common subsumer PDF
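
The idea behind Resnik similarity can be sketched on a toy taxonomy: the probability of a concept is the share of corpus counts falling on it or any of its descendants, its information content is the negative log of that probability, and the similarity of two concepts is the information content of their most informative shared ancestor. The taxonomy, the counts, and the function names below are invented for illustration and do not come from WordNet.

```python
import math
from collections import defaultdict

# Hypothetical is-a taxonomy: child -> parent.
PARENT = {
    "dog": "canine", "wolf": "canine", "cat": "feline",
    "canine": "mammal", "feline": "mammal",
    "mammal": "animal", "bird": "animal",
}

# Hypothetical corpus counts for the leaf concepts.
COUNTS = {"dog": 4, "wolf": 1, "cat": 3, "bird": 2}


def ancestors(concept):
    """Return the concept followed by all of its ancestors up to the root."""
    chain = [concept]
    while concept in PARENT:
        concept = PARENT[concept]
        chain.append(concept)
    return chain


def concept_probabilities(counts):
    """Each observation of a concept also counts for all of its ancestors."""
    total = sum(counts.values())
    cumulative = defaultdict(int)
    for concept, count in counts.items():
        for node in ancestors(concept):
            cumulative[node] += count
    return {node: value / total for node, value in cumulative.items()}


def resnik_similarity(c1, c2, prob):
    """Information content of the most informative common subsumer."""
    common = set(ancestors(c1)) & set(ancestors(c2))
    return max(-math.log(prob[node]) for node in common)


prob = concept_probabilities(COUNTS)
print(resnik_similarity("dog", "wolf", prob))
print(resnik_similarity("dog", "cat", prob))
print(resnik_similarity("dog", "bird", prob))
```

Concepts that only meet at the root get a similarity of zero, since the root covers every observation and therefore carries no information. Concepts with no observed counts would need smoothing before taking the logarithm, which this sketch leaves out.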

Misc

  • Embeddings on UD parsed data PDF
  • Train-O-Matic
    Uses BabelNet to generate a training/evaluation set for word sense disambiguation
    Website PDF
  • Context2vec
    A project that uses an LSTM network to learn a context vector for words in a sentence. Could probably be modified to train on UD-tagged data. PDF GitHub
    Modification of LSTM to work on tree structures PDF

Unix tips and tricks

  • Using sort and uniq to generate n-grams link