Attention is all you need

  • All images from Andrew Ng's Deeplearning.ai course 5: sequence models

  • Existing language models (RNN encoder-decoder)

    • Problems:
      • Encoder compresses the whole input into a single fixed-length vector
      • Network forgets early inputs when sequences get too long
  • Intuition of attention

    • Self attention
    • Look at a weighted window of the input sequence, plus the previous output, to generate the next output
    • w<t,t'> => weight on input position t' when generating output t
    • Weights are produced by a small neural network whose inputs are the previous decoder output/state and the encoder activations, then normalized with a softmax (see the sketch after this list)
    • Other variants exist for non-NLP applications

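A minimal NumPy sketch of the weight computation described above. The one-hidden-layer scoring network, the toy sizes, and the random inputs are illustrative assumptions, not values from the course.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Toy sizes (assumptions): input length, encoder activation size, decoder state size
Tx, n_a, n_s = 5, 4, 4

rng = np.random.default_rng(0)
a = rng.normal(size=(Tx, n_a))   # encoder activations, one row per input position t'
s_prev = rng.normal(size=n_s)    # previous decoder state / output for step t-1

# Small scoring network (assumed form): e<t,t'> = v . tanh(W [s<t-1>; a<t'>])
W = rng.normal(size=(8, n_s + n_a))
v = rng.normal(size=8)

e = np.array([v @ np.tanh(W @ np.concatenate([s_prev, a_tp])) for a_tp in a])
w = softmax(e)      # attention weights w<t,t'>: non-negative, sum to 1
context = w @ a     # weighted sum of encoder activations, used to generate output t
```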

  • Transformer model

    • From 'Attention is all you need', https://arxiv.org/abs/1706.03762
    • Google Brain
    • No encoder RNN; attention only (no recurrence anywhere in the model)
    • Feed-forward architecture:
      • input => activation (hidden state)
      • activation + previous output => output
    • Multi-head attention
      • For each input word, compute h different attentions in parallel (h = 8 in the Transformer)
      • Allows each head to focus on a different part of the sentence (multi-head sketch after this list)
    • Positional encoding added to the input embeddings
      • Encodes position explicitly, since without recurrence the model has no notion of order (positional-encoding sketch after this list)
    • Masked attention in the decoder
      • Prevents attending to future outputs during training (mask sketch after this list)
    • Attention in the encoder (self-attention) as well as the decoder (masked self-attention, plus attention over the encoder output)

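A NumPy sketch of the paper's scaled dot-product attention, Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, wrapped in a multi-head self-attention layer. The random matrices stand in for the learned projections W_Q, W_K, W_V, W_O, and the token count and d_model = 64 are toy assumptions (the paper uses d_model = 512 with h = 8).

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V, mask=None):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, as in the paper."""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-2, -1) / np.sqrt(d_k)
    if mask is not None:
        scores = np.where(mask, scores, -1e9)  # masked positions get ~0 weight
    return softmax(scores) @ V

def multi_head_self_attention(X, h=8, seed=0):
    """One multi-head self-attention layer over token matrix X of shape
    (T, d_model); random matrices stand in for the learned projections."""
    T, d_model = X.shape
    d_k = d_model // h
    rng = np.random.default_rng(seed)
    heads = []
    for _ in range(h):
        Wq, Wk, Wv = (rng.normal(size=(d_model, d_k)) for _ in range(3))
        heads.append(scaled_dot_product_attention(X @ Wq, X @ Wk, X @ Wv))
    Wo = rng.normal(size=(d_model, d_model))
    return np.concatenate(heads, axis=-1) @ Wo  # back to shape (T, d_model)

X = np.random.default_rng(1).normal(size=(10, 64))  # 10 tokens, d_model = 64
out = multi_head_self_attention(X)                  # (10, 64)
```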
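The paper's sinusoidal positional encoding, PE(pos, 2i) = sin(pos / 10000^(2i/d_model)) and PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model)), added to the token embeddings before the first layer. The sizes below are toy assumptions.

```python
import numpy as np

def positional_encoding(max_len, d_model):
    """Sinusoidal positional encodings from the paper (assumes even d_model):
    even columns get sin, odd columns get cos, at geometrically spaced frequencies."""
    pos = np.arange(max_len)[:, None]
    i = np.arange(d_model // 2)[None, :]
    angles = pos / np.power(10000.0, 2 * i / d_model)
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

pe = positional_encoding(50, 64)   # toy sizes (assumptions)
# Added to the token embeddings before the first layer: X = embed(tokens) + pe[:T]
```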
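A sketch of the decoder's causal mask, meant to be passed as the `mask` argument of scaled_dot_product_attention above; the boolean convention (True = may attend) is an assumption of these sketches.

```python
import numpy as np

T = 5
# Lower-triangular boolean mask: True where output position t may attend to t' <= t
causal_mask = np.tril(np.ones((T, T), dtype=bool))

# Passed to scaled_dot_product_attention, this drives scores for future
# positions to -1e9, so their softmax weight is ~0 and the decoder cannot
# peek at later training targets.
print(causal_mask.astype(int))
```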
