Good reference on Attention and Transformer - SoojungHong/MachineLearning GitHub Wiki

http://www.peterbloem.nl/blog/transformers