Good reference on Attention and Transformer - SoojungHong/MachineLearning GitHub Wiki