deep learning - Serbipunk/notes GitHub Wiki

kaiming initialization

a zero-centered Gaussian with standard deviation of sqrt{ 2 / n_l} (variance shown in equation above)