Lecture 9 - AsyDynamics/CS231n GitHub Wiki
Case study
AlexNet
VGG
- smaller filter, deeper networks
- smaller filter has same effective receptive field as deeper layer
GoogLeNet
- Inception module, more efficient
ResNet
- Deeper layers on plain CNN performs worse with larger training error
- solution: use network layers to fit residual mapping, not fit desired underlying mapping directly