Lecture 9 - AsyDynamics/CS231n GitHub Wiki

Case study

AlexNet

VGG

  • smaller filter, deeper networks
  • smaller filter has same effective receptive field as deeper layer

GoogLeNet

  • Inception module, more efficient

ResNet

  • Deeper layers on plain CNN performs worse with larger training error
  • solution: use network layers to fit residual mapping, not fit desired underlying mapping directly