Lecture 4 - AsyDynamics/CS231n GitHub Wiki

Back propagation

  • Local gradient $/times$ upstream gradient
  • Combine some of the operation as pattern, e.g. the sigmoid function
  • add gate - gradient distributor
  • max gate - gradient router
  • mul gate - gradient switcher
  • Jacobian matrix
  • API forward backward

Neural netword

  • before: linear score function
  • now, with multiple layer
  • activation function, many types