Operator Fusion - AshokBhat/ml GitHub Wiki

About

  • An optimization technique
  • To overlap computation with memory accesses in order to conserve compute cycles

Mechanism

  • Look for operators of different types that follow in a sequence
    • Compute-bound, memory-bound, I/O bound...
  • Fuse them into a single operator.

Illustration

See also