Operator Fusion
About
- An optimization technique
- Aims to overlap computation with memory accesses so that compute cycles are not wasted waiting on data
Mechanism
- Look for operators of different types that follow one another in a sequence (compute-bound, memory-bound, I/O-bound...)
- Fuse them into a single operator.
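
The mechanism above can be illustrated with a minimal pure-Python sketch. The operator names (`scale_shift`, `relu`, `fused_scale_shift_relu`) are hypothetical and chosen only for illustration; real frameworks fuse at the kernel or graph level rather than in Python loops. The unfused version materializes a full intermediate result between the two operators, while the fused version computes both steps per element in a single pass.

```python
def scale_shift(xs, w, b):
    """Hypothetical operator 1: elementwise multiply-add."""
    return [w * x + b for x in xs]


def relu(xs):
    """Hypothetical operator 2: elementwise max(0, x)."""
    return [max(0.0, x) for x in xs]


def unfused(xs, w, b):
    # Two passes over the data: the intermediate list is written out
    # in full by the first operator and read back by the second.
    tmp = scale_shift(xs, w, b)
    return relu(tmp)


def fused_scale_shift_relu(xs, w, b):
    # One pass: each element is scaled, shifted, and clamped before
    # moving on, so no intermediate list is ever materialized.
    return [max(0.0, w * x + b) for x in xs]


xs = [-2.0, -0.5, 0.0, 1.5]
result_unfused = unfused(xs, 2.0, 1.0)
result_fused = fused_scale_shift_relu(xs, 2.0, 1.0)
print(result_unfused == result_fused)
```

Both versions produce the same values; the fused one simply avoids the extra write and read of the intermediate result, which is where the saving is expected to come from on real hardware.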