TVM - AshokBhat/ml GitHub Wiki

About

Component	Description
Relay	High-level functional IR used to represent full models
TOPI	Tensor operator library
Runtime	Executes compiled artifacts
Frontend	ingest models from different frameworks into the TVM stack

Optimizing CNN Model Inference on CPUs - https://arxiv.org/pdf/1809.02697.pdf
Efficient Execution of Quantized Deep Learning Models: A Compiler Approach - https://arxiv.org/pdf/2006.10226.pdf