Asynchronous Detection - jenhaoyang/ml_blog GitHub Wiki
References:
https://deci.ai/the-correct-way-to-measure-inference-time-of-deep-neural-networks/
https://pytorch.org/docs/stable/notes/cuda.html#asynchronous-execution
https://www.kumarlab.org/2020/03/27/pytorch-keep-the-gpu-busy/
https://medium.com/@ngoodger_7766/fast-gpu-based-pytorch-model-serving-in-100-lines-of-python-9ad3ebd0a1d9