papers - animeshtrivedi/notes GitHub Wiki
Locking, scaling and synchronization
- Rachid Guerraoui, Hugo Guiroux, Renaud Lachaize, Vivien Quéma, and Vasileios Trigonakis. 2019. Lock–Unlock: Is That All? A Pragmatic Analysis of Locking in Software Systems. ACM Trans. Comput. Syst. 36, 1, Article 1 (February 2018), 149 pages. https://doi.org/10.1145/3301501
QoS/scheduling/Resource management
- FIRM: An Intelligent Fine-grained Resource Management Framework for SLO-Oriented Microservices https://www.usenix.org/conference/osdi20/presentation/qiu
FPGA
ML
- Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight, https://arxiv.org/pdf/2407.08694
- Putting Machine Learning into Production Systems Data validation and software engineering for machine learning, https://queue-acm-org.vu-nl.idm.oclc.org/detail.cfm?id=3365847
- https://github.com/mcanini/SysML-reading-list/blob/master/README.md
- Challenges in Deploying Machine Learning: a Survey of Case Studies, https://arxiv.org/abs/2011.09926v1