Reading List - randalburns/atidic-fall18 GitHub Wiki
Definitely Read
NN performance and GPUs
Stochastic Gradient Descent
Cloud Frameworks
Architecture
Performance
-
Sparse-matrix representations (http://journals.sagepub.com/doi/pdf/10.1177/1094342011403516)
Algorithms/Methods
Gradient Boosted Trees
Approximate Nearest Neighbors
SVM
Spectral Clustering
- W. Y. Chen et al. Parallel Spectral Clustering in Distributed Systems. IEEE Trans PAMI, 33(3), 2011.
- L. Zelnik-Manor and P. Perona. Self-Tuning Spectral Clustering. NIPS 2005.
ISOMAP/LLE
Papers we might read
MLLib
- Meng et al. MLlib: Machine Learning in Apache Spark. JMLR 17, 2016.
- Zaharia et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. NSDI 2012.