Apache Spark Machine Learning (MLlib): Where to start?
-
Step 1: Learn the mathematics behind machine learning. This page should give you what you need to know: http://fastml.com/math-for-machine-learning/
-
Step 2: Rent some machines on AWS (for example, through Amazon EMR) rather than spending time building your own cluster. https://aws.amazon.com/emr/details/spark/
-
Step 3: Work through this book and complete all of its exercises: http://tinyurl.com/z4sznu2
-
Step 4: Challenge yourself with real-world machine learning problems on Kaggle. https://www.kaggle.com/competitions
- From www.packtpub.com:
  - Ebook "Spark for Data Science" (September 2016)
  - Ebook "Large Scale Machine Learning with Spark" (October 2016)
  - Ebook "Machine Learning with Spark" (February 2015)
  - Alpha Program Video "Data Science with Spark" (October 31, 2016)