wikipedia data with apache spark - vaquarkhan/Apache-Kafka-poc-and-notes GitHub Wiki
https://www.infoq.com/presentations/wikipedia-apache-spark
http://www.slideshare.net/SandyRyza/lsa-47411625
http://mindfulmachines.io/blog/2015/12/20/wikipedia-data-in-spark
https://www.percona.com/blog/2015/10/07/using-apache-spark-mysql-data-analysis/
https://dumps.wikimedia.org/other/pagecounts-raw/
https://github.com/datawrangling/trendingtopics/blob/master/README.textile