Scalable Data Science book data set for analysis - vaquarkhan/Apache-Kafka-poc-and-notes GitHub Wiki