Home - vidyasekaran/bigdata_frameworks_components GitHub Wiki
Welcome to the bigdata_frameworks_components wiki!
Good site to setup hadoop in ubuntu http://codesfusion.blogspot.in/2013/10/setup-hadoop-2x-220-on-ubuntu.html
Course details https://www.greatlearning.in/pdf/cloud-computing-program-brochure.pdf
bits pilani big data engineering http://www.bits-pilani.ac.in/university/wilp/BigDataEngineering
sentiment analysis http://blog.cloudera.com/blog/2012/11/analyzing-twitter-data-with-hadoop-part-3-querying-semi-structured-data-with-hive/
http://blog.cloudera.com/blog/2012/10/analyzing-twitter-data-with-hadoop-part-2-gathering-data-with-flume/ Twitter Avro to JSON https://stackoverflow.com/questions/37324561/retrieving-data-from-twitter-using-flume-and-storing-to-hdfs-in-json-format
MapReduce on Avro Data Files https://dzone.com/articles/mapreduce-avro-data-files
miguno/avro-hadoop-starter https://github.com/miguno/avro-hadoop-starter
Big data serialization using Apache Avro with Hadoop https://www.ibm.com/developerworks/library/bd-avrohadoop/index.html
Spark Kerberos authentication Refer chap 13 - Mastering Spark for Data Science by Matthew Hallett; Antoine Amend; Andrew Morgan; David George