Imp links - veeraravi/Spark-notes GitHub Wiki
https://docs.databricks.com/spark/latest/data-sources/read-parquet.html https://docs.databricks.com/spark/latest/rdd-streaming/tips-for-running-streaming-apps-in-databricks.html https://jaceklaskowski.gitbooks.io/mastering-apache-spark/spark-sql-Dataset.html https://github.com/hyzhangsf/stat133-1/tree/master/datasets http://spark.apache.org/docs/latest/sql-programming-guide.html https://www.balabit.com/blog/spark-scala-dataset-tutorial/ https://databricks.com/blog/2016/01/04/introducing-apache-spark-datasets.html https://docs.cloud.databricks.com/docs/spark/1.6/index.html#examples/Star%20Expansion%20in%20SQL%20and%20DataFrames.html https://indatalabs.com/blog/data-engineering/convert-spark-rdd-to-dataframe-dataset https://blog.codecentric.de/en/2016/07/spark-2-0-datasets-case-classes/ https://github.com/udavPit/spark-user-feedback/blob/master/src/main/scala/feedback/Feedback.scala http://blog.madhukaraphatak.com/categories/spark-two/ https://github.com/phatak-dev/spark2.0-examples https://databricks.com/blog/2015/01/09/spark-sql-data-sources-api-unified-data-access-for-the-spark-platform.html https://hortonworks.com/tutorial/getting-started-with-apache-zeppelin/ http://www.sparkexpert.com/tag/data-sources-api/ --- spark to rdbms https://databricks.com/blog/2017/08/31/cost-based-optimizer-in-apache-spark-2-2.html https://databricks.com/blog/2016/01/04/introducing-apache-spark-datasets.html http://www.agildata.com/apache-spark-rdd-vs-dataframe-vs-dataset/