Lesson 20 Distributed Processing with Spark - adparker/GADSLA_1403 GitHub Wiki
####Slides
Handouts
Links
- Spark Examples
- Ampcamp Slides and Exercises
- Spark is a Crossover Hit For Data Scientists
- Sketching Data Structures
- Probabilistic Data Structures for Web Analytics and Data Mining
- Streaming Algorithms and Sketches
- Fast, Cheap, and 98% Right: Cardinality Estimation for Big Data
- Algebra for Analytics