Streams and Pocessing - GregLinthicum/From-Logistic-Regression-to-Long-short-term-memory-RNN GitHub Wiki
Spark
AWS EMR - Submitting Spark Jobs notifyme text
ETL Pipeline in Apache Spark and Python
ITVersity Big Data Labs — Hive Starter Kits
Building Spark JAR Files with SBT
AWS
Crawler clickology starts at 11:20
Amazon EMR Deep Dive 2020 Scala Script Example - Streaming ETL
AWS DynamoDB Streams to Lambda Tutorial in Python
Scala
typesafe-config-2.10.1.jar, Configuration library for JVM languages
Scala Testing
Scala (v3) Testing With ScalaTest styles
Let’s write some tests for Spark Scala DataFrame transformations using Mockito and scalatest
Automated Data Quality Testing at Scale using Apache Spark (+Deequ)
Testing data quality at scale with PyDeequ