distributed data systems - taoualiw/My-Knowledge-Base GitHub Wiki

Distributed data systems

  • parallel data base and processing
  • MapReduce concept
  • Hadoop ecosystem (Sqoop, MapRed)
  • Apache Flume, Scribe for large log data
  • Apache Pig, Apache Spark
  • Apache Hive
  • MongoDB
  • cloud architecture like AWS
⚠️ **GitHub.com Fallback** ⚠️