Overview of Hadoop Ecosystem - kaushikdas/TechnicalWritings GitHub Wiki

HDFS (part of Hadoop): Hadoop equivalent of GFS version distributes the storage of big data across cluster of computers virtually creating a giant file system of all the hard drives on the cluster maintains redundant copies of that data for failure recovery