Technologies - gt-big-data/wiki GitHub Wiki

The following are some different tools and technologies for working with big data

  • Cassandra: A distributed database for large data
  • D3.js: A JavaScript framework for data visualization
  • Elastic Search: Full text search
  • Hadoop: Hadoop distributed filesystem and MapReduce framework
  • Mahout: Suite of machine learning tools for Hadoop
  • NLTK: Natural language toolkit, a Python package for NLP
  • Pig: An Extract-Transform-Load tool
  • Tableau: Data visualization software