Scalable Advanced Massive Online Analysis - arinto/samoa GitHub Wiki

Scalable Advanced Massive Online Analysis (SAMOA) is a platform for mining big data streams.

It contains various algorithms for machine learning and data mining on data streams, and allows to run them on different distributed stream processing engines (SPEs) such as Storm and S4.

Currently we support classification via Vertical Hoeffding Trees and clustering via CluStream.