Hadoop Performance - MatteoDJon/CloudProgrammingTonellotto GitHub Wiki

The abscissas of the table correspond to the tuple (number of components of a point, number of clusters) while the ordinates correspond to the number of points in input. The performances have as unit of measure the second.

Mapper, Reducer

3,7 3,13 7,7 7,13
1000
10000
100000

Mapper, Combiner, Reducer

3,7 3,13 7,7 7,13
1000
10000
100000

Mapper, More Reducer

3,7 3,13 7,7 7,13
1000
10000
100000

Mapper, Combiner, More Reducer

3,7 3,13 7,7 7,13
1000 217.9 413.1 204.9 348.3
10000 270.6 363.5 181.2 314.0
100000 327.3 479.1 157.6 391.9