Network/disk latencies are expensive compared to other operations ⇒ reducing transferring intermediate data increases algorithmic efficiency.

Combiners are one mechanism for local aggregation.

image.png

Advantages of In-Mapper Combining

Drawbacks of In-Mapper Combining

Extent to which efficiency can be increased through local aggregation depends on: