If there is a big burst of data, we may not be able to examine all of it.

Thus, we consider algorithmic solutions to this problem.

Sampling

Hashing

We may also be unable to store the data in a form that lets us answer questions quickly. However, hashing lets us estimate the results of common multiset operations:

To estimate the cardinality of a multiset (the number of distinct elements it contains), we use a HyperLogLog (HLL) counter.
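
As a rough illustration of how such a counter can work, here is a minimal sketch of a HyperLogLog-style estimator. The class name `HyperLogLog`, the precision parameter `p`, and the choice of SHA-1 as the hash function are assumptions made for this example and are not taken from these notes. The sketch omits refinements found in production implementations, such as large-range correction and sparse register storage.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative HyperLogLog-style counter with 2**p registers."""

    def __init__(self, p=12):
        self.p = p                      # number of hash bits used as register index
        self.m = 1 << p                 # number of registers
        self.registers = [0] * self.m
        # Bias-correction constant; this closed form is intended for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item):
        # Derive a 64-bit hash from the first 8 bytes of SHA-1 (an assumption;
        # any well-mixed 64-bit hash would serve).
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        width = 64 - self.p
        idx = h >> width                      # top p bits select a register
        rest = h & ((1 << width) - 1)         # remaining bits
        # Rank: 1-based position of the leftmost 1-bit in `rest`
        # (width + 1 if `rest` is zero).
        rank = width - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self):
        # Normalized harmonic mean of 2**register over all registers.
        z = sum(2.0 ** -r for r in self.registers)
        estimate = self.alpha * self.m * self.m / z
        zeros = self.registers.count(0)
        if estimate <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            estimate = self.m * math.log(self.m / zeros)
        return estimate


if __name__ == "__main__":
    hll = HyperLogLog()
    for i in range(100_000):
        hll.add(i)
        hll.add(i)                      # duplicates do not change the sketch
    print(round(hll.count()))
```

Adding the same item twice leaves the registers unchanged, which is why the counter estimates the number of distinct elements and not the total number of insertions. With `p=12` the sketch keeps 4096 small registers regardless of input size, and the typical relative error is on the order of 1.04 / sqrt(4096), or about 1.6%.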