There is an extremely high volume that is produced daily + needs to be stored.

The price of storage has been decreasing over time:

image.png

image.png

Idea of data scale:

Main sources of data are from:

We need to be able to process this data.

Rather than thinking about how to design a distributed network, we just use one.