Spark stream processing frameworks include Apache Spark Streaming, Apache Storm, and Apache Flink.

With Spark Streaming, we often deal with discrete streams (DStream ).

image.png

Key concepts with Spark Streaming:

In general, incremental counting generalizes to many reduce operations. We need a function to “inverse reduce” (e.g. “subtract” for counting).

image.png

Spark streaming is fast:

image.png