Notes on Spark Structured Streaming

Some key notes on Structured Streaming

Posted by Hanke on December 16, 2020

Deduplicate

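Duplicates are removed within the bound set by a watermark: state for records older than the watermark is dropped, so the deduplication state does not grow without limit.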
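To make the idea concrete, here is a minimal sketch in plain Python of how a watermark bounds deduplication state. It is a toy model, not Spark code: the class name WatermarkDeduplicator, its fields, and the integer timestamps are all assumptions made for illustration. In Spark itself the usual pattern is withWatermark on the event-time column followed by dropDuplicates, and the watermark advances per micro-batch rather than per record as it does here.

```python
from dataclasses import dataclass, field


@dataclass
class WatermarkDeduplicator:
    """Toy model of watermark-bounded deduplication (hypothetical, not a Spark API)."""

    delay: int                                # allowed lateness, same unit as event_time
    max_event_time: int = 0                   # largest event time seen so far
    seen: dict = field(default_factory=dict)  # key -> event time of first occurrence

    @property
    def watermark(self) -> int:
        # The watermark trails the largest event time by the allowed delay.
        return self.max_event_time - self.delay

    def process(self, key, event_time: int) -> bool:
        """Return True if the record is emitted, False if it is dropped."""
        # Records older than the watermark are too late and are dropped.
        if event_time < self.watermark:
            return False
        is_new = key not in self.seen
        if is_new:
            self.seen[key] = event_time
        # Simplification: advance the watermark after every record.
        self.max_event_time = max(self.max_event_time, event_time)
        # Evict state that has fallen behind the watermark.
        wm = self.watermark
        self.seen = {k: t for k, t in self.seen.items() if t >= wm}
        return is_new


dedup = WatermarkDeduplicator(delay=10)
results = [
    dedup.process("a", 100),  # first occurrence: emitted
    dedup.process("a", 105),  # duplicate while "a" is still in state: dropped
    dedup.process("b", 120),  # emitted; watermark moves to 110, state for "a" is evicted
    dedup.process("a", 101),  # older than the watermark: dropped as late
]
```

The point of the model is the trade-off: a longer delay catches duplicates that arrive later, at the cost of keeping more keys in state.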

Custom Stateful Processing

Use mapGroupsWithState and flatMapGroupsWithState to customize your stateful processing.
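The sketch below models the shape of this kind of processing in plain Python: a user function is called once per key with the new values for that key and a state handle that survives across batches. It is not the Spark API. The GroupState class here is a hypothetical stand-in that only mimics the exists / get / update / remove idea, and map_groups_with_state, running_count, and the in-memory store are assumptions for illustration. Timeouts are left out entirely.

```python
from collections import defaultdict


class GroupState:
    """Hypothetical stand-in for a per-key state handle."""

    def __init__(self):
        self._value = None
        self.exists = False

    def get(self):
        if not self.exists:
            raise KeyError("state is not set")
        return self._value

    def update(self, value):
        self._value = value
        self.exists = True

    def remove(self):
        self._value = None
        self.exists = False


def map_groups_with_state(batch, func, store):
    """Call func(key, values, state) once per key in the batch; return one result per key."""
    groups = defaultdict(list)
    for key, value in batch:
        groups[key].append(value)
    # The store keeps one GroupState per key between batches.
    return [
        func(key, values, store.setdefault(key, GroupState()))
        for key, values in groups.items()
    ]


def running_count(key, values, state):
    """User-defined logic: keep a running event count per key."""
    total = (state.get() if state.exists else 0) + len(values)
    state.update(total)
    return (key, total)


store = {}
batch1 = map_groups_with_state(
    [("alice", "click"), ("bob", "view"), ("alice", "view")], running_count, store
)
batch2 = map_groups_with_state([("alice", "click")], running_count, store)
```

In this model the function returns exactly one result per key, which matches the map variant. The flatMap variant differs in that the function may return zero or more results per key.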

Reference