Tathagata Das created SPARK-11814:
-------------------------------------
Summary: Set better default DStream checkpoint interval
Key: SPARK-11814
URL: https://issues.apache.org/jira/browse/SPARK-11814
Project: Spark
Issue Type: Improvement
Reporter: Tathagata Das
Assignee: Tathagata Das
DStream checkpoint interval is by default set at max(10 second, batch
interval). That's bad for large batch intervals where the checkpoint interval =
batch interval, and RDDs get checkpointed every batch.
This JIRA is to set the checkpoint interval of trackStateByKey to 10 * batch
duration.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]