[GitHub] spark issue #21220: [SPARK-24157][SS] Enabled no-data batches in MicroBatchE...

tdas Fri, 04 May 2018 01:31:36 -0700

Github user tdas commented on the issue:

    https://github.com/apache/spark/pull/21220
  
    @brkyvz Answers to your questions.
    
    1. We have already fixed those emptyDF optimizations. The optimization only 
kicks in when `df.isStreaming = false`, and emptyDFs generated by sources should 
have isStreaming = true. That's for v1 sources. For v2 sources, the engine 
already takes care of that by making sure that 
`StreamingDataSourceV2Relation.isStreaming` is true, where 
`StreamingDataSourceV2Relation` is the logical plan leaf inserted into the 
micro-batch logical plan irrespective of whether the batch is empty or not.
    
    2. I will do both types of timeouts in flatMapGroupsWithState in a later PR.
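
    To illustrate point 1, here is a minimal toy sketch of the guard described 
there. It is not Spark code: `Relation`, `Project`, `is_empty_batch_relation` and 
`propagate_empty` are hypothetical names made up for this illustration, and the 
real optimizer rule is assumed to be more general. The only thing it shows is 
that an empty leaf is collapsed when its streaming flag is false and left alone 
when the flag is true.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    """Hypothetical stand-in for a logical plan leaf (not a Spark class)."""
    rows: tuple
    is_streaming: bool


@dataclass(frozen=True)
class Project:
    """Hypothetical unary plan node that selects columns from its child."""
    child: object
    columns: tuple


def is_empty_batch_relation(plan) -> bool:
    # The shortcut applies only to leaves that are empty AND not streaming.
    return isinstance(plan, Relation) and not plan.rows and not plan.is_streaming


def propagate_empty(plan):
    """Collapse a projection over an empty batch leaf into an empty leaf."""
    if isinstance(plan, Project):
        child = propagate_empty(plan.child)
        if is_empty_batch_relation(child):
            return Relation(rows=(), is_streaming=False)
        return Project(child, plan.columns)
    return plan


# Same shape of plan, differing only in the streaming flag of the empty leaf.
batch_plan = Project(Relation(rows=(), is_streaming=False), ("value",))
stream_plan = Project(Relation(rows=(), is_streaming=True), ("value",))
```

    In this toy model `propagate_empty(batch_plan)` collapses to an empty 
`Relation`, while `propagate_empty(stream_plan)` returns the plan unchanged, so 
an empty micro-batch whose leaf is marked as streaming would still be executed.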




---

