[GitHub] [spark] xccui commented on pull request #28523: [SPARK-31706][SQL] add back the support of streaming update mode

GitBox Thu, 14 May 2020 22:57:27 -0700


xccui commented on pull request #28523:
URL: https://github.com/apache/spark/pull/28523#issuecomment-629043813



   @HeartSaVioR Yes. It seems the output mode option was mainly designed for 
stateful aggregations, which means it actually works in a restricted way.
   
   Ideally, to support complete mode, all the operators must be capable of 
outputting the "complete" result seen so far for each epoch. Personally, I'm in 
favor of removing this mode in a future version. But for now, I propose to add 
more restrictions while doing the plan check (e.g., disallowing the union 
situation you mentioned) and also a note to the documentation.
   
   IMO, the mode of the result table should only be decided by the operators in 
the plan and it could either be "append" or "update" (including the current 
"complete" mode). Basically, the designated sink should match the mode of the 
result table. Usually, supporting "update" needs more effort and that means 
only part of the sinks could be chosen for a plan containing an aggregation or 
some kind of joins.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] xccui commented on pull request #28523: [SPARK-31706][SQL] add back the support of streaming update mode

Reply via email to