Xinyu Liu created SAMZA-2801:
--------------------------------

             Summary: Support excluding tasks from watermark computation when 
exceeding idle time
                 Key: SAMZA-2801
                 URL: https://issues.apache.org/jira/browse/SAMZA-2801
             Project: Samza
          Issue Type: Improvement
            Reporter: Xinyu Liu
            Assignee: Xinyu Liu


Currently, the Samza event-time watermark aggregation logic computes the 
watermark as the minimum of the watermarks from all upstream tasks. However, 
when an upstream task is lagging, it might not generate a watermark for a long 
period. In this case, no new aggregated watermark is generated and the 
downstream aggregation is stuck.

To address this issue, we will implement a mechanism to exclude tasks that 
have been "idle" in generating watermarks for a configured time, so that 
aggregated watermarks will still be generated. Note that this mechanism 
unblocks the downstream, but at the risk of advancing the event-time clock 
too quickly, so that events from the lagging tasks become late arrivals.
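
The idea above can be sketched roughly as follows. This is only an 
illustration, not the actual Samza implementation: the class name 
IdleAwareWatermarkAggregator, its methods, the maxIdleMs parameter and the 
NO_WATERMARK sentinel are all hypothetical, and the caller is assumed to pass 
in the current wall-clock time. The aggregate is the minimum over the 
non-idle upstream tasks and is never allowed to move backwards.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; names and structure are assumptions, not Samza's API.
class IdleAwareWatermarkAggregator {
  static final long NO_WATERMARK = -1L;

  private final long maxIdleMs;                                    // assumed configured idle time
  private final Map<String, Long> watermarks = new HashMap<>();   // latest watermark per upstream task
  private final Map<String, Long> lastUpdateMs = new HashMap<>(); // wall-clock time of last update
  private long emitted = NO_WATERMARK;                             // last aggregated watermark

  IdleAwareWatermarkAggregator(Iterable<String> upstreamTasks, long maxIdleMs, long nowMs) {
    this.maxIdleMs = maxIdleMs;
    for (String task : upstreamTasks) {
      watermarks.put(task, NO_WATERMARK);
      lastUpdateMs.put(task, nowMs); // the idle timer starts at construction
    }
  }

  // Record a watermark received from one upstream task.
  void update(String task, long watermark, long nowMs) {
    watermarks.merge(task, watermark, Long::max);
    lastUpdateMs.put(task, nowMs);
  }

  // Min over the tasks that are not idle; the result never decreases.
  long aggregate(long nowMs) {
    long min = Long.MAX_VALUE;
    for (Map.Entry<String, Long> e : watermarks.entrySet()) {
      boolean idle = nowMs - lastUpdateMs.get(e.getKey()) > maxIdleMs;
      if (idle) {
        continue; // excluded from the computation
      }
      if (e.getValue() == NO_WATERMARK) {
        return emitted; // an active task has not reported yet, so hold
      }
      min = Math.min(min, e.getValue());
    }
    if (min != Long.MAX_VALUE) {
      emitted = Math.max(emitted, min);
    }
    return emitted; // unchanged if every task is idle
  }
}
```

In this sketch, once a task has been silent for longer than maxIdleMs, the 
aggregate follows the remaining tasks. When the idle task reports again with 
an older watermark, the aggregate does not regress, which is exactly the 
case where that task's events show up as late arrivals.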



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
