pnowojski commented on PR #20485:
URL: https://github.com/apache/flink/pull/20485#issuecomment-1231242878

   > I wonder about the way the SourceOperator calls through the stack via 
pauseOrResumeSplits while also being called back with watermarks. My intuition 
would have been to do the pausing directly at the splits based on the current 
max watermark lag, e.g. by putting splits into a suspended mode where no more 
output is accepted from a split. I guess this could be tricky because the 
implementation of the source doesn't necessarily know it's paused and could 
timeout or otherwise misbehave. So it looks like the current implementation is 
simpler.
   
   Currently, splits are just POJOs: data carriers without much logic. If I 
understand you correctly, what you are suggesting would require us to push some 
availability logic down to the splits, and it would still require handling 
this availability in the `SourceReader`, which is basically the same logic that 
we have in the current proposal. The only difference would be that the 
`SourceReader` would be informed about a split's (un)availability change by the 
split itself, instead of by the `pauseOrResumeSplits` call.
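
   To make the comparison concrete, here is a minimal sketch of the 
reader-side bookkeeping I have in mind. It is not the actual Flink code: the 
class `PausableReaderSketch` and its `addRecord`/`pollNext` methods are 
hypothetical, records are plain strings, and only the shape of the 
`pauseOrResumeSplits` call follows the proposal. The point is that the split 
stays a plain data carrier and the reader alone tracks which split IDs are 
paused.

```java
import java.util.ArrayDeque;
import java.util.Collection;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Queue;
import java.util.Set;

// Hypothetical sketch, not Flink code: splits carry no availability logic;
// the reader keeps the set of paused split IDs.
final class PausableReaderSketch {
    // Buffered records per split ID (insertion-ordered so polling is deterministic).
    private final Map<String, Queue<String>> buffered = new LinkedHashMap<>();
    // IDs of splits that must not emit records right now.
    private final Set<String> paused = new HashSet<>();

    // Buffers a record for the given split (stands in for fetching from the source).
    void addRecord(String splitId, String record) {
        buffered.computeIfAbsent(splitId, id -> new ArrayDeque<>()).add(record);
    }

    // Same shape as the call discussed above; if an ID appears in both
    // collections, the resume wins because it is applied last.
    void pauseOrResumeSplits(Collection<String> toPause, Collection<String> toResume) {
        paused.addAll(toPause);
        paused.removeAll(toResume);
    }

    // Returns the next record from the first non-paused split that has data,
    // or null if every split is either paused or empty.
    String pollNext() {
        for (Map.Entry<String, Queue<String>> entry : buffered.entrySet()) {
            if (!paused.contains(entry.getKey()) && !entry.getValue().isEmpty()) {
                return entry.getValue().poll();
            }
        }
        return null;
    }
}
```

   With splits `s1` and `s2` both holding data, pausing `s1` makes `pollNext` 
return only records from `s2` until `s1` is resumed. Pushing the same state 
into the split would not remove this check from the reader; it would only 
change who flips the flag.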
   
   Given that most users are expected to just use `SourceReaderBase`, 
the current proposal gives them watermark alignment for free. Adding 
availability logic to the splits, on the other hand, would I think increase the 
exposure of this feature in the public API without much benefit.
   
   > Any thoughts on how and when to proceed with this PR? We'd like to help to 
bring this over the finish line.
   
   Indeed, there is an issue now: I think @smattheis doesn't work on Flink 
anymore, so he cannot finish this PR. I will go through the PR myself, do a 
quick pass, and respond to the inline questions from you, @mxm and @mas-chen, 
but it would be great if someone could take over applying the actual changes 
after the review. Would you be up for doing that, @mxm?
   
   

