1032851561 commented on issue #8087:
URL: https://github.com/apache/hudi/issues/8087#issuecomment-1484344281

   > > The first checkpoint barrier is behind 1200 splits.
   > 
   > So you are talking about that the barrier is queued up behind of these 
input splits, in `#processElement`, we just put the input split into a local 
queue, and then call up an async task through `#enqueueProcessSplits` to ask 
for the checkpoint lock for split processing, so the split processing and split 
cache/barrier are handle in different thread, why like you said, the barrier 
can be blocked?
   > 
   > When a split is in processing, the flag `currentSplitState` should be 
`SplitState.RUNNING` then the `#enqueueProcessSplits` should return as fast, 
the checkoint barrier could find a chance to jump in the mail box queue.
   
   `processElement`,`snapshotState`,`processSplits` are handle in same thread, 
each split cache trigger at least a mini batch consume.
   
   
![image](https://user-images.githubusercontent.com/15909510/227820890-387dfaa1-25c8-48ab-8999-44f68700131a.png)
   
![image](https://user-images.githubusercontent.com/15909510/227820908-0cd1470e-d06f-4838-aa65-8b77c5b42c56.png)
   
![image](https://user-images.githubusercontent.com/15909510/227820913-b382c914-a96f-4d0e-a9d8-d3e319b66e15.png)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to