1032851561 commented on issue #8087: URL: https://github.com/apache/hudi/issues/8087#issuecomment-1484344281
> > The first checkpoint barrier is behind 1200 splits. > > So you are talking about that the barrier is queued up behind of these input splits, in `#processElement`, we just put the input split into a local queue, and then call up an async task through `#enqueueProcessSplits` to ask for the checkpoint lock for split processing, so the split processing and split cache/barrier are handle in different thread, why like you said, the barrier can be blocked? > > When a split is in processing, the flag `currentSplitState` should be `SplitState.RUNNING` then the `#enqueueProcessSplits` should return as fast, the checkoint barrier could find a chance to jump in the mail box queue. `processElement`,`snapshotState`,`processSplits` are handle in same thread, each split cache trigger at least a mini batch consume.    -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
