Yingjie Cao created FLINK-17568:
-----------------------------------

             Summary: Task may consume data after checkpoint barrier before 
performing checkpoint for unaligned checkpoint
                 Key: FLINK-17568
                 URL: https://issues.apache.org/jira/browse/FLINK-17568
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
    Affects Versions: 1.11.0
            Reporter: Yingjie Cao
             Fix For: 1.11.0


For unaligned checkpoint, task may consume data after the checkpoint barrier 
before performing checkpoint which lead to consumption of duplicated data and 
corruption of data stream.

More specifically, when the Netty thread notifies the checkpoint barrier for 
the first time and enqueue a checkpointing task in the mailbox, the task thread 
may still in data consumption loop and if it reads a new checkpoint barrier 
from another channel it will not return to the mailbox and instead it will 
continue to read data until a all data consumed or we have a full record, 
meanwhile, the data after checkpoint barrier may be read and consumed which 
lead to inconsistency.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to