ann created HUDI-7030:
-------------------------

             Summary: Log reader data lost as that not consistent behavior in 
timeline's containsInstant 
                 Key: HUDI-7030
                 URL: https://issues.apache.org/jira/browse/HUDI-7030
             Project: Apache Hudi
          Issue Type: Bug
    Affects Versions: 0.14.0
            Reporter: ann
         Attachments: image-2023-11-03-19-48-29-441.png, 
image-2023-11-03-19-49-22-894.png, image-2023-11-03-19-50-11-849.png, 
image-2023-11-03-19-58-39-495.png, image-2023-11-03-20-06-00-579.png, 
image-2023-11-03-20-06-13-905.png, image-2023-11-03-20-07-30-201.png

Log reader is filtered all log data blocks which come from inflight instant. 
!image-2023-11-03-19-49-22-894.png!

*containsInstant* return false when input instant's timestamp is not equal as 
anyone instant timestamp in inflight timeline. 
!image-2023-11-03-20-07-30-201.png!

But now, in timeline's *containsInstant* that input is instant's timestamp, it 
would return true.

!https://issues.apache.org/jira/secure/temporaryattachment/e53a9a4d763501eff6baeff2832c2a2c44a30713/temp2778457713180842709_image-2023-11-03-20-06-13-905.png!

When input is the instant with default_millis_ext, instant's timestamp is less 
than someone instant timestamp in timeline. 
!image-2023-11-03-19-50-11-849.png!

In finally, log reader skipped the completed delta commit instant and caused 
data lost.
!image-2023-11-03-19-58-39-495.png!

I think timeline's containsInstant should have consistent behavior and update 
containsOrBeforeTimelineStarts to containsInstant
!image-2023-11-03-19-48-29-441.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to