Re: [PR] [HUDI-7102] Fix a bug for time travel queries on MOR tables [hudi]

via GitHub Tue, 28 Nov 2023 16:56:17 -0800


yihua commented on code in PR #10102:
URL: https://github.com/apache/hudi/pull/10102#discussion_r1408590210



##########
hudi-common/src/main/java/org/apache/hudi/common/table/log/BaseHoodieLogRecordReader.java:
##########
@@ -260,7 +260,7 @@ private void scanInternalV1(Option<KeySpec> keySpecOpt) {
             && 
!HoodieTimeline.compareTimestamps(logBlock.getLogBlockHeader().get(INSTANT_TIME),
 HoodieTimeline.LESSER_THAN_OR_EQUALS, this.latestInstantTime
         )) {
           // hit a block with instant time greater than should be processed, 
stop processing further
-          break;
+          continue;
         }

Review Comment:
   The `BaseHoodieLogRecordReader` is used by the new file group reader only 
and this one uses `HoodieLogFormatReverseReader` which reads the log files in 
reverse order based on the input log file list.  Right now for time travel 
queries, the file system view still returns more logs than required (e.g., bf, 
log_t1, log_t2, log_t3, log_t4, for time travel query with `as of t2`), so this 
change makes sure the processing does not stop unnecessarily before the file 
system view returns the exact files to process.
   
   @linliu-code could you add this detail to the other ticket fixing the file 
system view APIs based on completion time, so we should revisit this part 
alongside the FSV fix?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] [HUDI-7102] Fix a bug for time travel queries on MOR tables [hudi]

Reply via email to