alexeykudinkin commented on code in PR #5241:
URL: https://github.com/apache/hudi/pull/5241#discussion_r844303357


##########
hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java:
##########
@@ -64,7 +64,7 @@ private HoodieMetadataMergedLogRecordReader(FileSystem fs, 
String basePath, Stri
                                               ExternalSpillableMap.DiskMapType 
diskMapType,
                                               boolean 
isBitCaskDiskMapCompressionEnabled,
                                               Option<InstantRange> 
instantRange, boolean enableFullScan) {
-    super(fs, basePath, logFilePaths, readerSchema, latestInstantTime, 
maxMemorySizeInBytes, false, false, bufferSize,
+    super(fs, basePath, logFilePaths, readerSchema, latestInstantTime, 
maxMemorySizeInBytes, true, false, bufferSize,

Review Comment:
   We should not couple those -- these configs control different aspects:
   
   1. `forceFullScan` (renamed it to make its semantic crystal clear) -- forces 
to read all records from the block
   2. `readBlocksLazily` -- controls whether we merge blocks eagerly as we read 
them or backward-pass when all of them are read.
   
   We should remove `readBlocksLazily` altogether actually and leave only 
backward-pass behavior, since the other alternatives will be yielding incorrect 
merge results (one example is HUDI-3342, another example is incorrect handling 
of deletes)
   
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to