[GitHub] [hudi] nsivabalan commented on a change in pull request #3762: [HUDI-1294] Adding inline read and seek based read(batch get) for hfile log blocks in metadata table

GitBox Wed, 13 Oct 2021 16:30:52 -0700


nsivabalan commented on a change in pull request #3762:
URL: https://github.com/apache/hudi/pull/3762#discussion_r728518751




##########
File path: 
hudi-common/src/main/java/org/apache/hudi/metadata/BaseTableMetadata.java
##########
@@ -126,23 +130,21 @@ protected BaseTableMetadata(HoodieEngineContext 
engineContext, HoodieMetadataCon
   }
 
   @Override
-  public Map<String, FileStatus[]> getAllFilesInPartitions(List<String> 
partitionPaths)
+  public Map<String, FileStatus[]> getAllFilesInPartitions(List<String> 
partitions)
       throws IOException {
     if (enabled) {
-      Map<String, FileStatus[]> partitionsFilesMap = new HashMap<>();
-
       try {
-        for (String partitionPath : partitionPaths) {
-          partitionsFilesMap.put(partitionPath, fetchAllFilesInPartition(new 
Path(partitionPath)));
-        }
+        // need to understand why we did not make bulk get before

Review comment:
       from what I infer, with HoodieMergedLogRecordScanner, we first read all 
records from all log blocks and prepare a hash map of records(record key to 
HoodieRecord). And we don't do seek based read prior to this patch and so we do 
read all log records from all log blocks. so was bit curious. 
   




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] nsivabalan commented on a change in pull request #3762: [HUDI-1294] Adding inline read and seek based read(batch get) for hfile log blocks in metadata table

Reply via email to