[GitHub] [hive] difin commented on a diff in pull request #3559: HIVE-26496,Improvement to fetch operator to scan only delete_delta fo…

GitBox Wed, 14 Sep 2022 11:59:05 -0700


difin commented on code in PR #3559:
URL: https://github.com/apache/hive/pull/3559#discussion_r971190432



##########
ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcSplit.java:
##########
@@ -104,28 +104,43 @@ public OrcSplit(Path path, Object fileId, long offset, long length, String[] hos
     this.isOriginal = isOriginal;
     this.hasBase = hasBase;
     this.rootDir = rootDir;
-    this.deltas.addAll(filterDeltasByBucketId(deltas, AcidUtils.parseBucketId(path)));
+    int bucketId = AcidUtils.parseBucketId(path);
+    AcidUtils.ParsedDeltaLight parentDelta = AcidUtils.ParsedDeltaLight.parse(getPath().getParent());

Review Comment:
   Hi @deniskuzZ,
   Many tests failed after I switched from 
AcidUtils.parseBaseOrDeltaBucketFilename() to 
AcidUtils.ParsedDeltaLight.parse(). As I understand it, the split is not 
always in a delta folder; it can be in some older format that 
ParsedDeltaLight does not support. I saw that 
AcidUtils.parseBaseOrDeltaBucketFilename() uses ParsedDeltaLight.parse() 
internally in some cases, but not always. Could you please advise whether I 
should revert to AcidUtils.parseBaseOrDeltaBucketFilename(), which worked in 
all cases, or whether there is a better way?
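
For illustration only, one possible shape of a guard that checks the parent directory name before attempting a delta parse. This is a sketch, not the change in this PR: the class `SplitDirSketch`, the enum `Kind`, and both methods are hypothetical, and the prefix strings are assumed to match the directory prefixes used by Hive ACID layouts.

```java
// Hypothetical helper, not part of Hive: classifies the parent directory of a
// split by name so that a delta-only parser is called only when it applies.
final class SplitDirSketch {
    // Assumed directory-name prefixes for ACID layouts.
    static final String BASE_PREFIX = "base_";
    static final String DELTA_PREFIX = "delta_";
    static final String DELETE_DELTA_PREFIX = "delete_delta_";

    enum Kind { BASE, DELTA, DELETE_DELTA, OTHER }

    // Decide what kind of directory a split lives in, based on its name only.
    static Kind classify(String parentDirName) {
        if (parentDirName.startsWith(DELETE_DELTA_PREFIX)) {
            return Kind.DELETE_DELTA;
        }
        if (parentDirName.startsWith(DELTA_PREFIX)) {
            return Kind.DELTA;
        }
        if (parentDirName.startsWith(BASE_PREFIX)) {
            return Kind.BASE;
        }
        // Anything else (for example, an original non-ACID layout) is not
        // something a delta-only parser should be asked to handle.
        return Kind.OTHER;
    }

    // True only when a delta-only parser would be applicable.
    static boolean isDeltaLike(String parentDirName) {
        Kind kind = classify(parentDirName);
        return kind == Kind.DELTA || kind == Kind.DELETE_DELTA;
    }

    public static void main(String[] args) {
        String[] names = {
            "delta_0000001_0000001", "delete_delta_0000002_0000002",
            "base_0000005", "000000_0"
        };
        for (String name : names) {
            System.out.println(name + " -> " + classify(name)
                + ", delta-like: " + isDeltaLike(name));
        }
    }
}
```

With a check like this, the delta-only parse would run only when `isDeltaLike(...)` is true, and the more general parsing path would handle everything else.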



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

