[GitHub] [hive] pvary commented on a change in pull request #3131: HIVE-26102: Implement DELETE statements for Iceberg tables

GitBox Fri, 01 Apr 2022 05:31:57 -0700


pvary commented on a change in pull request #3131:
URL: https://github.com/apache/hive/pull/3131#discussion_r840539431




##########
File path: iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/mapreduce/IcebergInputFormat.java
##########
@@ -261,6 +268,13 @@ public boolean nextKeyValue() throws IOException {
       while (true) {
         if (currentIterator.hasNext()) {
           current = currentIterator.next();
+          Configuration conf = context.getConfiguration();
+          if (HiveIcebergStorageHandler.isDelete(conf, conf.get(Catalogs.NAME))) {
+            if (current instanceof GenericRecord) {
+              PositionDeleteInfo pdi = IcebergAcidUtil.parsePositionDeleteInfoFromRecord((GenericRecord) current);
+              PositionDeleteInfo.serializeIntoConf(conf, pdi);

Review comment:
One issue is that the PDI contains the position info too, though that should not change anything.
My main issue here is that we do the parsing and serialization for every record, which will be resource intensive.
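
One possible way to reduce the per-record cost, sketched here under stated assumptions rather than as the approach taken in this PR: write the file-level values to the configuration only when they change, and carry the per-record position separately. The class `DeleteInfoPublisher`, the `sketch.delete.*` keys, and the use of a plain `Map` in place of a Hadoop `Configuration` are all hypothetical and exist only to keep the example self-contained.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper for illustration; not part of Hive or Iceberg.
class DeleteInfoPublisher {
  // Stand-in for a Hadoop Configuration; the keys used below are made up.
  private final Map<String, String> conf;
  private boolean published;
  private int lastSpecId;
  private long lastPartitionHash;
  private String lastFilePath;
  private int writes;

  DeleteInfoPublisher(Map<String, String> conf) {
    this.conf = conf;
  }

  // Called once per record. Writes to conf only when the file-level values
  // differ from the ones written last time; the row position is assumed to
  // travel with the record itself and is not handled here.
  void onRecord(int specId, long partitionHash, String filePath) {
    if (published && specId == lastSpecId && partitionHash == lastPartitionHash
        && filePath.equals(lastFilePath)) {
      return;
    }
    conf.put("sketch.delete.spec-id", Integer.toString(specId));
    conf.put("sketch.delete.partition-hash", Long.toString(partitionHash));
    conf.put("sketch.delete.file-path", filePath);
    lastSpecId = specId;
    lastPartitionHash = partitionHash;
    lastFilePath = filePath;
    published = true;
    writes++;
  }

  // Number of times the file-level values were actually written to conf.
  int writes() {
    return writes;
  }

  public static void main(String[] args) {
    DeleteInfoPublisher publisher = new DeleteInfoPublisher(new HashMap<>());
    for (int i = 0; i < 3; i++) {
      publisher.onRecord(0, 7L, "a.parquet");
    }
    for (int i = 0; i < 2; i++) {
      publisher.onRecord(0, 7L, "b.parquet");
    }
    System.out.println(publisher.writes()); // prints 2: one write per data file
  }
}
```

Whether this helps in practice depends on how many consecutive records share the same data file, which is an assumption about the read pattern and not something this sketch measures.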




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


