[
https://issues.apache.org/jira/browse/HIVE-26319?focusedWorklogId=780788&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-780788
]
ASF GitHub Bot logged work on HIVE-26319:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 13/Jun/22 11:46
Start Date: 13/Jun/22 11:46
Worklog Time Spent: 10m
Work Description: kasakrisz opened a new pull request, #3362:
URL: https://github.com/apache/hive/pull/3362
### What changes were proposed in this pull request?
Rewrite update statements of iceberg tables to multi insert statement
similarly in case of native acid tables.
When generating the rewritten statement:
* Get the virtual columns from the table's storage handler in case of non
native acid tables
* Include the old values to the select clause of the delete branch of the
multi insert statement.
When executing the multi insert:
* Two iceberg writers are used which produce a data delta file and a delete
delta file. The result of these writers should be merged into one
`FilesForCommit` if both writers are run in the same task.
* In case of more complex statements (ex. partitioned and/or bucketed) more
than one Tez task produces commit info so this patch enables storing all of
them.
* Every `FileSinkOperator` creates its own jobConf instance because the
iceberg write operation is stored in it and it is different in both instance.
### Why are the changes needed?
See #2855
+ Preparation for iceberg Merge implementation.
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
```
mvn test -Dtest.output.overwrite -DskipSparkTests
-Dtest=TestIcebergLlapLocalCliDriver -Dqfile=update_iceberg_partitioned_orc2.q
-pl itests/qtest-iceberg -Piceberg -Pitests
```
Issue Time Tracking
-------------------
Worklog Id: (was: 780788)
Remaining Estimate: 0h
Time Spent: 10m
> Iceberg integration: Perform update split early
> -----------------------------------------------
>
> Key: HIVE-26319
> URL: https://issues.apache.org/jira/browse/HIVE-26319
> Project: Hive
> Issue Type: Improvement
> Reporter: Krisztian Kasa
> Assignee: Krisztian Kasa
> Priority: Major
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Extend update split early to iceberg tables like in HIVE-21160 for native
> acid tables
--
This message was sent by Atlassian Jira
(v8.20.7#820007)