[ 
https://issues.apache.org/jira/browse/HIVE-25849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marton Bod updated HIVE-25849:
------------------------------
    Description: 
Insert overwrite should be disabled where the target Iceberg table is a bucket 
partitioned table, since which existing partitions will be overwritten is very 
hard to predict from a user's POV, as it depends on the bucket hash values 
calculated for the new dataset's rows. It's better to be on the safe side and 
disable this operation to avoid unwanted data loss.

Note: this the same approach followed by Impala too.

  was:Insert overwrite should be disabled where the target Iceberg table is a 
bucket partitioned table, since which existing partitions will be overwritten 
is very hard to predict from a user's POV, as it depends on the bucket hash 
values calculated for the new dataset's rows. It's better to be on the safe 
side and disable this operation to avoid unwanted data loss.


> Disable insert overwrite for bucket partitioned Iceberg tables
> --------------------------------------------------------------
>
>                 Key: HIVE-25849
>                 URL: https://issues.apache.org/jira/browse/HIVE-25849
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Marton Bod
>            Assignee: Marton Bod
>            Priority: Major
>
> Insert overwrite should be disabled where the target Iceberg table is a 
> bucket partitioned table, since which existing partitions will be overwritten 
> is very hard to predict from a user's POV, as it depends on the bucket hash 
> values calculated for the new dataset's rows. It's better to be on the safe 
> side and disable this operation to avoid unwanted data loss.
> Note: this the same approach followed by Impala too.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to