[ 
https://issues.apache.org/jira/browse/HIVE-27850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17782666#comment-17782666
 ] 

Dmitriy Fingerman commented on HIVE-27850:
------------------------------------------

Hi [~pvary],

Nice to e-meet you, I am from Hive team at Cloudera. I've being working on 
Iceberg Compaction feature for some time (currently, on the Major Compaction) 
by extending the support for Iceberg in Hive ACID compaction framework.

So far I added support for manual submission of major compaction request for 
Iceberg tables using alter table command and implemented Iceberg Major 
compaction worker that executes a Hive query which gets committed with 
truncating the Iceberg table and adding new compacted data files in the same 
transaction using Iceberg API.

I was told it is possible that you are also working on the same feature. I 
wanted to reach out to you to check and coordinate to make sure that we are not 
working on the same thing. Have you done anything on this task? What is your 
plan for implementing Iceberg compaction?

> Compaction for Iceberg tables
> -----------------------------
>
>                 Key: HIVE-27850
>                 URL: https://issues.apache.org/jira/browse/HIVE-27850
>             Project: Hive
>          Issue Type: New Feature
>          Components: Iceberg integration
>            Reporter: Dmitriy Fingerman
>            Assignee: Dmitriy Fingerman
>            Priority: Major
>
> Hive currently doesn't have the table compaction functionality. It would be 
> highly beneficial for performance to implement this feature because this 
> would create larger data files and eliminate positional delete files.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to