[
https://issues.apache.org/jira/browse/HIVE-9490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14951442#comment-14951442
]
Ryan Blue commented on HIVE-9490:
---------------------------------
There's a patch available on PARQUET-382 that implements this for Parquet. Hive
would just need to take advantage of that.
> [Parquet] Support Alter Table/Partition Concatenate
> ---------------------------------------------------
>
> Key: HIVE-9490
> URL: https://issues.apache.org/jira/browse/HIVE-9490
> Project: Hive
> Issue Type: Sub-task
> Reporter: Dong Chen
> Assignee: Dong Chen
> Attachments: HIVE-9490.patch-testcase
>
>
> Parquet should support
> {{ALTER TABLE table_name \[PARTITION (partition_key = 'partition_value')\]
> CONCATENATE;}}
> If the table or partition contains many small Parquet files, then the above
> command will merge them into larger files. The merge should happen at row
> group level thereby avoiding the overhead of decompressing and decoding the
> data.
> It is only supported by RCFiles or ORCFiles now.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)