[ 
https://issues.apache.org/jira/browse/HIVE-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joydeep Sen Sarma resolved HIVE-110.
------------------------------------

    Resolution: Duplicate

> table/partition compaction utility
> ----------------------------------
>
>                 Key: HIVE-110
>                 URL: https://issues.apache.org/jira/browse/HIVE-110
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Joydeep Sen Sarma
>
> in many cases, we cannot help producing a large number of output files in a 
> final result table. This can put pressure on the namenode and as well makes 
> downstream queries potentially inefficient (until we have 
> combinefileinputformat). Users would like to have a simple utility in hive to 
> reduce the number of files by concatenating them together. This should apply 
> on a per partition basis.
> Long term - Hive should do this kind of compaction automatically - either 
> online or offline.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to