[
https://issues.apache.org/jira/browse/HIVE-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joydeep Sen Sarma resolved HIVE-110.
------------------------------------
Resolution: Duplicate
> table/partition compaction utility
> ----------------------------------
>
> Key: HIVE-110
> URL: https://issues.apache.org/jira/browse/HIVE-110
> Project: Hadoop Hive
> Issue Type: Bug
> Reporter: Joydeep Sen Sarma
>
> in many cases, we cannot help producing a large number of output files in a
> final result table. This can put pressure on the namenode and as well makes
> downstream queries potentially inefficient (until we have
> combinefileinputformat). Users would like to have a simple utility in hive to
> reduce the number of files by concatenating them together. This should apply
> on a per partition basis.
> Long term - Hive should do this kind of compaction automatically - either
> online or offline.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.