[
https://issues.apache.org/jira/browse/HIVE-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan Wu updated HIVE-1355:
--------------------------
Description:
* see https://issues.apache.org/jira/browse/MAPREDUCE-1802
hive doesn't depend on hadoop job output folder. it produces output exclusively
via side effect folders. we should use an outputformat that can request hadoop
skip cleanup/setup. this could be nulloutputformat (unless there are any
objections in hadoop to changing nulloutputformat behavior).
as a small side effect, it also avoids some totally unnecessary hdfs file
creates and deletes in hdfs.
was:
see https://issues.apache.org/jira/browse/MAPREDUCE-1802
hive doesn't depend on hadoop job output folder. it produces output exclusively
via side effect folders. we should use an outputformat that can request hadoop
skip cleanup/setup. this could be nulloutputformat (unless there are any
objections in hadoop to changing nulloutputformat behavior).
as a small side effect, it also avoids some totally unnecessary hdfs file
creates and deletes in hdfs.
> Hive should use NullOutputFormat for hadoop jobs
> ------------------------------------------------
>
> Key: HIVE-1355
> URL: https://issues.apache.org/jira/browse/HIVE-1355
> Project: Hive
> Issue Type: Improvement
> Components: Query Processor
> Reporter: Joydeep Sen Sarma
> Assignee: Joydeep Sen Sarma
> Priority: Major
> Fix For: 0.6.0
>
> Attachments: 1355.1.patch
>
>
> * see https://issues.apache.org/jira/browse/MAPREDUCE-1802
> hive doesn't depend on hadoop job output folder. it produces output
> exclusively via side effect folders. we should use an outputformat that can
> request hadoop skip cleanup/setup. this could be nulloutputformat (unless
> there are any objections in hadoop to changing nulloutputformat behavior).
> as a small side effect, it also avoids some totally unnecessary hdfs file
> creates and deletes in hdfs.
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)