[
https://issues.apache.org/jira/browse/HIVE-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mithun Radhakrishnan updated HIVE-11456:
----------------------------------------
Attachment: HIVE-11456.1.patch
The fix is attached. The hard-coded {{part}} prefix in the output file names
has been replaced with the value of the {{mapreduce.output.basename}}
configuration setting.
Sorry for the delay.
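
For illustration only, here is a minimal, self-contained sketch of the idea
(not the patch itself). It assumes a plain Map standing in for the job
configuration; the class and method names are hypothetical, and the second
vertex's basename (part-v001-o000) is an assumed example value. The default
of "part" and the <basename>-<m|r>-<5-digit id> naming mirror what the stock
Hadoop FileOutputFormat does.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in: a plain Map plays the role of the job configuration.
final class OutputBasenameSketch {
    static final String BASE_OUTPUT_NAME = "mapreduce.output.basename";
    static final String DEFAULT_BASENAME = "part";

    // Look up the configured basename, falling back to "part" when unset.
    static String outputBasename(Map<String, String> conf) {
        return conf.getOrDefault(BASE_OUTPUT_NAME, DEFAULT_BASENAME);
    }

    // Build a name of the form <basename>-<m|r>-<5-digit task id>.
    static String fileName(Map<String, String> conf, char taskType, int taskId) {
        return String.format("%s-%c-%05d", outputBasename(conf), taskType, taskId);
    }

    public static void main(String[] args) {
        // Two vertices writing to the same output directory (assumed values).
        Map<String, String> vertexA = new HashMap<>();
        vertexA.put(BASE_OUTPUT_NAME, "part-v000-o000");
        Map<String, String> vertexB = new HashMap<>();
        vertexB.put(BASE_OUTPUT_NAME, "part-v001-o000");

        // Honoring the setting yields distinct names for the same task id.
        System.out.println(fileName(vertexA, 'r', 0));
        System.out.println(fileName(vertexB, 'r', 0));

        // Ignoring the setting (an empty conf here) collapses both to "part".
        System.out.println(fileName(new HashMap<>(), 'r', 0));
    }
}
```

With the setting honored, task 0 of each vertex gets its own file name; with
it ignored, both fall back to the same part-r-00000 name, which is the
collision described in the issue.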
> HCatStorer should honor mapreduce.output.basename
> -------------------------------------------------
>
> Key: HIVE-11456
> URL: https://issues.apache.org/jira/browse/HIVE-11456
> Project: Hive
> Issue Type: Bug
> Affects Versions: 1.2.0
> Reporter: Rohini Palaniswamy
> Assignee: Mithun Radhakrishnan
> Priority: Critical
> Attachments: HIVE-11456.1.patch
>
>
> Pig on Tez scripts in which a union is directly followed by HCatStorer have
> a problem: HCatStorer does not honor mapreduce.output.basename and always
> uses part. Tez sets mapreduce.output.basename to part-v000-o000 (vertex id
> followed by output id). With the union optimizer, Pig uses vertex groups to
> write directly from both vertices to the final output directory. Since
> HCatalog ignores mapreduce.output.basename, both vertices produce
> part-r-0000<n>, and when the files are moved from the temp location to the
> final directory they overwrite each other. No failure is reported, and only
> one of the files with a given name makes it into the final directory.