[
https://issues.apache.org/jira/browse/TEZ-3336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15375941#comment-15375941
]
Mithun Radhakrishnan commented on TEZ-3336:
-------------------------------------------
bq. It has it's own SplitGenerator which is based on MRInputSplitGeneartor
The {{HiveSplitGenerator}}. I'll trace this code and report back. I guess I'm
not clear on how {{MRInputSplitGenerator::handleInputInitializerEvent()}} was
called at all.
I should mention that this case was using {{CombineHiveInputFormat}}, which
should've circumvented Tez split-grouping.
> Hive map-side join job sometimes fails with ROOT_INPUT_INIT_FAILURE
> -------------------------------------------------------------------
>
> Key: TEZ-3336
> URL: https://issues.apache.org/jira/browse/TEZ-3336
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.7.1
> Reporter: Jason Lowe
>
> When Hive does a map-side join it can generate a DAG where a vertex has two
> inputs, one from an upstream task and another using MRInputAMSplitGenerator.
> If it takes a while for MRInputAMSplitGenerator to compute the splits and one
> of the tasks for the other upstream vertex completes then the job can fail
> with an error since MRInputAMSplitGenerator does not expect to receive any
> events.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)