[ 
https://issues.apache.org/jira/browse/TEZ-3336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15373690#comment-15373690
 ] 

Jason Lowe commented on TEZ-3336:
---------------------------------

Seems like one fix would be to simply have the MR input initializers ignore 
events rather than explode.  I'm guessing those initializers do not care at all 
about what anything else is doing -- they just want to compute splits based 
purely on the MR input.

> Hive map-side join job sometimes fails with ROOT_INPUT_INIT_FAILURE
> -------------------------------------------------------------------
>
>                 Key: TEZ-3336
>                 URL: https://issues.apache.org/jira/browse/TEZ-3336
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.1
>            Reporter: Jason Lowe
>
> When Hive does a map-side join it can generate a DAG where a vertex has two 
> inputs, one from an upstream task and another using MRInputAMSplitGenerator.  
> If it takes a while for MRInputAMSplitGenerator to compute the splits and one 
> of the tasks for the other upstream vertex completes then the job can fail 
> with an error since MRInputAMSplitGenerator does not expect to receive any 
> events.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to