[ 
https://issues.apache.org/jira/browse/TEZ-2193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363512#comment-14363512
 ] 

Bikas Saha commented on TEZ-2193:
---------------------------------

You are right about numConsumerTasks.

The message for ScatterGather could be more informative than the current one. 
Sorry I should have made this comment earlier.
{code}PhysicalOutput should not be negative for ScatterGatherEdge{code}
Perhaps, something like - "scatter gather edge manager must have destination 
vertex task parallelism specified"

The Precondition will almost always be true, but in order to call the 
Precondition we will be assembling 4-5 strings for the message argument, which 
is unnecessary. This can be a hot code path during job startup for 10K or more 
mappers, when all their outputs are being set up on the single dispatcher thread. 
So optimizing a little bit would help. Maybe pre-assemble part of the string, or 
we could do the check in an if statement and throw in its body.
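
To illustrate the if-statement option, here is a minimal sketch. It is not taken 
from the patch: the class name, method names, and parameters are hypothetical, 
and IllegalStateException is only an assumed exception type. The first method 
builds the message on every call, the second builds it only on failure.

```java
// Hypothetical sketch; names and exception type are assumptions, not the TEZ-2193 patch.
final class PhysicalOutputCheck {
    private PhysicalOutputCheck() {}

    // Eager variant: the message is concatenated on every call,
    // even though the check almost always passes.
    static void checkEager(int numPhysicalOutputs, String srcVertex, String destVertex) {
        String message = "ScatterGather edge manager must have destination vertex task"
            + " parallelism specified, srcVertex=" + srcVertex
            + ", destVertex=" + destVertex
            + ", numPhysicalOutputs=" + numPhysicalOutputs;
        if (numPhysicalOutputs < 0) {
            throw new IllegalStateException(message);
        }
    }

    // Lazy variant: the common path is a single integer comparison;
    // the strings are assembled only when the check fails.
    static void checkLazy(int numPhysicalOutputs, String srcVertex, String destVertex) {
        if (numPhysicalOutputs < 0) {
            throw new IllegalStateException(
                "ScatterGather edge manager must have destination vertex task"
                + " parallelism specified, srcVertex=" + srcVertex
                + ", destVertex=" + destVertex
                + ", numPhysicalOutputs=" + numPhysicalOutputs);
        }
    }
}
```

Both variants reject the same inputs; only the cost of the passing case differs. 
Guava's Preconditions also has overloads that take a message template plus 
arguments, which likewise defer formatting until the check fails.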

Looks good. +1. No need for another review after making changes for the above 
comments. This could potentially go all the way to the 0.5 branch.
Thanks!


> Check returned value from EdgeManagerPlugin before using it
> -----------------------------------------------------------
>
>                 Key: TEZ-2193
>                 URL: https://issues.apache.org/jira/browse/TEZ-2193
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Jeff Zhang
>            Assignee: Jeff Zhang
>         Attachments: TEZ-2193-1.patch, TEZ-2193-2.patch, TEZ-2193-3.patch, 
> TEZ-2193-4.patch
>
>
> e.g. a DAG has vertices v1 and v2 with a shuffle edge between them, and v2 
> has a custom vertex manager and -1 parallelism. In this case v1's output spec 
> may have -1 physical edges, which will cause the task to hang in TezChild.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
