[ 
https://issues.apache.org/jira/browse/TEZ-2726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14700369#comment-14700369
 ] 

Rajesh Balamohan commented on TEZ-2726:
---------------------------------------

[~saikatr] - Is there any repro for this? When you say invalid headers, is it 
something like the following? Can you plz provide more info?

{noformat}
org.apache.tez.runtime.library.common.shuffle.impl.Fetcher: Invalid map id
java.lang.IllegalArgumentException: Invalid header received: 
W^s??.attempt_1399351577718_4169_1_ partition: 95
{noformat}

If so, are you using "tez.runtime.intermediate-output.compress.codec = 
org.apache.hadoop.io.compress.DefaultCodec" ?

> Handle invalid number of partitions for SCATTER-GATHER edge
> -----------------------------------------------------------
>
>                 Key: TEZ-2726
>                 URL: https://issues.apache.org/jira/browse/TEZ-2726
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Saikat
>            Assignee: Saikat
>
> Encountered an issue where the source vertex has M task and sink vertex has N 
> tasks (N > M), [e.g. M = 1, N = 3]and the edge is of type SCATTER -GATHER.
> This resulted in sink vertex receiving DMEs with non existent targetIds.
> The fetchers for the sink vertex tasks then try to retrieve the map outputs 
> and retrieve invalid headers due to exception in the ShuffleHandler.
> Possible fixes:
> 1. raise proper Tez Exception to indicate this invalid scenario.
> 2. or write appropriate empty partition bits, for the missing partitions 
> before sending out the DMEs to sink vertex. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to