[ 
https://issues.apache.org/jira/browse/TEZ-3271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15610181#comment-15610181
 ] 

Hitesh Shah commented on TEZ-3271:
----------------------------------

bq. This function throws TezException and IOException. Let me know what the right 
thing to do is in this particular situation.

The function could declare explicit throws for both exceptions. In VertexImpl, 
a generic Exception catch would make sense to ensure that we don't hit a 
dispatcher error.
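
A minimal, self-contained sketch of that pattern follows. The class and method names (DispatcherGuardSketch, sendCompletionEvents, handleTransition) are hypothetical and are not taken from the patch; TezException is a local stand-in so the example compiles without Tez on the classpath.

```java
import java.io.IOException;

public class DispatcherGuardSketch {

  // Local stand-in for Tez's TezException so the sketch compiles on its own.
  static class TezException extends Exception {
    TezException(String message) {
      super(message);
    }
  }

  // Hypothetical callee: declares both checked exceptions explicitly
  // instead of a blanket "throws Exception".
  static void sendCompletionEvents(boolean failWithTez, boolean failWithIo)
      throws TezException, IOException {
    if (failWithTez) {
      throw new TezException("simulated TezException");
    }
    if (failWithIo) {
      throw new IOException("simulated IOException");
    }
  }

  // Hypothetical caller standing in for the VertexImpl side. The generic
  // catch turns any failure (checked or unchecked) into an error result
  // rather than letting it escape to the event dispatcher.
  static String handleTransition(boolean failWithTez, boolean failWithIo,
      boolean failUnchecked) {
    try {
      if (failUnchecked) {
        throw new IllegalStateException("simulated unchecked failure");
      }
      sendCompletionEvents(failWithTez, failWithIo);
      return "OK";
    } catch (Exception e) {
      return "VERTEX_ERROR: " + e.getClass().getSimpleName();
    }
  }

  public static void main(String[] args) {
    System.out.println(handleTransition(false, false, false));
    System.out.println(handleTransition(true, false, false));
    System.out.println(handleTransition(false, true, false));
    System.out.println(handleTransition(false, false, true));
  }
}
```

In the real code the catch block would presumably move the vertex to an error state; the string return value here is only a placeholder for that.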

{code}
Iterator<TezTaskAttemptID> attempts = task.getAttempts().keySet().iterator();
while (attempts.hasNext()) {
  attempt = attempts.next();
}
{code}
  - This relies on TaskImpl's implementation of addAndScheduleAttempt(), but you 
could just use the size of the map to determine the last attempt.
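
A rough sketch of the size-based alternative, under the assumption that attempts are numbered sequentially from 0 in the order they are scheduled. The class LastAttemptSketch and its plain Integer keys are hypothetical stand-ins for the attempts map and TezTaskAttemptID; the iteration variant is included only for comparison.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class LastAttemptSketch {

  // Size-based lookup. Assumes attempt numbers are 0..n-1 with no gaps,
  // so the last attempt number is simply size - 1.
  static int lastAttemptNumber(Map<Integer, ?> attempts) {
    if (attempts.isEmpty()) {
      throw new IllegalStateException("no attempts scheduled");
    }
    return attempts.size() - 1;
  }

  // Iteration-based lookup, mirroring the loop in the patch. The result
  // depends on the iteration order of the map.
  static int lastAttemptByIteration(Map<Integer, ?> attempts) {
    int last = -1;
    for (Integer id : attempts.keySet()) {
      last = id;
    }
    return last;
  }

  public static void main(String[] args) {
    Map<Integer, String> attempts = new LinkedHashMap<>();
    attempts.put(0, "attempt_0");
    attempts.put(1, "attempt_1");
    attempts.put(2, "attempt_2");
    System.out.println(lastAttemptNumber(attempts));
    System.out.println(lastAttemptByIteration(attempts));
  }
}
```

If the sequential-numbering assumption does not hold in TaskImpl, the size-based version would not be safe and the comparison above no longer applies.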

TestVertexImpl changes:
  - createInvalidDAGPlan not needed?
  - "Assert.assertEquals(2, v6.numSuccessSourceAttemptCompletions);" - shouldn't 
this get 4 completions, given that v6 expects task attempt completions for both 
tasks in v4 and v5?
  - Likewise for 
"testFailuresMaxPercentExceededSourceTaskAttemptCompletionEvents" - I would 
assume this will get 2 for tasks of v5? 

Sorry for the last-minute review comment on testVertexFailuresMaxPercent: 
should we be using a 2-vertex DAG with the first vertex having a threshold and 
verifying that the second vertex (using a shuffle edge) completes 
successfully?
 



> Provide mapreduce failures.maxpercent equivalent
> ------------------------------------------------
>
>                 Key: TEZ-3271
>                 URL: https://issues.apache.org/jira/browse/TEZ-3271
>             Project: Apache Tez
>          Issue Type: New Feature
>            Reporter: Jonathan Eagles
>            Assignee: Jonathan Eagles
>         Attachments: Succeeded with Failures.png, TEZ-3271.1.patch, 
> TEZ-3271.2.patch, TEZ-3271.3.patch, TEZ-3271.4.patch, TEZ-3271.5.patch, 
> TEZ-3271.6.patch, TEZ-3271.7.patch, TEZ-3271.8.patch, TEZ-3271.9.patch
>
>
> There is a certain category of work that need not have 100% of tasks succeed 
> to cause the work to be considered a success. To meet that end, I propose we 
> provide a tez equivalent of mapreduce.map.failures.maxpercent and 
> mapreduce.reduce.failures.maxpercent. In this way a vertex will be considered 
> a success if the number of failures is below a configured threshold.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
