[ https://issues.apache.org/jira/browse/TEZ-3271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15610181#comment-15610181 ]
Hitesh Shah commented on TEZ-3271: ---------------------------------- bq. This function throws TezException and IOException. Let me what the right thing to do in this particular situation. The function could do explicit throws for both of the exceptions. On the VertexImpl, a generic Exception catch would make sense to ensure that we dont hit a dispatcher error. {code} Iterator<TezTaskAttemptID> attempts = task.getAttempts().keySet().iterator(); 2150 while (attempts.hasNext()) { 2151 attempt = attempts.next(); 2152 } {code} - Relies on TaskImpl's impl of addAndScheduleAttempt() but you could just use size of the map to decipher the last attempt. TestVertexImpl changes: - createInvalidDAGPlan not needed? - "Assert.assertEquals(2, v6.numSuccessSourceAttemptCompletions);" - shoudnt this get 4 completions given that v6 expects task attempt completions for both tasks in v4 and v5? - Likewise for "testFailuresMaxPercentExceededSourceTaskAttemptCompletionEvents" - I would assume this will get 2 for tasks of v5? Sorry for the last minute review comment on testVertexFailuresMaxPercent - should we be using a 2-vertex DAG with the first vertex having a threshold and verifying that the second vertex ( using a shuffle edge ) completes successfully ? > Provide mapreduce failures.maxpercent equivalent > ------------------------------------------------ > > Key: TEZ-3271 > URL: https://issues.apache.org/jira/browse/TEZ-3271 > Project: Apache Tez > Issue Type: New Feature > Reporter: Jonathan Eagles > Assignee: Jonathan Eagles > Attachments: Succeeded with Failures.png, TEZ-3271.1.patch, > TEZ-3271.2.patch, TEZ-3271.3.patch, TEZ-3271.4.patch, TEZ-3271.5.patch, > TEZ-3271.6.patch, TEZ-3271.7.patch, TEZ-3271.8.patch, TEZ-3271.9.patch > > > There is a certain category of work that need not have 100% of tasks succeed > to cause the work to be considered a success. To meet that end, I propose we > provide a tez equivalent of mapreduce.map.failures.maxpercent and > mapreduce.reduce.failures.maxpercent. In this way a vertex will be considered > a success if the number of failures is below a configured threshold. -- This message was sent by Atlassian JIRA (v6.3.4#6332)