[ https://issues.apache.org/jira/browse/TEZ-4062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jonathan Eagles updated TEZ-4062:
---------------------------------
    Attachment: TEZ-4062.001.patch

> Speculative attempt scheduling should be aborted when Task has completed
> ------------------------------------------------------------------------
>
>                 Key: TEZ-4062
>                 URL: https://issues.apache.org/jira/browse/TEZ-4062
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Yingda Chen
>            Assignee: Ying Han
>            Priority: Major
>         Attachments: TEZ-4062.001.patch
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> In RedundantScheduleTransition (inside TaskImpl), we try to find the oldest
> running attempt and use it as the causal attempt when calling
> "addAndScheduleAttempt".
>
> However, the task may have completed by this point, i.e., the task attempt
> that the speculator considered running and long-tailed has now completed.
> In this case, we may not find any unfinished attempt, which leads to an
> NPE in the subsequent logic. (Even without the NPE, it makes no sense to
> proceed with scheduling a speculative attempt.)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
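
The guard described in the issue can be illustrated with a minimal, self-contained sketch. It is not the attached patch and does not use the real Tez classes: Attempt, Task, oldestUnfinished and scheduleSpeculativeAttempt are hypothetical stand-ins, and the completed flag is an assumed simplification of the task state. The sketch only shows the idea of aborting speculation when the task has completed or no unfinished attempt is left to serve as the causal attempt.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Optional;

// Hypothetical stand-ins; these are NOT the real Tez TaskImpl/TaskAttempt classes.
public class SpeculationGuardSketch {

    /** Minimal attempt model: launch time plus a finished flag. */
    static final class Attempt {
        final int id;
        final long launchTime;
        final boolean finished;

        Attempt(int id, long launchTime, boolean finished) {
            this.id = id;
            this.launchTime = launchTime;
            this.finished = finished;
        }
    }

    /** Minimal task model that records the attempts scheduled so far. */
    static final class Task {
        final List<Attempt> attempts = new ArrayList<>();
        boolean completed;
        int nextAttemptId;

        Attempt addAttempt(long launchTime, boolean finished) {
            Attempt a = new Attempt(nextAttemptId++, launchTime, finished);
            attempts.add(a);
            return a;
        }
    }

    /** Returns the oldest attempt that has not finished, if any. */
    static Optional<Attempt> oldestUnfinished(Task task) {
        return task.attempts.stream()
                .filter(a -> !a.finished)
                .min(Comparator.comparingLong(a -> a.launchTime));
    }

    /**
     * Schedules a speculative attempt only when the task is still running
     * and an unfinished attempt exists to act as the causal attempt.
     * Returns true if a new attempt was scheduled, false if aborted.
     */
    static boolean scheduleSpeculativeAttempt(Task task, long now) {
        if (task.completed) {
            return false; // task already done: speculation is pointless
        }
        Optional<Attempt> causal = oldestUnfinished(task);
        if (!causal.isPresent()) {
            return false; // no causal attempt; avoids dereferencing a missing one
        }
        task.addAttempt(now, false);
        return true;
    }
}
```

In this sketch, calling scheduleSpeculativeAttempt on a task with one unfinished attempt adds a second attempt, while calling it on a completed task, or on a task whose attempts have all finished, returns false and leaves the attempt list unchanged.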