[ https://issues.apache.org/jira/browse/MAPREDUCE-6898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16052325#comment-16052325 ]
Peter Bacsko commented on MAPREDUCE-6898:
-----------------------------------------

[~jlowe] I just realized that this problem has already been solved by Haibo Chen:

https://github.com/apache/hadoop/commit/0cfd7ad21f4457513ed3416e5d77f3123bfe9da0

Originally we detected this problem on a different branch and when I rebased my changes against apache/trunk I didn't see that there is already a solution :)

You might want to revert this commit.

> TestKill.testKillTask is flaky
> ------------------------------
>
>                 Key: MAPREDUCE-6898
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6898
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: client, test
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>             Fix For: 2.9.0, 3.0.0-alpha4, 2.8.2
>
>         Attachments: MAPREDUCE-6898-001.patch
>
>
> TestKill.testKillTask() can fail if the async dispatcher thread is slower than the test's thread.
> {noformat}
> 2017-05-26 11:43:26,532 INFO  [AsyncDispatcher event handler] impl.JobImpl (JobImpl.java:handle(1006)) - job_0_0000Job Transitioned from INITED to SETUP
> Job State is : RUNNING
> Job State is : RUNNING Waiting for state : SUCCEEDED   map progress : 0.0   reduce progress : 0.0
> 2017-05-26 11:43:26,538 INFO  [CommitterEvent Processor #0] commit.CommitterEventHandler (CommitterEventHandler.java:run(231)) - Processing the event EventType: JOB_SETUP
> 2017-05-26 11:43:26,540 INFO  [AsyncDispatcher event handler] impl.TaskImpl (TaskImpl.java:handle(661)) - task_0_0000_m_000000 Task Transitioned from NEW to KILLED
> 2017-05-26 11:43:26,540 ERROR [AsyncDispatcher event handler] impl.JobImpl (JobImpl.java:handle(998)) - Can't handle this event at current state
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: JOB_TASK_COMPLETED at SETUP
>     at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
>     at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
>     at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
>     at org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl.handle(JobImpl.java:996)
>     at org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl.handle(JobImpl.java:138)
>     at org.apache.hadoop.mapreduce.v2.app.MRAppMaster$JobEventDispatcher.handle(MRAppMaster.java:1366)
>     at org.apache.hadoop.mapreduce.v2.app.MRAppMaster$JobEventDispatcher.handle(MRAppMaster.java:1362)
>     at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:182)
>     at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:109)
>     at java.lang.Thread.run(Thread.java:745)
> 2017-05-26 11:43:26,541 INFO  [AsyncDispatcher event handler] impl.JobImpl (JobImpl.java:handle(1006)) - job_0_0000Job Transitioned from SETUP to ERROR
> 2017-05-26 11:43:26,542 INFO  [AsyncDispatcher event handler] app.MRAppMaster (MRAppMaster.java:serviceStop(978)) - Skipping cleaning up the staging dir. assuming AM will be retried.
> {noformat}
> We have to wait until the job's internal state is {{JobInternalState.RUNNING}} and not {{JobInternalState.SETUP}}.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org
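
The race described in the issue (the test thread acting while the dispatcher thread has only moved the job to SETUP) can be illustrated with a small, self-contained sketch. It does not use any Hadoop classes: the `InternalState` enum, the `waitForState` helper, and the simulated dispatcher thread are all hypothetical stand-ins, and the actual fix is the one in the commit linked above. The sketch only shows the general idea of polling the internal state, with a timeout, before sending the kill.

```java
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Supplier;

public class WaitForStateSketch {

  // Hypothetical stand-in for the job's internal states seen in the log.
  enum InternalState { INITED, SETUP, RUNNING, ERROR }

  /**
   * Polls the supplier until it reports the expected state, or throws
   * TimeoutException once timeoutMs has elapsed.
   */
  static <T> void waitForState(Supplier<T> current, T expected,
      long timeoutMs, long pollMs)
      throws InterruptedException, TimeoutException {
    long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
    while (!expected.equals(current.get())) {
      if (System.nanoTime() - deadline >= 0) {
        throw new TimeoutException(
            "Still " + current.get() + ", expected " + expected);
      }
      Thread.sleep(pollMs);
    }
  }

  public static void main(String[] args) throws Exception {
    AtomicReference<InternalState> state =
        new AtomicReference<>(InternalState.INITED);

    // Simulates a dispatcher thread that is slower than the test thread:
    // the job sits in SETUP for a while before reaching RUNNING.
    Thread dispatcher = new Thread(() -> {
      try {
        state.set(InternalState.SETUP);
        Thread.sleep(50);
        state.set(InternalState.RUNNING);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    });
    dispatcher.start();

    // The test thread blocks here instead of acting while the job is in SETUP.
    waitForState(state::get, InternalState.RUNNING, 5_000, 10);

    // Only at this point would a test send the kill event.
    System.out.println("Internal state: " + state.get());
    dispatcher.join();
  }
}
```

Polling with a deadline keeps the test deterministic without relying on a fixed sleep: a fast dispatcher costs almost no waiting, and a stuck one fails with a clear timeout instead of an invalid state transition.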