[jira] [Commented] (YARN-1054) Invalid state transition exception caught when tearing down a (mini) cluster

2015-05-01 Thread Xuan Gong (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14523626#comment-14523626
 ] 

Xuan Gong commented on YARN-1054:
-

[~steve_l] Which tests are you running when you got this exception ? Is this 
issue still valid ? If it is, could you share how we can re-produce this ?

 Invalid state transition exception caught when tearing down a (mini) cluster
 

 Key: YARN-1054
 URL: https://issues.apache.org/jira/browse/YARN-1054
 Project: Hadoop YARN
  Issue Type: Bug
  Components: nodemanager
Affects Versions: 2.1.1-beta
Reporter: Steve Loughran
Priority: Minor

 When I'm tearing down a MiniYARNCluster I get a stack trace warning that an 
 invalid state transition has been attempted
 {code}
 [CONTAINER_KILLED_ON_REQUEST]org.apache.hadoop.yarn.state.InvalidStateTransitonException:
  Invalid event: CONTAINER_KILLED_ON_REQUEST
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (YARN-1054) Invalid state transition exception caught when tearing down a (mini) cluster

2015-05-01 Thread Steve Loughran (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14523972#comment-14523972
 ] 

Steve Loughran commented on YARN-1054:
--

very old test by the look of things; Hoya (precursor to Slider) testing a live 
HBase cluster. 

I couldn't replicate this with existing code  haven't seen since.

It happened during teardown, so I don't know how critical it is -if you can't 
debug it from this log, best to close as cannot-reproduce

 Invalid state transition exception caught when tearing down a (mini) cluster
 

 Key: YARN-1054
 URL: https://issues.apache.org/jira/browse/YARN-1054
 Project: Hadoop YARN
  Issue Type: Bug
  Components: nodemanager
Affects Versions: 2.1.1-beta
Reporter: Steve Loughran
Priority: Minor

 When I'm tearing down a MiniYARNCluster I get a stack trace warning that an 
 invalid state transition has been attempted
 {code}
 [CONTAINER_KILLED_ON_REQUEST]org.apache.hadoop.yarn.state.InvalidStateTransitonException:
  Invalid event: CONTAINER_KILLED_ON_REQUEST
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (YARN-1054) Invalid state transition exception caught when tearing down a (mini) cluster

2013-08-09 Thread Steve Loughran (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13735545#comment-13735545
 ] 

Steve Loughran commented on YARN-1054:
--

logs with stack
{code}
2013-08-09 16:46:31,208 [main] INFO  yarn.cluster.YarnMiniClusterTestBase 
(YarnMiniClusterTestBase.groovy:describe(121)) - 
2013-08-09 16:46:31,208 [main] INFO  yarn.cluster.YarnMiniClusterTestBase 
(YarnMiniClusterTestBase.groovy:describe(122)) - ===
2013-08-09 16:46:31,208 [main] INFO  yarn.cluster.YarnMiniClusterTestBase 
(YarnMiniClusterTestBase.groovy:describe(123)) - teardown
2013-08-09 16:46:31,208 [main] INFO  yarn.cluster.YarnMiniClusterTestBase 
(YarnMiniClusterTestBase.groovy:describe(124)) - ===
2013-08-09 16:46:31,208 [main] INFO  yarn.cluster.YarnMiniClusterTestBase 
(YarnMiniClusterTestBase.groovy:describe(125)) - 
2013-08-09 16:46:31,216 [main] INFO  org.mortbay.log (Slf4jLog.java:info(67)) - 
Stopped SelectChannelConnector@stevel-2.local:0
2013-08-09 16:46:31,319 [main] INFO  hadoop.ipc.Server (Server.java:stop(2429)) 
- Stopping server on 57710
2013-08-09 16:46:31,324 [IPC Server listener on 57710] INFO  hadoop.ipc.Server 
(Server.java:run(720)) - Stopping IPC Server listener on 57710
2013-08-09 16:46:31,326 [IPC Server Responder] INFO  hadoop.ipc.Server 
(Server.java:run(866)) - Stopping IPC Server Responder
2013-08-09 16:46:31,334 [main] INFO  hadoop.ipc.Server (Server.java:stop(2429)) 
- Stopping server on 57711
2013-08-09 16:46:31,336 [IPC Server listener on 57711] INFO  hadoop.ipc.Server 
(Server.java:run(720)) - Stopping IPC Server listener on 57711
2013-08-09 16:46:31,336 [IPC Server Responder] INFO  hadoop.ipc.Server 
(Server.java:run(866)) - Stopping IPC Server Responder
2013-08-09 16:46:31,339 [Public Localizer] INFO  
containermanager.localizer.ResourceLocalizationService 
(ResourceLocalizationService.java:run(728)) - Public cache exiting
2013-08-09 16:46:31,340 [main] INFO  server.nodemanager.NodeManager 
(NodeManager.java:cleanupContainers(261)) - Containers still running on 
SHUTDOWN : [container_1376091973496_0001_01_01, 
container_1376091973496_0001_01_02]
2013-08-09 16:46:31,342 [main] INFO  server.nodemanager.NodeManager 
(NodeManager.java:cleanupContainers(270)) - Waiting for containers to be killed
2013-08-09 16:46:31,343 [AsyncDispatcher event handler] INFO  
containermanager.container.Container (ContainerImpl.java:handle(860)) - 
Container container_1376091973496_0001_01_01 transitioned from RUNNING to 
KILLING
2013-08-09 16:46:31,343 [AsyncDispatcher event handler] INFO  
containermanager.container.Container (ContainerImpl.java:handle(860)) - 
Container container_1376091973496_0001_01_02 transitioned from RUNNING to 
KILLING
2013-08-09 16:46:31,344 [AsyncDispatcher event handler] INFO  
containermanager.launcher.ContainerLaunch 
(ContainerLaunch.java:cleanupContainer(323)) - Cleaning up container 
container_1376091973496_0001_01_01
2013-08-09 16:46:31,386 [ContainersLauncher #0] WARN  
server.nodemanager.DefaultContainerExecutor 
(DefaultContainerExecutor.java:launchContainer(207)) - Exit code from container 
container_1376091973496_0001_01_01 is : 143
2013-08-09 16:46:31,403 [AsyncDispatcher event handler] INFO  
containermanager.launcher.ContainerLaunch 
(ContainerLaunch.java:cleanupContainer(323)) - Cleaning up container 
container_1376091973496_0001_01_02
2013-08-09 16:46:31,441 [ContainersLauncher #1] WARN  
server.nodemanager.DefaultContainerExecutor 
(DefaultContainerExecutor.java:launchContainer(207)) - Exit code from container 
container_1376091973496_0001_01_02 is : 143
2013-08-09 16:46:31,456 [AsyncDispatcher event handler] INFO  
containermanager.container.Container (ContainerImpl.java:handle(860)) - 
Container container_1376091973496_0001_01_01 transitioned from KILLING to 
CONTAINER_CLEANEDUP_AFTER_KILL
2013-08-09 16:46:31,457 [AsyncDispatcher event handler] WARN  
containermanager.container.Container (ContainerImpl.java:handle(856)) - Can't 
handle this event at current state: Current: [CONTAINER_CLEANEDUP_AFTER_KILL], 
eventType: [CONTAINER_KILLED_ON_REQUEST]
org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: 
CONTAINER_KILLED_ON_REQUEST at CONTAINER_CLEANEDUP_AFTER_KILL
at 
org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
at 
org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
at 
org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl.handle(ContainerImpl.java:853)
at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl.handle(ContainerImpl.java:73)
at