[ 
https://issues.apache.org/jira/browse/SLIDER-748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273595#comment-14273595
 ] 

Steve Loughran commented on SLIDER-748:
---------------------------------------

Curator is running on a separate thread, and there's no easy way to check that 
ZK is running: the sole {{isRunning()}} probe is unreachable. 

{code}
2015-01-12 12:11:22,513 [main] DEBUG state.AppState (AppState.java:onInstanceDefinitionUpdated(651)) - Instance definition updated
2015-01-12 12:11:22,539 [main] DEBUG state.AppState (AppState.java:resetFailureCounts(1714)) - Resetting failure count of slider-appmaster; was 0
2015-01-12 12:11:22,539 [main] DEBUG appmaster.SliderAppMaster (SliderAppMaster.java:reviewRequestAndReleaseNodes(1642)) - reviewRequestAndReleaseNodes(flexCluster)
2015-01-12 12:11:22,539 [main] DEBUG actions.QueueService (QueueService.java:put(85)) - Queueing org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6 name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:11:22,540 [main] DEBUG appmaster.SliderAppMaster (SliderAppMaster.java:waitForAMCompletionSignal(1361)) - blocking until signalled to terminate
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG actions.QueueExecutor (QueueExecutor.java:run(71)) - Executing org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6 name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG appmaster.SliderAppMaster (SliderAppMaster.java:executeNodeReview(1677)) - in executeNodeReview(flexCluster)
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG state.AppState (AppState.java:reviewRequestAndReleaseNodes(1656)) - in reviewRequestAndReleaseNodes()
2015-01-12 12:11:22,541 [AmExecutor-006] DEBUG actions.QueueExecutor (QueueExecutor.java:run(74)) - Completed org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6 name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:12:10,927 [main-EventThread] INFO  state.ConnectionStateManager (ConnectionStateManager.java:postState(194)) - State change: SUSPENDED
2015-01-12 12:12:10,928 [ConnectionStateManager-0] WARN  state.ConnectionStateManager (ConnectionStateManager.java:processEvents(212)) - There are no ConnectionStateListeners registered.
2015-01-12 12:12:20,794 [Thread-13] DEBUG agent.HeartbeatMonitor (HeartbeatMonitor.java:run(65)) - Putting monitor to sleep for 60000 milliseconds
2015-01-12 12:12:26,538 [CuratorFramework-0] ERROR curator.ConnectionState (ConnectionState.java:checkTimeouts(201)) - Connection timed out for connection string (localhost:65385) and timeout (15000) / elapsed (15607)
org.apache.curator.CuratorConnectionLossException: KeeperErrorCode = ConnectionLoss
        at org.apache.curator.ConnectionState.checkTimeouts(ConnectionState.java:198)
        at org.apache.curator.ConnectionState.getZooKeeper(ConnectionState.java:88)
        at org.apache.curator.CuratorZookeeperClient.getZooKeeper(CuratorZookeeperClient.java:113)
        at org.apache.curator.framework.imps.CuratorFrameworkImpl.performBackgroundOperation(CuratorFrameworkImpl.java:763)
        at org.apache.curator.framework.imps.CuratorFrameworkImpl.backgroundOperationsLoop(CuratorFrameworkImpl.java:749)
        at org.apache.curator.framework.imps.CuratorFrameworkImpl.access$300(CuratorFrameworkImpl.java:56)
        at org.apache.curator.framework.imps.CuratorFrameworkImpl$3.call(CuratorFrameworkImpl.java:244)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
2015-01-12 12:12:28,543 [CuratorFramework-0] ERROR curator.ConnectionState (ConnectionState.java:checkTimeouts(201)) - Connection timed out for connection string (localhost:65385) and timeout (15000) / elapsed (17613)
org.apache.curator.CuratorConnectionLossException: KeeperErrorCode = ConnectionLoss
{code}
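
One possible way to check from the test side whether ZK is actually serving, independent of Curator's background thread, is to send ZooKeeper's {{ruok}} four-letter command straight to the port and look for the {{imok}} reply. This is only a sketch under assumptions: {{ZkProbe}} and {{isZkServing}} are hypothetical names, not part of Slider or Curator, and the port is the one from the connection string in the trace above. Newer ZooKeeper releases may need {{ruok}} enabled through {{4lw.commands.whitelist}} before the server will answer it.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration; not part of Slider or Curator. */
final class ZkProbe {

  private ZkProbe() {
  }

  /**
   * Sends the "ruok" four-letter command to host:port and reports whether
   * the server replied "imok" before the timeout expired.
   * Any connection failure or timeout is treated as "not serving".
   */
  static boolean isZkServing(String host, int port, int timeoutMillis) {
    try (Socket socket = new Socket()) {
      socket.connect(new InetSocketAddress(host, port), timeoutMillis);
      socket.setSoTimeout(timeoutMillis);

      OutputStream out = socket.getOutputStream();
      out.write("ruok".getBytes(StandardCharsets.US_ASCII));
      out.flush();

      // Read up to four bytes; the reply may arrive in more than one chunk.
      InputStream in = socket.getInputStream();
      byte[] reply = new byte[4];
      int read = 0;
      while (read < reply.length) {
        int n = in.read(reply, read, reply.length - read);
        if (n < 0) {
          break; // server closed the connection early
        }
        read += n;
      }
      return read == reply.length
          && "imok".equals(new String(reply, StandardCharsets.US_ASCII));
    } catch (IOException e) {
      // Covers connection refused, connect timeout and read timeout.
      return false;
    }
  }

  public static void main(String[] args) {
    // Port taken from the connection string in the trace; adjust as needed.
    System.out.println("ZK serving: " + isZkServing("localhost", 65385, 2000));
  }
}
```

A probe like this only says whether the server process answers on its port; it says nothing about the state of the Curator session, so it would complement rather than replace a registered {{ConnectionStateListener}}.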

> TestAgentAMManagementWS.testAgentAMManagementWS failing
> -------------------------------------------------------
>
>                 Key: SLIDER-748
>                 URL: https://issues.apache.org/jira/browse/SLIDER-748
>             Project: Slider
>          Issue Type: Sub-task
>          Components: Web & REST
>    Affects Versions: Slider 0.70
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Critical
>             Fix For: Slider 0.70
>
>   Original Estimate: 1h
>          Time Spent: 0.5h
>  Remaining Estimate: 1h
>
> {{TestAgentAMManagementWS.testAgentAMManagementWS}} is failing on Jenkins. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
