[
https://issues.apache.org/jira/browse/SLIDER-748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273595#comment-14273595
]
Steve Loughran commented on SLIDER-748:
---------------------------------------
curator is running on a separate thread, and there's no easy way to check that
ZK is running ... the sole {{isRunning()}} probe is unreachable.
{code}
2015-01-12 12:11:22,513 [main] DEBUG state.AppState
(AppState.java:onInstanceDefinitionUpdated(651)) - Instance definition updated
2015-01-12 12:11:22,539 [main] DEBUG state.AppState
(AppState.java:resetFailureCounts(1714)) - Resetting failure count of
slider-appmaster; was 0
2015-01-12 12:11:22,539 [main] DEBUG appmaster.SliderAppMaster
(SliderAppMaster.java:reviewRequestAndReleaseNodes(1642)) -
reviewRequestAndReleaseNodes(flexCluster)
2015-01-12 12:11:22,539 [main] DEBUG actions.QueueService
(QueueService.java:put(85)) - Queueing
org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6
name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:11:22,540 [main] DEBUG appmaster.SliderAppMaster
(SliderAppMaster.java:waitForAMCompletionSignal(1361)) - blocking until
signalled to terminate
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG actions.QueueExecutor
(QueueExecutor.java:run(71)) - Executing
org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6
name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG appmaster.SliderAppMaster
(SliderAppMaster.java:executeNodeReview(1677)) - in
executeNodeReview(flexCluster)
2015-01-12 12:11:22,540 [AmExecutor-006] DEBUG state.AppState
(AppState.java:reviewRequestAndReleaseNodes(1656)) - in
reviewRequestAndReleaseNodes()
2015-01-12 12:11:22,541 [AmExecutor-006] DEBUG actions.QueueExecutor
(QueueExecutor.java:run(74)) - Completed
org.apache.slider.server.appmaster.actions.ReviewAndFlexApplicationSize@3eec44a6
name='flexCluster', delay=0, attrs=4, sequenceNumber=3}
2015-01-12 12:12:10,927 [main-EventThread] INFO state.ConnectionStateManager
(ConnectionStateManager.java:postState(194)) - State change: SUSPENDED
2015-01-12 12:12:10,928 [ConnectionStateManager-0] WARN
state.ConnectionStateManager (ConnectionStateManager.java:processEvents(212)) -
There are no ConnectionStateListeners registered.
2015-01-12 12:12:20,794 [Thread-13] DEBUG agent.HeartbeatMonitor
(HeartbeatMonitor.java:run(65)) - Putting monitor to sleep for 60000
milliseconds
2015-01-12 12:12:26,538 [CuratorFramework-0] ERROR curator.ConnectionState
(ConnectionState.java:checkTimeouts(201)) - Connection timed out for connection
string (localhost:65385) and timeout (15000) / elapsed (15607)
org.apache.curator.CuratorConnectionLossException: KeeperErrorCode =
ConnectionLoss
at
org.apache.curator.ConnectionState.checkTimeouts(ConnectionState.java:198)
at
org.apache.curator.ConnectionState.getZooKeeper(ConnectionState.java:88)
at
org.apache.curator.CuratorZookeeperClient.getZooKeeper(CuratorZookeeperClient.java:113)
at
org.apache.curator.framework.imps.CuratorFrameworkImpl.performBackgroundOperation(CuratorFrameworkImpl.java:763)
at
org.apache.curator.framework.imps.CuratorFrameworkImpl.backgroundOperationsLoop(CuratorFrameworkImpl.java:749)
at
org.apache.curator.framework.imps.CuratorFrameworkImpl.access$300(CuratorFrameworkImpl.java:56)
at
org.apache.curator.framework.imps.CuratorFrameworkImpl$3.call(CuratorFrameworkImpl.java:244)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
2015-01-12 12:12:28,543 [CuratorFramework-0] ERROR curator.ConnectionState
(ConnectionState.java:checkTimeouts(201)) - Connection timed out for connection
string (localhost:65385) and timeout (15000) / elapsed (17613)
org.apache.curator.CuratorConnectionLossException: KeeperErrorCode =
ConnectionLoss
{code}
> TestAgentAMManagementWS.testAgentAMManagementWS failing
> -------------------------------------------------------
>
> Key: SLIDER-748
> URL: https://issues.apache.org/jira/browse/SLIDER-748
> Project: Slider
> Issue Type: Sub-task
> Components: Web & REST
> Affects Versions: Slider 0.70
> Reporter: Steve Loughran
> Assignee: Steve Loughran
> Priority: Critical
> Fix For: Slider 0.70
>
> Original Estimate: 1h
> Time Spent: 0.5h
> Remaining Estimate: 1h
>
> {{TestAgentAMManagementWS.testAgentAMManagementWS}} failing on jenkins.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)