[
https://issues.apache.org/jira/browse/CLOUDSTACK-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14696921#comment-14696921
]
ASF GitHub Bot commented on CLOUDSTACK-8733:
--------------------------------------------
GitHub user koushik-das opened a pull request:
https://github.com/apache/cloudstack/pull/698
CLOUDSTACK-8733: Host stuck in rebalancing state during agent LB
This happens because ClusterServiceServletAdapter is started after
ClusteredAgentManagerImpl.
The fix is to start ClusterServiceServletAdapter before
ClusteredAgentManagerImpl.
This change also adds a log message in SyncQueueManagerImpl.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/koushik-das/cloudstack CLOUDSTACK-8733
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/cloudstack/pull/698.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #698
----
commit c989921fb7adcfd125ca5f541c0f9c5d1c512c54
Author: Koushik Das <[email protected]>
Date: 2015-08-14T11:41:52Z
CLOUDSTACK-8733: Host stuck in rebalancing state during agent LB
This happens because ClusterServiceServletAdapter is started after
ClusteredAgentManagerImpl.
The fix is to start ClusterServiceServletAdapter before
ClusteredAgentManagerImpl.
----
> Host stuck in rebalancing state during agent LB
> -----------------------------------------------
>
> Key: CLOUDSTACK-8733
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-8733
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public (Anyone can view this level - this is the default.)
> Components: Management Server
> Affects Versions: 4.5.0
> Reporter: Koushik Das
> Assignee: Koushik Das
> Fix For: 4.6.0
>
>
> During agent LB, a host may sometimes get stuck in the rebalancing state due
> to the following failure in the rebalance process. This can happen if
> ClusterServiceServletAdapter (which starts the listener on port 9090 and
> processes agent LB-related commands) is configured and started after
> ClusteredAgentManagerImpl (which initiates the agent LB process).
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> I/O exception (java.net.ConnectException) caught when processing request:
> Connection refused
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> Retrying request
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> I/O exception (java.net.ConnectException) caught when processing request:
> Connection refused
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> Retrying request
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> I/O exception (java.net.ConnectException) caught when processing request:
> Connection refused
> INFO [commons.httpclient.HttpMethodDirector] (Cluster-Worker-4:ctx-00f55c44)
> Retrying request
> ERROR [cloud.cluster.ClusterServiceServletImpl]
> (Cluster-Worker-4:ctx-00f55c44) IOException from :
> http://10.208.62.111:9090/clusterservice, method : name=method, value=5
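The ordering dependency described in the issue can be illustrated with a
minimal, self-contained Java sketch. It is not CloudStack code: the class
StartupOrderSketch, the nested Listener class, and the methods canReach and
freePort are hypothetical stand-ins, and an ephemeral loopback port is assumed
in place of port 9090. It only shows that a component initiating connections
before the listener is bound gets a refused connection, while the same call
succeeds once the listener has been started first.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.ServerSocket;
import java.net.Socket;

public class StartupOrderSketch {

    /** Hypothetical stand-in for the component that owns the cluster listener. */
    static final class Listener implements AutoCloseable {
        private ServerSocket server;

        /** Binds to an ephemeral loopback port (assumption: the real service uses 9090). */
        int start() throws IOException {
            server = new ServerSocket(0, 50, InetAddress.getLoopbackAddress());
            return server.getLocalPort();
        }

        @Override
        public void close() throws IOException {
            if (server != null) {
                server.close();
            }
        }
    }

    /** Hypothetical stand-in for the component that initiates rebalance requests. */
    static boolean canReach(int port) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(InetAddress.getLoopbackAddress(), port), 1000);
            return true;
        } catch (IOException e) {
            // Covers java.net.ConnectException ("Connection refused") and timeouts.
            return false;
        }
    }

    /** Picks a port that is free right now by binding and immediately releasing it. */
    static int freePort() throws IOException {
        try (ServerSocket probe = new ServerSocket(0, 50, InetAddress.getLoopbackAddress())) {
            return probe.getLocalPort();
        }
    }

    public static void main(String[] args) throws IOException {
        // Wrong order: the initiator runs before any listener exists on the port.
        System.out.println("before listener start: " + canReach(freePort()));

        // Right order: start the listener first, then let the initiator connect.
        try (Listener listener = new Listener()) {
            int port = listener.start();
            System.out.println("after listener start: " + canReach(port));
        }
    }
}
```

The sketch deliberately omits retries; in the log above, the HTTP client
retries the request several times before the IOException is reported.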