[
https://issues.apache.org/jira/browse/CLOUDSTACK-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15422276#comment-15422276
]
ASF GitHub Bot commented on CLOUDSTACK-9458:
--------------------------------------------
GitHub user marcaurele opened a pull request:
https://github.com/apache/cloudstack/pull/1640
CLOUDSTACK-9458: Fix HA bug when VMs are stopped on agent disconnect
VMs are being restarted even if they don't have HA enabled. If the agent
comes back online before the command expires, some VMs are stopped and
never restarted.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/exoscale/cloudstack bug/CLOUDSTACK-9458
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/cloudstack/pull/1640.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1640
----
commit c8f57541006536c4719044405b7691fad244fbf7
Author: Marc-Aurèle Brothier <[email protected]>
Date: 2016-08-16T06:10:51Z
CLOUDSTACK-9458: Fix HA bug when VMs are stopped on agent disconnect
VMs are being restarted even if they don't have HA enabled. If the agent
comes back online before the command expires, some VMs are stopped and
never restarted.
----
> Some VMs are being stopped when agent is reconnecting
> -----------------------------------------------------
>
> Key: CLOUDSTACK-9458
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-9458
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public (Anyone can view this level - this is the default.)
> Reporter: Marc-Aurèle Brothier
>
> If you lose communication between the management server and one of the
> agents for a few minutes, the HighAvailabilityManager kicks in and starts
> scheduling VM restarts, even though HA mode is not active. Those tasks are
> inserted as async jobs in the DB, and if the agent comes back online while
> the jobs are still in the async table, they are pushed to the agent and shut
> down the VMs. Then, since HA is not active, the VMs are not restarted.
> The expected behavior, in my opinion, is that VMs should not be restarted at
> all if HA mode is not active on them, and that the agent should be left to
> update the VM state with the power report.
> The bug lies in
> {{HighAvailabilityManagerImpl.scheduleRestartForVmsOnHost(final HostVO host,
> boolean investigate)}}; a PR will follow.
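
The expected behavior described in the issue can be illustrated with a small, self-contained sketch. This is not the code from the pull request and it does not use CloudStack's real types: the class HaRestartSketch, the Vm record and the method vmsToScheduleForRestart are hypothetical names made up for illustration. It only shows the idea that VMs without HA enabled are skipped when restarts are scheduled for a disconnected host, so that no stop/restart job is ever queued for them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only: hypothetical types, not CloudStack's HostVO/VMInstanceVO.
class HaRestartSketch {

    // Minimal stand-in for a VM running on the disconnected host.
    static final class Vm {
        final String name;
        final boolean haEnabled;

        Vm(String name, boolean haEnabled) {
            this.name = name;
            this.haEnabled = haEnabled;
        }
    }

    // Returns the names of the VMs for which a restart job would be queued.
    // VMs without HA are skipped, so nothing is left in the job queue that
    // could stop them once the agent reconnects; their state is left to the
    // agent's power report.
    static List<String> vmsToScheduleForRestart(List<Vm> vmsOnHost) {
        List<String> scheduled = new ArrayList<>();
        for (Vm vm : vmsOnHost) {
            if (!vm.haEnabled) {
                continue; // HA not active on this VM: do not schedule anything
            }
            scheduled.add(vm.name);
        }
        return scheduled;
    }

    public static void main(String[] args) {
        List<Vm> vms = Arrays.asList(new Vm("vm-ha", true), new Vm("vm-plain", false));
        System.out.println(vmsToScheduleForRestart(vms));
    }
}
```

In this sketch only "vm-ha" would get a restart job; "vm-plain" is left untouched. How the actual fix decides which VMs to skip is defined by the pull request, not by this example.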