René Moser created CLOUDSTACK-8848:
--------------------------------------

             Summary: Unexpected VR reboot after out-of-band migration
                 Key: CLOUDSTACK-8848
                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-8848
             Project: CloudStack
          Issue Type: Bug
      Security Level: Public (Anyone can view this level - this is the default.)
          Components: VMware
    Affects Versions: 4.5.2, 4.6.0
            Reporter: René Moser
            Priority: Critical
             Fix For: 4.5.3


In some conditions, the VR gets rebooted after an out-of-band migration was 
done in vCenter. 

Note that the new global setting in 4.5.2, "VR reboot after out of band 
migration", is set to false, so this looks more like a bug.

After a VR migration to a host _and_ when the VM power state report gathering 
is running, the VR (and likewise any user VM) will get into the 
"PowerReportMissing" state because it is not on the expected host anymore. 
This is ok. 

But if the VM is a VR, it will be stopped and started again. That is what we 
see.

It cannot be reproduced every time a migration is done, but it seems the 
problem is "powerReportMissing".

I grepped the source and found this related line:

https://github.com/apache/cloudstack/blob/master/engine/orchestration/src/com/cloud/vm/VirtualMachineManagerImpl.java#L3702

It also seems that the grace period might be related. (I am not quite sure, 
but this grace period looks way too optimistic to me: 2 ping.interval are 
2 x 60s, and the power state report runs every 60s.)

https://github.com/apache/cloudstack/blob/master/engine/orchestration/src/com/cloud/vm/VirtualMachinePowerStateSyncImpl.java#L110
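
To make the timing concern above concrete, here is a minimal sketch of a grace period check of the kind described: a VM counts as "report missing" once its last power report is older than two ping intervals. The class and method names are hypothetical and are not copied from the CloudStack source; only the values (2 x ping.interval, 60s) come from this report.

```java
// Hypothetical sketch; names are illustrative and not taken from CloudStack.
final class GracePeriodSketch {

    // Grace period as described in the report: two ping intervals, in milliseconds.
    static long gracePeriodMillis(long pingIntervalSeconds) {
        return 2L * pingIntervalSeconds * 1000L;
    }

    // True when the last power report for a VM is older than the grace period.
    static boolean isPowerReportMissing(long lastReportMillis, long nowMillis,
                                        long pingIntervalSeconds) {
        return nowMillis - lastReportMillis > gracePeriodMillis(pingIntervalSeconds);
    }

    public static void main(String[] args) {
        long pingIntervalSeconds = 60; // value quoted in the report
        long lastReport = 0L;          // time of the last report that listed the VM

        // Checks made 60s and 120s after the last report are still within the grace period.
        System.out.println(isPowerReportMissing(lastReport, 60_000L, pingIntervalSeconds));
        System.out.println(isPowerReportMissing(lastReport, 120_000L, pingIntervalSeconds));
        // Any check later than 120s treats the report as missing.
        System.out.println(isPowerReportMissing(lastReport, 120_001L, pingIntervalSeconds));
    }
}
```

Under these assumptions, with reports arriving every 60s, roughly one delayed or skipped report is enough to push a VM past the 120s window.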

If it is a user VM, we see in the logs that the state is set to power-off, 
but the VM keeps running. After a while a new VM power state report runs 
and the state for the user VM gets updated to Running again.

I have not been able to do much debugging yet; logs will be appended tomorrow. 






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
