Prieur Leary created CLOUDSTACK-7457:
----------------------------------------
Summary: Unable to launch VM after unexpected Hypervisor Reboot
(out of band)
Key: CLOUDSTACK-7457
URL: https://issues.apache.org/jira/browse/CLOUDSTACK-7457
Project: CloudStack
Issue Type: Bug
Security Level: Public (Anyone can view this level - this is the default.)
Components: KVM, Management Server
Affects Versions: 4.4.0
Environment: CentOS
Reporter: Prieur Leary
Priority: Critical
After an unexpected hypervisor server reboot (server crash), certain VMs fail
to start and return, "Was unable to find lock for the key vm_instance1355"
(full log below).
I suspect this is related to the VM state being out of sync. As it stands, am
searching for a way to work around this issue, should anyone care to provide
some insight.
------------------------------------
2014-08-31 11:14:33,381 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Checking if host: 28
has enough capacity for requested CPU: 30 and requested RA$
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Hosts's actual total
CPU: 19992 and CPU after applying overprovisioning: 119952
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) We need to allocate
to the last host again, so checking if there is enough rese$
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Reserved CPU: 30 ,
Requested CPU: 30
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Reserved RAM:
33554432 , Requested RAM: 33554432
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Host has enough CPU
and RAM available
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) STATS: Can alloc CPU
from host: 28, used: 6000, reserved: 30, actual total: 199$
2014-08-31 11:14:33,383 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) STATS: Can alloc MEM
from host: 28, used: 5368709120, reserved: 33554432, total$
2014-08-31 11:14:33,384 DEBUG [c.c.c.CapacityManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Host: 28 has cpu
capability (cpu:8, speed:2499) to support requested CPU: 1 and$
2014-08-31 11:14:33,384 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) The last host of
this VM is UP and has enough capacity
2014-08-31 11:14:33,384 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Now checking for
suitable pools under zone: 1, pod: 1, cluster: 2
2014-08-31 11:14:33,385 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Checking suitable
pools for volume (Id, Type): (1549,ROOT)
2014-08-31 11:14:33,385 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Volume has pool
already allocated, checking if pool can be reused, po$
2014-08-31 11:14:33,387 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Planner need not
allocate a pool for this volume since its READY
2014-08-31 11:14:33,387 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Trying to find a
potenial host and associated storage pools from the $
2014-08-31 11:14:33,388 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Checking if host: 28
can access any suitable storage pool for volume:$
2014-08-31 11:14:33,388 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Host: 28 can access
pool: 1
2014-08-31 11:14:33,389 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Found a potential
host id: 28 name: MC2HOST15.fortatrust.com and asso$
2014-08-31 11:14:33,389 DEBUG [c.c.d.DeploymentPlanningManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Returning Deployment
Destination: Dest[Zone(Id)-Pod(Id)-Cluster(Id)-H$
2014-08-31 11:14:33,478 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Sync job-201795
execution on object VmWorkJobQueue.1355
2014-08-31 11:14:33,479 WARN [c.c.u.d.Merovingian2]
(API-Job-Executor-14:ctx-17edc0e6 job-201794 ctx-25906915) Was unable to find
lock for the key vm_instance1355 and thread id 1146336087
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)