Sangeetha Hariharan created CLOUDSTACK-5430:
-----------------------------------------------

             Summary: KVM - Primary store down - Not able to start VMs/take 
snapshots after the primary store is brought down and brought back up again.
                 Key: CLOUDSTACK-5430
                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-5430
             Project: CloudStack
          Issue Type: Bug
      Security Level: Public (Anyone can view this level - this is the default.)
          Components: Management Server
    Affects Versions: 4.3.0
         Environment: Build from 4.3
            Reporter: Sangeetha Hariharan
            Assignee: edison su
            Priority: Critical
             Fix For: 4.3.0


KVM - Primary store down - Not able to start VMs/take snapshots after the 
primary store is brought down and brought back up again.

Set up:

Advanced zone with KVM (RHEL 6.3) hosts.

Steps to reproduce the problem:

1. Deploy a few VMs on each of the hosts, each with a 10 GB ROOT volume, so 
that we start with 10 VMs.
2. Create snapshots of the ROOT volumes.
3. While the snapshots are still in progress, make the primary storage 
unavailable for 10 minutes.

This causes the KVM hosts to reboot.

However, the reboot of the KVM hosts does not succeed. It gets stuck trying to 
unmount the NFS mount points. This is tracked in CLOUDSTACK-5429.

Stop and start the KVM hosts manually to work around this problem.

At this point, all the VMs are marked as "Stopped" in CloudStack.

4. Now make the primary store available.

5. Attempt to start one of the VMs.
It fails to start with the following exception:

2013-12-09 20:35:55,891 DEBUG [c.c.a.t.Request] (AgentManager-Handler-2:null) Seq 2-1983250480: Processing:  { Ans: , MgmtId: 82324189320212, via: 2, Ver: v1, Flags: 10, [{"com.cloud.agent.api.Answer":{"result":false,"details":"java.lang.NullPointerException\n\tat com.cloud.hypervisor.kvm.resource.LibvirtComputingResource.execute(LibvirtComputingResource.java:2488)\n\tat com.cloud.hypervisor.kvm.resource.LibvirtComputingResource.executeRequest(LibvirtComputingResource.java:1260)\n\tat com.cloud.agent.Agent.processRequest(Agent.java:498)\n\tat com.cloud.agent.Agent$AgentRequestHandler.doTask(Agent.java:806)\n\tat com.cloud.utils.nio.Task.run(Task.java:83)\n\tat java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)\n\tat java.lang.Thread.run(Thread.java:679)\n","wait":0}}] }
2013-12-09 20:35:55,891 DEBUG [c.c.a.t.Request] (StatsCollector-3:ctx-f0d35c47) Seq 2-1983250480: Received:  { Ans: , MgmtId: 82324189320212, via: 2, Ver: v1, Flags: 10, { Answer } }
2013-12-09 20:35:56,939 DEBUG [c.c.a.ApiServlet] (catalina-exec-13:ctx-35adede4) ===START===  10.216.50.147 -- GET  command=queryAsyncJobResult&jobId=489806e9-96f9-4940-9ea0-6bd9516aabb0&response=json&sessionkey=qRSeXYRCfc1PSAXcomRT8ue1f%2BE%3D&_=1386639381768
2013-12-09 20:35:56,953 DEBUG [c.c.a.ApiServlet] (catalina-exec-13:ctx-35adede4 ctx-065180b8) ===END===  10.216.50.147 -- GET  command=queryAsyncJobResult&jobId=489806e9-96f9-4940-9ea0-6bd9516aabb0&response=json&sessionkey=qRSeXYRCfc1PSAXcomRT8ue1f%2BE%3D&_=1386639381768
2013-12-09 20:35:59,322 DEBUG [c.c.a.t.Request] (AgentManager-Handler-14:null) Seq 1-539557989: Processing:  { Ans: , MgmtId: 82324189320212, via: 1, Ver: v1, Flags: 10, [{"com.cloud.agent.api.Answer":{"result":false,"details":"java.lang.NullPointerException\n\tat com.cloud.hypervisor.kvm.storage.KVMStoragePoolManager.disconnectPhysicalDisksViaVmSpec(KVMStoragePoolManager.java:181)\n\tat com.cloud.hypervisor.kvm.resource.LibvirtComputingResource.execute(LibvirtComputingResource.java:3672)\n\tat com.cloud.hypervisor.kvm.resource.LibvirtComputingResource.executeRequest(LibvirtComputingResource.java:1282)\n\tat com.cloud.agent.Agent.processRequest(Agent.java:498)\n\tat com.cloud.agent.Agent$AgentRequestHandler.doTask(Agent.java:806)\n\tat com.cloud.utils.nio.Task.run(Task.java:83)\n\tat java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)\n\tat java.lang.Thread.run(Thread.java:679)\n","wait":0}},{"com.cloud.agent.api.Answer":{"result":false,"details":"Stopped by previous failure","wait":0}},{"com.cloud.agent.api.Answer":{"result":false,"details":"Stopped by previous failure","wait":0}},{"com.cloud.agent.api.Answer":{"result":false,"details":"Stopped by previous failure","wait":0}},{"com.cloud.agent.api.Answer":{"result":false,"details":"Stopped by previous failure","wait":0}},{"com.cloud.agent.api.Answer":{"result":false,"details":"Stopped by previous failure","wait":0}}] }
2013-12-09 20:35:59,322 DEBUG [c.c.a.t.Request] (Job-Executor-26:ctx-0382e21d ctx-d8f9d323) Seq 1-539557989: Received:  { Ans: , MgmtId: 82324189320212, via: 1, Ver: v1, Flags: 10, { Answer, Answer, Answer, Answer, Answer, Answer } }

6. Attempting to take snapshots also fails, with the following exception:

2013-12-09 20:54:10,509 DEBUG [c.c.a.t.Request] (AgentManager-Handler-10:null) Seq 2-1983250525: Processing:  { Ans: , MgmtId: 82324189320212, via: 2, Ver: v1, Flags: 10, [{"org.apache.cloudstack.storage.command.CreateObjectAnswer":{"result":false,"details":"com.cloud.utils.exception.CloudRuntimeException: java.lang.NullPointerException","wait":0}}] }
2013-12-09 20:54:10,509 DEBUG [c.c.a.t.Request] (Job-Executor-34:ctx-eb237191 ctx-20bb478f) Seq 2-1983250525: Received:  { Ans: , MgmtId: 82324189320212, via: 2, Ver: v1, Flags: 10, { CreateObjectAnswer } }
2013-12-09 20:54:10,509 DEBUG [o.a.c.s.s.SnapshotServiceImpl] (Job-Executor-34:ctx-eb237191 ctx-20bb478f) create snapshot TestVM-tiny-host-0ps-0-4_ROOT-49_20131210014410 failed: com.cloud.utils.exception.CloudRuntimeException: java.lang.NullPointerException
2013-12-09 20:54:10,519 DEBUG [o.a.c.s.s.XenserverSnapshotStrategy] (Job-Executor-34:ctx-eb237191 ctx-20bb478f) Failed to take snapshot: com.cloud.utils.exception.CloudRuntimeException: java.lang.NullPointerException
2013-12-09 20:54:10,536 DEBUG [c.c.s.s.SnapshotManagerImpl] (Job-Executor-34:ctx-eb237191 ctx-20bb478f) Failed to create snapshot
com.cloud.utils.exception.CloudRuntimeException: com.cloud.utils.exception.CloudRuntimeException: java.lang.NullPointerException
        at org.apache.cloudstack.storage.snapshot.XenserverSnapshotStrategy.takeSnapshot(XenserverSnapshotStrategy.java:281)
        at com.cloud.storage.snapshot.SnapshotManagerImpl.takeSnapshot(SnapshotManagerImpl.java:951)
        at sun.reflect.GeneratedMethodAccessor230.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:183)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:150)
        at org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:91)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:172)
        at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
        at $Proxy161.takeSnapshot(Unknown Source)
        at org.apache.cloudstack.storage.volume.VolumeServiceImpl.takeSnapshot(VolumeServiceImpl.java:1341)
        at com.cloud.storage.VolumeApiServiceImpl.takeSnapshot(VolumeApiServiceImpl.java:1461)
        at sun.reflect.GeneratedMethodAccessor229.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:183)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:150)
        at org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:91)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:172)
        at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
        at $Proxy233.takeSnapshot(Unknown Source)
        at org.apache.cloudstack.api.command.user.snapshot.CreateSnapshotCmd.execute(CreateSnapshotCmd.java:181)
        at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:161)
        at com.cloud.api.ApiAsyncJobDispatcher.runJobInContext(ApiAsyncJobDispatcher.java:109)
        at com.cloud.api.ApiAsyncJobDispatcher$1.run(ApiAsyncJobDispatcher.java:66)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
        at com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:63)
        at org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:520)
        at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
        at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
        at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
        at java.util.concurrent.FutureTask.run(FutureTask.java:166)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
        at java.lang.Thread.run(Thread.java:722)
2013-12-09 20:54:10,544 DEBUG [o.a.c.s.v.VolumeServiceImpl] (Job-Executor-34:ctx-eb237191 ctx-20bb478f) Take snapshot: 49 failed
com.cloud.utils.exception.CloudRuntimeException: Failed to create snapshot
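
The agent stack traces in the Request log entries above are embedded in the JSON "details" field with escaped \n and \t sequences, which makes them hard to read. The following is a minimal Python sketch for pulling them out in readable form. It is not part of CloudStack: the helper name extract_details is hypothetical, and it assumes that each log entry has first been rejoined into a single line and that the answer array is the only "[{ ... }]" span in that line.

```python
import json
from typing import List


def extract_details(log_line: str) -> List[str]:
    """Return the decoded "details" string of every answer in one log line."""
    # Assumption: the answers are serialized as a JSON array that starts at
    # the first '[{' and ends at the last '}]' of the rejoined log line.
    start = log_line.find('[{')
    end = log_line.rfind('}]')
    if start == -1 or end < start:
        return []
    answers = json.loads(log_line[start:end + 2])
    details = []
    for answer in answers:
        # Each array element maps an answer class name to its fields.
        for fields in answer.values():
            if "details" in fields:
                details.append(fields["details"])
    return details


# Shortened copy of the first failing answer from the log above
# (only the top stack frame is kept to keep the sample short).
sample = (
    r'2013-12-09 20:35:55,891 DEBUG [c.c.a.t.Request] '
    r'(AgentManager-Handler-2:null) Seq 2-1983250480: Processing:  '
    r'{ Ans: , MgmtId: 82324189320212, via: 2, Ver: v1, Flags: 10, '
    r'[{"com.cloud.agent.api.Answer":{"result":false,"details":'
    r'"java.lang.NullPointerException\n\tat '
    r'com.cloud.hypervisor.kvm.resource.LibvirtComputingResource.execute'
    r'(LibvirtComputingResource.java:2488)\n","wait":0}}] }'
)

# json.loads turns the escaped \n and \t into real line breaks and tabs,
# so the trace prints as an ordinary multi-line Java stack trace.
for text in extract_details(sample):
    print(text)
```

Run against the second failing entry, the same helper would return six strings: the NullPointerException trace followed by five "Stopped by previous failure" messages.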






--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
