manasaveloori created CLOUDSTACK-5706:
-----------------------------------------
Summary: Multiple NPEs when host is put in maintenance after
upgrading from 4.2.1 to 4.3
Key: CLOUDSTACK-5706
URL: https://issues.apache.org/jira/browse/CLOUDSTACK-5706
Project: CloudStack
Issue Type: Bug
Security Level: Public (Anyone can view this level - this is the default.)
Components: Upgrade
Affects Versions: 4.3.0
Environment: upgraded the CS from 4.2.1 to 4.3
Reporter: manasaveloori
Priority: Critical
Fix For: 4.3.0
Steps:
1. Deployed CS 4.2.1 GA build with ESXi5.1(using Vsphere 5.1 client)
2. Deployed some VMs.
3. Upgraded the CS to 4.3.
4. Put the host into maintenance---- which will be successful.
Observed the following NPEs in MSlogs.
2014-01-01 20:17:11,749 DEBUG [c.c.a.ApiServlet] (catalina-exec-15:ctx-e3555589
ctx-1ab4f80c) ===END=== 10.252.192.34 -- GET
command=prepareHostForMaintenance&id=dbd66102-e51e-4014-be75-e7ac8ccb81f1&response=json&sessionkey=ble18kT0VfY%2BEPq8V8DzAZsM8L0%3D&_=1388568267594
2014-01-01 20:17:11,752 INFO [o.a.c.f.j.i.AsyncJobMonitor]
(Job-Executor-118:ctx-e4cb1a1d) Add job-166 into job monitoring
2014-01-01 20:17:11,752 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d) Executing AsyncJobVO {id:166, userId: 2,
accountId: 2, instanceType: Host, instanceId: 4, cmd:
org.apache.cloudstack.api.command.admin.host.PrepareForMaintenanceCmd, cmdInfo:
{"response":"json","id":"dbd66102-e51e-4014-be75-e7ac8ccb81f1","sessionkey":"ble18kT0VfY+EPq8V8DzAZsM8L0\u003d","cmdEventType":"MAINT.PREPARE","ctxUserId":"2","httpmethod":"GET","_":"1388568267594","ctxAccountId":"2","ctxStartEventId":"758"},
cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0, result:
null, initMsid: 6758231703598, completeMsid: null, lastUpdated: null,
lastPolled: null, created: null}
2014-01-01 20:17:11,782 DEBUG [c.c.a.t.Request] (Job-Executor-118:ctx-e4cb1a1d
ctx-1ab4f80c) Seq 4-1399914503: Sending { Cmd , MgmtId: 6758231703598, via:
4(10.147.40.28), Ver: v1, Flags: 100111,
[{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] }
2014-01-01 20:17:11,782 DEBUG [c.c.a.t.Request] (Job-Executor-118:ctx-e4cb1a1d
ctx-1ab4f80c) Seq 4-1399914503: Executing: { Cmd , MgmtId: 6758231703598, via:
4(10.147.40.28), Ver: v1, Flags: 100111,
[{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] }
2014-01-01 20:17:11,806 DEBUG [c.c.a.m.DirectAgentAttache]
(DirectAgent-410:ctx-32cc846b) Seq 4-1399914503: Executing request
2014-01-01 20:17:11,807 INFO [c.c.h.v.r.VmwareResource]
(DirectAgent-410:ctx-32cc846b 10.147.40.28) Executing resource MaintainCommand:
{"wait":0}
2014-01-01 20:17:11,814 DEBUG [c.c.a.m.DirectAgentAttache]
(DirectAgent-410:ctx-32cc846b) Seq 4-1399914503: Response Received:
2014-01-01 20:17:11,814 DEBUG [c.c.a.t.Request] (DirectAgent-410:ctx-32cc846b)
Seq 4-1399914503: Processing: { Ans: , MgmtId: 6758231703598, via: 4, Ver: v1,
Flags: 110,
[{"com.cloud.agent.api.MaintainAnswer":{"willMigrate":true,"result":true,"details":"Put
host in maintaince","wait":0}}] }
2014-01-01 20:17:11,815 DEBUG [c.c.a.t.Request] (Job-Executor-118:ctx-e4cb1a1d
ctx-1ab4f80c) Seq 4-1399914503: Received: { Ans: , MgmtId: 6758231703598, via:
4, Ver: v1, Flags: 110, { MaintainAnswer } }
2014-01-01 20:17:11,815 DEBUG [c.c.a.m.AgentManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Details from executing class
com.cloud.agent.api.MaintainCommand: Put host in maintaince
2014-01-01 20:17:11,817 DEBUG [c.c.a.m.AgentAttache]
(DirectAgent-410:ctx-32cc846b) Seq 4-1399914503: No more commands found
2014-01-01 20:17:11,834 DEBUG [c.c.r.ResourceState]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Resource state update: [id = 4;
name = 10.147.40.28; old state = Enabled; event = AdminAskMaintenace; new state
= PrepareForMaintenance]
2014-01-01 20:17:11,834 DEBUG [c.c.a.m.AgentAttache]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Seq 4-1399914499: Sending
disconnect to class com.cloud.network.security.SecurityGroupListener
2014-01-01 20:17:11,888 DEBUG [c.c.h.HighAvailabilityManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Scheduled
HAWork[44-ForceStop-9-Running-Scheduled]
2014-01-01 20:17:11,910 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-07bb080c work-44) Processing
HAWork[44-ForceStop-9-Running-Scheduled]
2014-01-01 20:17:11,917 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-07bb080c work-44) Stopping VM[DomainRouter|r-9-VM]
2014-01-01 20:17:11,920 ERROR [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-07bb080c work-44) Terminating
HAWork[44-ForceStop-9-Running-Scheduled]
java.lang.NullPointerException
at
com.cloud.vm.VirtualMachineManagerImpl.advanceStop(VirtualMachineManagerImpl.java:1264)
at
com.cloud.ha.HighAvailabilityManagerImpl.stopVM(HighAvailabilityManagerImpl.java:692)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.runWithContext(HighAvailabilityManagerImpl.java:869)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.access$000(HighAvailabilityManagerImpl.java:822)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread$1.run(HighAvailabilityManagerImpl.java:834)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.run(HighAvailabilityManagerImpl.java:831)
2014-01-01 20:17:11,919 DEBUG [c.c.h.HighAvailabilityManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Scheduled
HAWork[45-ForceStop-10-Running-Scheduled]
2014-01-01 20:17:11,926 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-ccc388fe work-45) Processing
HAWork[45-ForceStop-10-Running-Scheduled]
2014-01-01 20:17:11,944 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-ccc388fe work-45) Stopping VM[User|tiervmBU]
2014-01-01 20:17:11,949 ERROR [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-ccc388fe work-45) Terminating
HAWork[45-ForceStop-10-Running-Scheduled]
java.lang.NullPointerException
at
com.cloud.vm.VirtualMachineManagerImpl.advanceStop(VirtualMachineManagerImpl.java:1264)
at
com.cloud.ha.HighAvailabilityManagerImpl.stopVM(HighAvailabilityManagerImpl.java:692)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.runWithContext(HighAvailabilityManagerImpl.java:869)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.access$000(HighAvailabilityManagerImpl.java:822)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread$1.run(HighAvailabilityManagerImpl.java:834)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.run(HighAvailabilityManagerImpl.java:831)
2014-01-01 20:17:11,957 DEBUG [c.c.h.HighAvailabilityManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Scheduled
HAWork[46-ForceStop-13-Running-Scheduled]
2014-01-01 20:17:11,975 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-c4ef9c26 work-46) Processing
HAWork[46-ForceStop-13-Running-Scheduled]
2014-01-01 20:17:11,985 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-c4ef9c26 work-46) Stopping VM[ConsoleProxy|v-13-VM]
2014-01-01 20:17:11,985 ERROR [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-c4ef9c26 work-46) Terminating
HAWork[46-ForceStop-13-Running-Scheduled]
java.lang.NullPointerException
at
com.cloud.vm.VirtualMachineManagerImpl.advanceStop(VirtualMachineManagerImpl.java:1264)
at
com.cloud.ha.HighAvailabilityManagerImpl.stopVM(HighAvailabilityManagerImpl.java:692)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.runWithContext(HighAvailabilityManagerImpl.java:869)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.access$000(HighAvailabilityManagerImpl.java:822)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread$1.run(HighAvailabilityManagerImpl.java:834)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.run(HighAvailabilityManagerImpl.java:831)
2014-01-01 20:17:11,989 DEBUG [c.c.h.HighAvailabilityManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Scheduled
HAWork[47-ForceStop-14-Running-Scheduled]
2014-01-01 20:17:12,008 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-6541a513 work-47) Processing
HAWork[47-ForceStop-14-Running-Scheduled]
2014-01-01 20:17:12,011 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-6541a513 work-47) Stopping VM[SecondaryStorageVm|s-14-VM]
2014-01-01 20:17:12,012 ERROR [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-1:ctx-6541a513 work-47) Terminating
HAWork[47-ForceStop-14-Running-Scheduled]
java.lang.NullPointerException
at
com.cloud.vm.VirtualMachineManagerImpl.advanceStop(VirtualMachineManagerImpl.java:1264)
at
com.cloud.ha.HighAvailabilityManagerImpl.stopVM(HighAvailabilityManagerImpl.java:692)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.runWithContext(HighAvailabilityManagerImpl.java:869)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.access$000(HighAvailabilityManagerImpl.java:822)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread$1.run(HighAvailabilityManagerImpl.java:834)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.run(HighAvailabilityManagerImpl.java:831)
2014-01-01 20:17:12,020 DEBUG [c.c.h.HighAvailabilityManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Scheduled
HAWork[48-ForceStop-15-Running-Scheduled]
2014-01-01 20:17:12,038 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-b75f1173 work-48) Processing
HAWork[48-ForceStop-15-Running-Scheduled]
2014-01-01 20:17:12,045 INFO [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-b75f1173 work-48) Stopping
VM[User|VM-8d5bebec-4c34-4b74-aeca-9b241e82f08a]
2014-01-01 20:17:12,047 ERROR [c.c.h.HighAvailabilityManagerImpl]
(HA-Worker-0:ctx-b75f1173 work-48) Terminating
HAWork[48-ForceStop-15-Running-Scheduled]
java.lang.NullPointerException
at
com.cloud.vm.VirtualMachineManagerImpl.advanceStop(VirtualMachineManagerImpl.java:1264)
at
com.cloud.ha.HighAvailabilityManagerImpl.stopVM(HighAvailabilityManagerImpl.java:692)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.runWithContext(HighAvailabilityManagerImpl.java:869)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.access$000(HighAvailabilityManagerImpl.java:822)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread$1.run(HighAvailabilityManagerImpl.java:834)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
com.cloud.ha.HighAvailabilityManagerImpl$WorkerThread.run(HighAvailabilityManagerImpl.java:831)
2014-01-01 20:17:12,073 DEBUG [c.c.r.ResourceManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Sent resource event
EVENT_PREPARE_MAINTENANCE_AFTER to listener CapacityManagerImpl
2014-01-01 20:17:12,088 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d ctx-1ab4f80c) Complete async job-166, jobStatus:
SUCCEEDED, resultCode: 0, result:
org.apache.cloudstack.api.response.HostResponse/host/{"id":"dbd66102-e51e-4014-be75-e7ac8ccb81f1","name":"10.147.40.28","state":"Up","disconnected":"2013-12-31T17:12:03+0530","type":"Routing","ipaddress":"10.147.40.28","zoneid":"4daf5bda-ff3e-42b2-97e3-936d25ba57ef","zonename":"manasa","podid":"b0e11a62-bc03-4a5f-9056-34f7da627b28","podname":"podVMw","version":"4.3.0-SNAPSHOT","hypervisor":"VMware","cpusockets":1,"cpunumber":4,"cpuspeed":2394,"cpuallocated":"0%","cpuused":"0%","cpuwithoverprovisioning":"9576.0","networkkbsread":0,"networkkbswrite":0,"memorytotal":17169539072,"memoryallocated":0,"memoryused":0,"capabilities":"hvm","lastpinged":"1970-01-16T22:10:42+0530","managementserverid":6758231703598,"clusterid":"16b91cfa-6ba3-485d-b2d6-bfc9153cd4dc","clustername":"10.147.60.9/manasa/clusterVMw","clustertype":"ExternalManaged","islocalstorageactive":false,"created":"2013-12-30T22:52:13+0530","events":"AgentConnected;
StartAgentRebalance; Ping; AgentDisconnected; HostDown; PingTimeout;
ShutdownRequested; ManagementServerDown;
Remove","resourcestate":"PrepareForMaintenance","hypervisorversion":"5.1","hahost":false,"jobid":"60fed810-c048-4759-9ae2-9e8227bc4583","jobstatus":0}
2014-01-01 20:17:12,102 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
(Job-Executor-118:ctx-e4cb1a1d) Done executing
org.apache.cloudstack.api.command.admin.host.PrepareForMaintenanceCmd for
job-166.
Attaching the Ms logs:
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)