[
https://issues.apache.org/jira/browse/CLOUDSTACK-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Abhinandan Prateek updated CLOUDSTACK-4137:
-------------------------------------------
Priority: Critical (was: Major)
> KVM: After unmanaging cluster, manage cluster will not bring KVM hosts to UP
> state. cloud-agent on KVM hosts has to be restarted manually
> -----------------------------------------------------------------------------------------------------------------------------------------
>
> Key: CLOUDSTACK-4137
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-4137
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public(Anyone can view this level - this is the
> default.)
> Components: KVM
> Affects Versions: 4.2.0
> Environment: hypervisor : KVM
> Reporter: prashant kumar mishra
> Assignee: Kishan Kavala
> Priority: Critical
> Fix For: 4.2.1
>
> Attachments: Logs_DB_Agent.rar
>
>
> Steps to reproduce
> ---------------------------
> 1-prepare a CS ->one cluster-->one kvm(rhel6.3)
> 2-unmanage cluster
> 3-manage cluster
> Expected
> --------------
> Host should come up , in running state
> Actual
> ----------
> Host remain in Disconnect state
> My observation
> -----------------------
> 1-after host went in disconnect state i performed host maintenance mode
> then cancel maintenance mode and host came up
> 2-restart cloud-agent on the kvm hosts bring the hosts to UP state
> Logs:
> ----------
> 2013-08-07 20:14:16,999 DEBUG [cloud.api.ApiServlet] (catalina-exec-23:null)
> ===START=== 10.252.192.53 -- GET
> command=updateCluster&id=854b547a-fbee-4eed-895d-b8ea96d1cc23&managedstate=Unmanaged&response=json&sessionkey=vOfsgLOiksyXg%2B23XuFp6maNm1I%3D&_=1375867173396
> 2013-08-07 20:14:17,031 DEBUG [agent.transport.Request]
> (catalina-exec-23:null) Seq 1-2042691611: Sending { Cmd , MgmtId:
> 6703101771911, via: 1, Ver: v1, Flags: 100111,
> [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] }
> 2013-08-07 20:14:17,038 DEBUG [agent.transport.Request]
> (AgentManager-Handler-7:null) Seq 1-2042691611: Processing: { Ans: , MgmtId:
> 6703101771911, via: 1, Ver: v1, Flags: 110,
> [{"com.cloud.agent.api.MaintainAnswer":{"willMigrate":true,"result":true,"wait":0}}]
> }
> 2013-08-07 20:14:17,038 DEBUG [agent.manager.AgentAttache]
> (AgentManager-Handler-7:null) Seq 1-2042691611: No more commands found
> 2013-08-07 20:14:17,038 DEBUG [agent.transport.Request]
> (catalina-exec-23:null) Seq 1-2042691611: Received: { Ans: , MgmtId:
> 6703101771911, via: 1, Ver: v1, Flags: 110, { MaintainAnswer } }
> 2013-08-07 20:14:17,039 INFO [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Host 1 is disconnecting with event ShutdownRequested
> 2013-08-07 20:14:17,051 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) The next status of agent 1is Disconnected, current
> status is Up
> 2013-08-07 20:14:17,051 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Deregistering link for 1 with state Disconnected
> 2013-08-07 20:14:17,051 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Remove Agent : 1
> 2013-08-07 20:14:17,052 DEBUG [agent.manager.ConnectedAgentAttache]
> (AgentTaskPool-4:null) Processing Disconnect.
> 2013-08-07 20:14:17,052 DEBUG [agent.manager.AgentAttache]
> (AgentTaskPool-4:null) Seq 1-2042691586: Sending disconnect to class
> com.cloud.network.security.SecurityGroupListener
> 2013-08-07 20:14:17,052 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.hypervisor.xen.discoverer.XcpServerDiscoverer_EnhancerByCloudStack_eccb8bca
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.deploy.DeploymentPlanningManagerImpl_EnhancerByCloudStack_b3901640
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.NetworkManagerImpl_EnhancerByCloudStack_c52127d3
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.storage.secondary.SecondaryStorageListener
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.hypervisor.vmware.manager.VmwareManagerImpl_EnhancerByCloudStack_5c9626cd
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.security.SecurityGroupListener
> 2013-08-07 20:14:17,053 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.storage.listener.StoragePoolMonitor
> 2013-08-07 20:14:17,054 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.vm.ClusteredVirtualMachineManagerImpl_EnhancerByCloudStack_f1e1d8d7
> 2013-08-07 20:14:17,054 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.storage.LocalStoragePoolListener
> 2013-08-07 20:14:17,054 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.SshKeysDistriMonitor
> 2013-08-07 20:14:17,054 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.router.VirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_8b534578
> 2013-08-07 20:14:17,054 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.SshKeysDistriMonitor
> 2013-08-07 20:14:17,056 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.router.VpcVirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_6370d9b
> 2013-08-07 20:14:17,056 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.storage.upload.UploadListener
> 2013-08-07 20:14:17,056 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.storage.download.DownloadListener
> 2013-08-07 20:14:17,056 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.agent.manager.AgentMonitor
> 2013-08-07 20:14:17,057 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.capacity.StorageCapacityListener
> 2013-08-07 20:14:17,057 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.capacity.ComputeCapacityListener
> 2013-08-07 20:14:17,058 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.network.NetworkUsageManagerImpl$DirectNetworkStatsListener
> 2013-08-07 20:14:17,058 DEBUG [cloud.network.NetworkUsageManagerImpl]
> (AgentTaskPool-4:null) Disconnected called on 1 with status Disconnected
> 2013-08-07 20:14:17,058 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-4:null) Sending Disconnect to listener:
> com.cloud.consoleproxy.ConsoleProxyListener
> 2013-08-07 20:14:17,061 DEBUG [cloud.host.Status] (AgentTaskPool-4:null)
> Transition:[Resource state = Enabled, Agent event = ShutdownRequested, Host
> id = 1, name = Rack1Pod1Host18]
> 2013-08-07 20:14:17,081 DEBUG [cloud.host.Status] (AgentTaskPool-4:null)
> Agent status update: [id = 1; name = Rack1Pod1Host18; old status = Up; event
> = ShutdownRequested; new status = Disconnected; old update count = 7; new
> update count = 8]
> 2013-08-07 20:14:17,082 DEBUG [agent.manager.ClusteredAgentManagerImpl]
> (AgentTaskPool-4:null) Notifying other nodes of to disconnect
> 2013-08-07 20:14:21,077 DEBUG [storage.secondary.SecondaryStorageManagerImpl]
> (secstorage-1:null) Zone 1 is not ready to launch secondary storage VM yet
> 2013-08-07 20:14:21,588 DEBUG [cloud.consoleproxy.ConsoleProxyManagerImpl]
> (consoleproxy-1:null) Zone 1 is not ready to launch console proxy yet
> 2013-08-07 20:15:31,485 INFO [agent.manager.AgentMonitor] (Thread-6:null)
> Found the following agents behind on ping: [2]
> 2013-08-07 20:15:31,489 DEBUG [cloud.host.Status] (Thread-6:null) Ping
> timeout for host 2, do invstigation
> 2013-08-07 20:15:31,494 INFO [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) Investigating why host 2 has disconnected with event
> PingTimeout
> 2013-08-07 20:15:31,496 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) checking if agent (2) is alive
> 2013-08-07 20:15:31,501 DEBUG [agent.transport.Request]
> (AgentTaskPool-5:null) Seq 2-55508996: Sending { Cmd , MgmtId:
> 6703101771911, via: 2, Ver: v1, Flags: 100011,
> [{"com.cloud.agent.api.CheckHealthCommand":{"wait":50}}] }
> 2013-08-07 20:15:31,549 DEBUG [agent.transport.Request]
> (AgentManager-Handler-8:null) Seq 2-55508996: Processing: { Ans: , MgmtId:
> 6703101771911, via: 2, Ver: v1, Flags: 10,
> [{"com.cloud.agent.api.CheckHealthAnswer":{"result":true,"details":"resource
> is alive","wait":0}}] }
> 2013-08-07 20:15:31,549 DEBUG [agent.transport.Request]
> (AgentTaskPool-5:null) Seq 2-55508996: Received: { Ans: , MgmtId:
> 6703101771911, via: 2, Ver: v1, Flags: 10, { CheckHealthAnswer } }
> 2013-08-07 20:15:31,549 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) Details from executing class
> com.cloud.agent.api.CheckHealthCommand: resource is alive
> 2013-08-07 20:15:31,550 DEBUG [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) agent (2) responded to checkHeathCommand, reporting
> that agent is Up
> 2013-08-07 20:15:31,550 INFO [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) The state determined is Up
> 2013-08-07 20:15:31,550 INFO [agent.manager.AgentManagerImpl]
> (AgentTaskPool-5:null) Agent is determined to be up and running
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira