Ryan, A null pointer exception is most definitely a bug. Can you file a jira ticket?
thanks, Daan On Thu, Oct 10, 2013 at 5:01 PM, Ryan James <r...@colocateusa.net> wrote: > We just upgraded to CloudStack 4.2 (from 4.0.2) and now our Xen Cluster will > not stay connected and the host are in alert states. > > > Here is a snip it out of the management-server.log > > > 2013-10-10 09:22:13,995 DEBUG [cloud.capacity.CapacityManagerImpl] > (AgentTaskPool-1:null) Found 6 VMs on host 4 > > 2013-10-10 09:22:14,003 ERROR [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Monitor ComputeCapacityListener says there is an error > in the connect process for 4 due to null > > java.lang.NullPointerException > > at > com.cloud.capacity.CapacityManagerImpl.updateCapacityForHost(CapacityManagerImpl.java:543) > > at > com.cloud.utils.component.ComponentInstantiationPostProcessor$InterceptorDispatcher.intercept(ComponentInstantiationPostProcessor.java:125) > > at > com.cloud.capacity.ComputeCapacityListener.processConnect(ComputeCapacityListener.java:78) > > at > com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:587) > > at > com.cloud.agent.manager.AgentManagerImpl.handleDirectConnectAgent(AgentManagerImpl.java:1479) > > at > com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1762) > > at > com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1924) > > at > com.cloud.agent.manager.AgentManagerImpl$SimulateStartTask.run(AgentManagerImpl.java:1130) > > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > > at java.lang.Thread.run(Thread.java:679) > > 2013-10-10 09:22:14,004 INFO [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Host 4 is disconnecting with event AgentDisconnected > > 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) The next status of agent 4is Alert, current status is > Connecting > > 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Deregistering link for 4 with state Alert > > 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Remove Agent : 4 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.DirectAgentAttache] > (AgentTaskPool-1:null) Processing disconnect 4 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.hypervisor.xen.discoverer.XcpServerDiscoverer_EnhancerByCloudStack_434ade97 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.deploy.DeploymentPlanningManagerImpl_EnhancerByCloudStack_a0f690d > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.NetworkManagerImpl_EnhancerByCloudStack_1ba07aa0 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.storage.secondary.SecondaryStorageListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.hypervisor.vmware.manager.VmwareManagerImpl_EnhancerByCloudStack_b315799a > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.security.SecurityGroupListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.storage.listener.StoragePoolMonitor > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.vm.ClusteredVirtualMachineManagerImpl_EnhancerByCloudStack_48612ba4 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.storage.LocalStoragePoolListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.SshKeysDistriMonitor > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.router.VirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_e1d29845 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.SshKeysDistriMonitor > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.router.VpcVirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_5cb66068 > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.storage.upload.UploadListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.storage.download.DownloadListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.agent.manager.AgentMonitor > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.capacity.StorageCapacityListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.capacity.ComputeCapacityListener > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.network.NetworkUsageManagerImpl$DirectNetworkStatsListener > > 2013-10-10 09:22:14,009 DEBUG [cloud.network.NetworkUsageManagerImpl] > (AgentTaskPool-1:null) Disconnected called on 4 with status Alert > > 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] > (AgentTaskPool-1:null) Sending Disconnect to listener: > com.cloud.consoleproxy.ConsoleProxyListener > > 2013-10-10 09:22:14,014 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) > Transition:[Resource state = Enabled, Agent event = AgentDisconnected, Host > id = 4, name = c14-c1-3] > > 2013-10-10 09:22:14,026 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) > Agent status update: [id = 4; name = c14-c1-3; old status = Connecting; event > = AgentDisconnected; new status = Alert; old update count = 2314; new update > count = 2315] > > 2013-10-10 09:22:14,026 DEBUG [agent.manager.ClusteredAgentManagerImpl] > (AgentTaskPool-1:null) Notifying other nodes of to disconnect > > 2013-10-10 09:22:14,029 WARN [cloud.resource.ResourceManagerImpl] > (AgentTaskPool-1:null) Unable to connect due to > > com.cloud.utils.exception.CloudRuntimeException: Unable to connect 4 > > at > com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:606) > > at > com.cloud.agent.manager.AgentManagerImpl.handleDirectConnectAgent(AgentManagerImpl.java:1479) > > at > com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1762) > > at > com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1924) > > at > com.cloud.agent.manager.AgentManagerImpl$SimulateStartTask.run(AgentManagerImpl.java:1130) > > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > > at java.lang.Thread.run(Thread.java:679) > > Caused by: java.lang.NullPointerException > > at > com.cloud.capacity.CapacityManagerImpl.updateCapacityForHost(CapacityManagerImpl.java:543) > > at > com.cloud.utils.component.ComponentInstantiationPostProcessor$InterceptorDispatcher.intercept(ComponentInstantiationPostProcessor.java:125) > > at > com.cloud.capacity.ComputeCapacityListener.processConnect(ComputeCapacityListener.java:78) > > at > com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:587) > > ... 7 more > > 2013-10-10 09:22:14,030 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) > Transition:[Resource state = Enabled, Agent event = AgentDisconnected, Host > id = 4, name = c14-c1-3] > > 2013-10-10 09:22:14,041 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) > Agent status update: [id = 4; name = c14-c1-3; old status = Alert; event = > AgentDisconnected; new status = Alert; old update count = 2315; new update > count = 2316] > > > I have not been able to find any information online about this error or how > to get the cluster to connect again. The Cluster is up to date on Hot Fixes > and was working fine before the upgrade. > > The cluster is a 3 node cluster with fiber luns. > > Any help on this is greatly appreciated. > > -- > Ryan James > ColocateUSA > http://www.colocateUSA.net > r...@colocateusa.net