[Users] PROBLEM - host is restarted automatically by ovirt and becomes unresponsive
Good afternoon,

I'm having a problem with my hosts: at least once a week, the host that has all the VMs running restarts and becomes unresponsive.

After the oVirt engine sends the restart to the iLO, the host becomes unresponsive and the fans on the enclosure spin up to full speed.

The only way to get the blade back up is to stop it using iLO or the Onboard Administrator, remove it from the enclosure, put it back in, and then issue the start from the oVirt GUI. Using stop/start on the iLO or Onboard Administrator powers the blade up, but it stays unresponsive and doesn't show any image or boot POST messages.

Has anyone else seen this problem before?

BLADE ENCLOSURE: HP BladeSystem c3000
BLADES: HP BL460c G6
OS: CentOS 6.4 (64 bits)
OVIRT: 3.2

engine.log:

2013-08-07 14:38:47,256 INFO [org.ovirt.engine.core.bll.storage.SetStoragePoolStatusCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] Running command: SetStoragePoolStatusCommand internal: true. Entities affected : ID: 06951dba-556b-4323-9356-819c9160fe8e Type: StoragePool
2013-08-07 14:38:47,257 ERROR [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-8) vds::refreshVdsStats Failed getVdsStats, vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, error = VDSNetworkException: java.net.NoRouteToHostException: No route to host
2013-08-07 14:38:47,263 WARN [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-8) ResourceManager::refreshVdsRunTimeInfo::Failed to refresh VDS , vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, VDS Network Error, continuing.
java.net.NoRouteToHostException: No route to host
2013-08-07 14:38:50,252 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] IrsBroker::Failed::GetStoragePoolInfoVDS due to: NoRouteToHostException: No route to host
2013-08-07 14:38:50,253 WARN [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-10) [2b4cb7c5] ResourceManager::refreshVdsRunTimeInfo::Failed to refresh VDS , vds = 44d77dcb-b775-4aef-ae59-1dea8d5c691a : blade5, VDS Network Error, continuing.
java.net.NoRouteToHostException: No route to host
2013-08-07 14:38:53,252 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] Irs placed on server 44d77dcb-b775-4aef-ae59-1dea8d5c691a failed. Proceed Failover
2013-08-07 14:38:53,254 ERROR [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-4) VDS::handleNetworkException Server failed to respond, vds_id = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, vds_name = blade5, error = java.net.NoRouteToHostException: No route to host
2013-08-07 14:38:53,296 INFO [org.ovirt.engine.core.bll.VdsEventListener] (pool-3-thread-47) ResourceManager::vdsNotResponding entered for Host 44d77dcb-b775-4aef-ae59-1dea8d5c691a, 192.168.10.25
2013-08-07 14:38:53,299 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] hostFromVds::selectedVds - blade6, spmStatus Free, storage pool VI-DataCenter
2013-08-07 14:38:53,308 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] SPM Init: could not find reported vds or not up - pool: VI-DataCenter vds_spm_id: 1
2013-08-07 14:38:53,346 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] SPM selection - vds seems as spm blade5
2013-08-07 14:38:53,355 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-7) [3b761f63] spm vds is non responsive, stopping spm selection.
2013-08-07 14:38:53,438 INFO [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) Using Host blade6 from CLUSTER as proxy to execute Restart command on Host blade5
2013-08-07 14:38:53,438 INFO [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) Executing Status Power Management command, Proxy Host:blade6, Agent:ilo, Target Host:blade5, Management IP:ilo5.vi.pt, User:Administrator, Options:secure=true
2013-08-07 14:38:53,457 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) START, FenceVdsVDSCommand(HostName = blade6, HostId = 2530f498-6029-496a-ab42-924ca2e3eb7f, targetVdsId = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, action = Status, ip = ilo5.vi.pt, port = , type = ilo, user = Administrator, password = **, options = 'secure=true'), log id: 41a729f3
2013-08-07 14:39:02,533 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) FINISH, FenceVdsVDSCommand, return: Test Succeeded, Host Status is: on, log id: 41a729f3
2013-08-07 14:39:02,541 INFO [org.ovirt.engine.core.bll.VdsNotRespondingTreatmentCommand] (pool-3-thread-47) Running command: VdsNotRespondingTreatmentCommand internal: true. Entities affected : ID: 44d77dcb-b775-4aef-ae59-1dea8d5c691a Type: VDS
2013-08-07
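For anyone trying to follow the sequence in an excerpt like this, here is a rough sketch of a script that pulls only the fencing-related entries out of engine.log text, so the order "host stops responding, proxy chosen, status check, restart" is easier to read. The function names (`parse_entries`, `fence_timeline`) and the regular expression are my own assumptions based on the lines shown in this thread, not anything shipped with oVirt, and the set of class names to keep is just the ones that appear in this log.

```python
import re

# Assumed layout of an entry, based on the excerpt in this thread:
# "<date> <time>,<ms> <LEVEL> [<logger class>] <rest of message>"
ENTRY_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+\[(?P<logger>[^\]]+)\]\s+(?P<msg>.*)$"
)

# Logger class names seen in this thread that relate to fencing.
FENCE_CLASSES = (
    "FenceExecutor",
    "FenceVdsVDSCommand",
    "VdsNotRespondingTreatmentCommand",
)


def parse_entries(text):
    """Return one dict (ts, level, logger, msg) per log entry."""
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        match = ENTRY_RE.match(line)
        if match:
            entries.append(match.groupdict())
        elif entries and line:
            # Continuation line (e.g. an exception message):
            # attach it to the entry it belongs to.
            entries[-1]["msg"] += " " + line
    return entries


def fence_timeline(entries):
    """Keep only the entries logged by the fencing-related classes."""
    return [
        e for e in entries
        if e["logger"].rsplit(".", 1)[-1] in FENCE_CLASSES
    ]


# Three lines copied from the log in this thread.
SAMPLE = """\
2013-08-07 14:38:53,254 ERROR [org.ovirt.engine.core.vdsbroker.VdsManager] (DefaultQuartzScheduler_Worker-4) VDS::handleNetworkException Server failed to respond, vds_id = 44d77dcb-b775-4aef-ae59-1dea8d5c691a, vds_name = blade5, error = java.net.NoRouteToHostException: No route to host
2013-08-07 14:38:53,438 INFO [org.ovirt.engine.core.bll.FenceExecutor] (pool-3-thread-47) Using Host blade6 from CLUSTER as proxy to execute Restart command on Host blade5
2013-08-07 14:39:02,533 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (pool-3-thread-47) FINISH, FenceVdsVDSCommand, return: Test Succeeded, Host Status is: on, log id: 41a729f3
"""

if __name__ == "__main__":
    for entry in fence_timeline(parse_entries(SAMPLE)):
        print(entry["ts"], entry["level"], entry["msg"])
```

To use it on a real log, read the file contents into a string and pass that to `parse_entries` instead of `SAMPLE`. Filtering on the class name rather than on message text is a deliberate choice here, since the message wording varies between entries.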
Re: [Users] PROBLEM - host is restarted automatically by ovirt and becomes unresponsive
- Original Message -
| From: Ricardo Esteves maverick...@gmail.com
| To: Users@ovirt.org
| Sent: Wednesday, August 7, 2013 5:43:26 PM
| Subject: [Users] PROBLEM - host is restarted automatically by ovirt and becomes unresponsive