Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 12:37:44 -0400 Robert wrote: RS> All 3 hosts that had down ha-agents were down again, so I'm guessing RS> that's the issue.. As an experiment, I migrated the engine VM to one of the hosts with a working ha-agent process, and I'm no longer getting getting these emails. So the

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 16:00:27 +0100 Simone wrote: ST> And indeed ares was host 1 so when it failed it was correctly trying to ST> get lock for host 1 but it seams that previously it acquired a lock as ST> different host. ST> Could you please check ST> grep host_id /etc/ovirt-hosted-engine/hosted-en

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Simone Tiraboschi
On Thu, Oct 29, 2015 at 3:47 PM, Robert Story wrote: > On Thu, 29 Oct 2015 15:40:23 +0100 Simone wrote: > ST> Here the host IDs seam coherent. > ST> Can you please specify the name of the hosts where you took the logs in > ST> your first log archive (complaining host and engine host) ? > > Hmm..

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 15:40:23 +0100 Simone wrote: ST> Here the host IDs seam coherent. ST> Can you please specify the name of the hosts where you took the logs in ST> your first log archive (complaining host and engine host) ? Hmm.. I know the complaining host was posedion, and I'm pretty sure the

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Simone Tiraboschi
On Thu, Oct 29, 2015 at 2:52 PM, Robert Story wrote: > On Thu, 29 Oct 2015 14:08:22 +0100 Simone wrote: > ST> it seams that two hosts are fighting fir the same host ID: > ST> > ST> MainThread::INFO::2015-10-27 > ST> > 09:14:56,764::hosted_engine::562::ovirt_hosted_engine_ha.agent.hosted_engine.Ho

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 14:08:22 +0100 Simone wrote: ST> it seams that two hosts are fighting fir the same host ID: ST> ST> MainThread::INFO::2015-10-27 ST> 09:14:56,764::hosted_engine::562::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock) ST> Ensuring lease for lockspac

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Simone Tiraboschi
Hi Robert, it seams that two hosts are fighting fir the same host ID: MainThread::INFO::2015-10-27 09:14:56,764::hosted_engine::562::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock) Ensuring lease for lockspace hosted-engine, host id 1 is acquired (file: /var/run/vdsm

Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Tue, 27 Oct 2015 09:45:28 -0400 Robert wrote: RS> I have oVirt 3.5.4 on CentOS 7.1 hosts, and everyone once in a while RS> one of my hosts starts sending me the 4 engine status messages above RS> about every 10-15 minutes. I upgraded the engine and all hosts to 3.5.5, and then 2 hosts started s

[ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-27 Thread Robert Story
Hi, I have oVirt 3.5.4 on CentOS 7.1 hosts, and everyone once in a while one of my hosts starts sending me the 4 engine status messages above about every 10-15 minutes. It looks like the ha broker on the host currently running is having issues (already tried restarting it once. I've attached a t