[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-19 Thread Strahil
>> >> 1.2 All bricks healed (gluster volume heal data info summary) and no >> >> split-brain >> > >> >   >> >   >> > gluster volume heal data info >> >   >> > Brick node-msk-gluster203:/opt/gluster/data >> > Status: Connected >> > Number of entries: 0 >> >   >> > Brick

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-19 Thread Николаев Алексей
Thx for your help, Strahil! Hmmm, I see DNS resolution failed in hostname without FQDN. I'll try to fix it. 19.03.2019, 09:43, "Strahil" :Hi Alexei,>> 1.2 All bricks healed (gluster volume heal data info summary) and no split-brain>>  >  > gluster volume heal data info>  > Brick

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-19 Thread Strahil
Hi Alexei, >> 1.2 All bricks healed (gluster volume heal data info summary) and no >> split-brain > >   >   > gluster volume heal data info >   > Brick node-msk-gluster203:/opt/gluster/data > Status: Connected > Number of entries: 0 >   > Brick node-msk-gluster205:/opt/gluster/data > > > > >

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-18 Thread Николаев Алексей
Thx for answer!   18.03.2019, 14:52, "Strahil Nikolov" : Hi Alexei, In order to debug it check the following: 1. Check gluster:1.1 All bricks up ? All peers up. Gluster version is 3.12.15 [root@node-msk-gluster203 ~]# gluster peer statusNumber of Peers: 2 Hostname: node-msk-gluster205.Uuid:

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-18 Thread Strahil Nikolov
Hi Alexei, In order to debug it check the following: 1. Check gluster:1.1 All bricks up ?1.2 All bricks healed (gluster volume heal data info summary) and no split-brain 2. Go to the problematic host and check the mount point is there2.1. Check permissions (should be vdsm:kvm) and fix with

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-18 Thread Николаев Алексей
Hi all! I have a very similar problem after update one of the two nodes to version 4.3.1. This node77-02 lost connection to gluster volume named DATA, but not to volume with hosted engine.  node77-02 /var/log/messages Mar 18 13:40:00 node77-02 journal: ovirt-ha-agent

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-17 Thread Strahil
Hi Simone, I have noticed that my Engine's root disk is 'vda' just in standalone KVM. I have the feeling that was not the case before. Can someone check a default engine and post the output of lsblk ? Thanks in advance. Best Regards, Strahil NikolovOn Mar 15, 2019 12:46, Strahil Nikolov

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-15 Thread Strahil Nikolov
On Fri, Mar 15, 2019 at 8:12 AM Strahil Nikolov wrote: Ok, I have managed to recover again and no issues are detected this time.I guess this case is quite rare and nobody has experienced that. >Hi,>can you please explain how you fixed it? I have set again to global maintenance, defined the

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-15 Thread Simone Tiraboschi
On Fri, Mar 15, 2019 at 8:12 AM Strahil Nikolov wrote: > Ok, > > I have managed to recover again and no issues are detected this time. > I guess this case is quite rare and nobody has experienced that. > Hi, can you please explain how you fixed it? > > Best Regards, > Strahil Nikolov > > В

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-15 Thread Strahil Nikolov
Ok, I have managed to recover again and no issues are detected this time.I guess this case is quite rare and nobody has experienced that. Best Regards,Strahil Nikolov В сряда, 13 март 2019 г., 13:03:38 ч. Гринуич+2, Strahil Nikolov написа: Dear Simone, it seems that there is some

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-13 Thread Strahil Nikolov
Dear Simone, it seems that there is some kind of problem ,as the OVF got updated with wrong configuration:[root@ovirt2 ~]# ls -l

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-12 Thread Strahil Nikolov
Dear Simone, it should be 60 min , but I have checked several hours after that and it didn't update it. [root@engine ~]# engine-config -g OvfUpdateIntervalInMinutes OvfUpdateIntervalInMinutes: 60 version: general How can i make a backup of the VM config , as you have noticed the local copy in

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-12 Thread Simone Tiraboschi
On Tue, Mar 12, 2019 at 9:48 AM Strahil Nikolov wrote: > Latest update - the system is back and running normally. > After a day (or maybe a little more), the OVF is OK: > Normally it should try every 60 minutes. Can you please execute engine-config -g OvfUpdateIntervalInMinutes on your engine

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-12 Thread Strahil Nikolov
Latest update - the system is back and running normally.After a day (or maybe a little more), the OVF is OK: [root@ovirt1 ~]# ls -l

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-09 Thread Strahil Nikolov
Hello again, Latest update: the engine is up and running (or at least the login portal). [root@ovirt1 ~]# hosted-engine --check-livelinessHosted Engine is up! I have found online the xml for the network: [root@ovirt1 ~]# cat ovirtmgmt_net.xml   vdsm-ovirtmgmt      Sadly, I had to create a

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-09 Thread Strahil Nikolov
Hi Simone, and thanks for your help. So far I found out that there is some problem with the local copy of the HostedEngine config (see attached part of vdsm.log). I have found out an older xml configuration (in an old vdsm.log) and defining the VM works, but powering it on reports: [root@ovirt1

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-08 Thread Simone Tiraboschi
On Fri, Mar 8, 2019 at 12:49 PM Strahil Nikolov wrote: > Hi Simone, > > sadly it seems that starting the engine from an alternative config is not > working. > Virsh reports that the VM is defined , but shut down and the dumpxml > doesn't show any disks - maybe this is normal for oVirt (I have

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-07 Thread Simone Tiraboschi
On Thu, Mar 7, 2019 at 2:54 PM Strahil Nikolov wrote: > > > > >The OVF_STORE volume is going to get periodically recreated by the engine > so at least you need a running engine. > > >In order to avoid this kind of issue we have two OVF_STORE disks, in your > case: > >

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-07 Thread Strahil Nikolov
>The OVF_STORE volume is going to get periodically recreated by the engine so >at least you need a running engine. >In order to avoid this kind of issue we have two OVF_STORE disks, in your case: >MainThread::INFO::2019-03-06

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-07 Thread Simone Tiraboschi
On Thu, Mar 7, 2019 at 9:19 AM Strahil Nikolov wrote: > Hi Simone, > > I think I found the problem - ovirt-ha cannot extract the file containing > the needed data . > In my case it is completely empty: > > > [root@ovirt1 ~]# ll >

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-07 Thread Strahil Nikolov
Hi Simone, I think I found the problem - ovirt-ha cannot extract the file containing the needed data .In my case it is completely empty: [root@ovirt1 ~]# ll

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-06 Thread Simone Tiraboschi
On Wed, Mar 6, 2019 at 3:09 PM Strahil Nikolov wrote: > Hi Simone, > > thanks for your reply. > > >Are you really sure that the issue was on the ping? > >on storage errors the broker restart itself and while the broker is > restarting >the agent cannot ask the broker to trigger the gateway

[ovirt-users] Re: Ovirt 4.3.1 problem with HA agent

2019-03-05 Thread Simone Tiraboschi
On Wed, Mar 6, 2019 at 6:13 AM Strahil wrote: > Hi guys, > > After updating to 4.3.1 I had an issue where the ovirt-ha-broker was > complaining that it couldn't ping the gateway. > Are you really sure that the issue was on the ping? on storage errors the broker restart itself and while the