Thanks Martin. As you suggested I updated hosted-engine.conf with correct host_id values and restarted ovirt-ha-agent services on both hosts and now I run into the problem with status "unknown-stale-data" :( And second host still doesn't looks as capable to run HE.
Should I stop HE VM, bring down ovirt-ha-agents and reinitialize-lockspace and start ovirt-ha-agents again? Regards, Artem On Mon, Feb 19, 2018 at 6:45 PM, Martin Sivak <msi...@redhat.com> wrote: > Hi Artem, > > just a restart of ovirt-ha-agent services should be enough. > > Best regards > > Martin Sivak > > On Mon, Feb 19, 2018 at 4:40 PM, Artem Tambovskiy > <artem.tambovs...@gmail.com> wrote: > > Ok, understood. > > Once I set correct host_id on both hosts how to take changes in force? > With > > minimal downtime? Or i need reboot both hosts anyway? > > > > Regards, > > Artem > > > > 19 февр. 2018 г. 18:18 пользователь "Simone Tiraboschi" > > <stira...@redhat.com> написал: > > > >> > >> > >> On Mon, Feb 19, 2018 at 4:12 PM, Artem Tambovskiy > >> <artem.tambovs...@gmail.com> wrote: > >>> > >>> > >>> Thanks a lot, Simone! > >>> > >>> This is clearly shows a problem: > >>> > >>> [root@ov-eng ovirt-engine]# sudo -u postgres psql -d engine -c 'select > >>> vds_name, vds_spm_id from vds' > >>> vds_name | vds_spm_id > >>> -----------------+------------ > >>> ovirt1.local | 2 > >>> ovirt2.local | 1 > >>> (2 rows) > >>> > >>> While hosted-engine.conf on ovirt1.local have host_id=1, and > ovirt2.local > >>> host_id=2. So totally opposite values. > >>> So how to get this fixed in the simple way? Update the engine DB? > >> > >> > >> I'd suggest to manually fix /etc/ovirt-hosted-engine/hosted-engine.conf > on > >> both the hosts > >> > >>> > >>> > >>> Regards, > >>> Artem > >>> > >>> On Mon, Feb 19, 2018 at 5:37 PM, Simone Tiraboschi < > stira...@redhat.com> > >>> wrote: > >>>> > >>>> > >>>> > >>>> On Mon, Feb 19, 2018 at 12:13 PM, Artem Tambovskiy > >>>> <artem.tambovs...@gmail.com> wrote: > >>>>> > >>>>> Hello, > >>>>> > >>>>> Last weekend my cluster suffered form a massive power outage due to > >>>>> human mistake. > >>>>> I'm using SHE setup with Gluster, I managed to bring the cluster up > >>>>> quickly, but once again I have a problem with duplicated host_id > >>>>> (https://bugzilla.redhat.com/show_bug.cgi?id=1543988) on second > host and due > >>>>> to this second host is not capable to run HE. > >>>>> > >>>>> I manually updated file hosted_engine.conf with correct host_id and > >>>>> restarted agent & broker - no effect. Than I rebooted the host > itself - > >>>>> still no changes. How to fix this issue? > >>>> > >>>> > >>>> I'd suggest to run this command on the engine VM: > >>>> sudo -u postgres scl enable rh-postgresql95 -- psql -d engine -c > >>>> 'select vds_name, vds_spm_id from vds' > >>>> (just sudo -u postgres psql -d engine -c 'select vds_name, vds_spm_id > >>>> from vds' if still on 4.1) and check > >>>> /etc/ovirt-hosted-engine/hosted-engine.conf on all the involved host. > >>>> Maybe you can also have a leftover configuration file on undeployed > >>>> host. > >>>> > >>>> When you find a conflict you should manually bring down sanlock > >>>> In doubt a reboot of both the hosts will solve for sure. > >>>> > >>>> > >>>>> > >>>>> > >>>>> Regards, > >>>>> Artem > >>>>> > >>>>> _______________________________________________ > >>>>> Users mailing list > >>>>> Users@ovirt.org > >>>>> http://lists.ovirt.org/mailman/listinfo/users > >>>>> > >>>> > >>> > >>> > >>> > >>> _______________________________________________ > >>> Users mailing list > >>> Users@ovirt.org > >>> http://lists.ovirt.org/mailman/listinfo/users > >>> > >> > > > > _______________________________________________ > > Users mailing list > > Users@ovirt.org > > http://lists.ovirt.org/mailman/listinfo/users > > >
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users