Hi Anton, I am not sure if changing this value would fix the issue. Defaults are pretty high. For example vdsHeartbeatInSeconds=30seconds, vdsTimeout=180seconds, vdsConnectionTimeout=20seconds.
Do you still have relevant logs from the affected hosts: * /var/logs/vdsm/vdsm.log* * /var/logs/vdsm/supervdsm.log* Please look for any jsonrpc errors ie. write/read errors or (connection) timeouts. Storage related warnings/errors might also be relevant. Plus system logs if possible: *journalctl -f /usr/share/vdsm/vdsmd* *journalctl -f /usr/sbin/libvirtd* In order to get system logs from particular time period please combine it with the following example using -S -U options: *journalctl -S "2020-01-12 07:00:00" -U "2020-01-12 07:15:00" * I haven't a clue what to look there for besides any warnings/errors or anything else that seems .... unusual. Artur On Thu, Sep 17, 2020 at 8:09 AM Anton Louw via Users <email@example.com> wrote: > > > Hi Everybody, > > > > Did some digging around, and saw a few things regarding > “vdsHeartbeatInSeconds” > > I had a look at the properties file located at > /etc/ovirt-engine/engine-config/engine-config.properties, and do not see an > entry for “vdsHeartbeatInSeconds.type=Integer”. > > Seeing as these data centers are geographically split, could the > “vdsHeartbeatInSeconds” potentially be the issue? Is it safe to increase this > value after I add “vdsHeartbeatInSeconds.type=Integer” into my > engine-config.properties file? > > > > Thanks > > > > *Anton Louw* > *Cloud Engineer: Storage and Virtualization* at *Vox* > ------------------------------ > *T:* 087 805 0000 | *D:* 087 805 1572 > *M:* N/A > *E:* anton.l...@voxtelecom.co.za > *A:* Rutherford Estate, 1 Scott Street, Waverley, Johannesburg > www.vox.co.za > > [image: F] <https://www.facebook.com/voxtelecomZA> > [image: T] <https://www.twitter.com/voxtelecom> > [image: I] <https://www.instagram.com/voxtelecomza/> > [image: L] <https://www.linkedin.com/company/voxtelecom> > [image: Y] <https://www.youtube.com/user/VoxTelecom> > > *From:* Anton Louw via Users <firstname.lastname@example.org> > *Sent:* 16 September 2020 09:01 > *To:* email@example.com > *Subject:* [ovirt-users] Random hosts disconnects > > > > > > Hi All, > > > > I have a strange issue in my oVirt environment. I currently have a > standalone manager which is running in VMware. In my oVirt environment, I > have two Data Centers. The manager is currently sitting on the same subnet > as DC1. Randomly, hosts in DC2 will say “Not Responding” and then 2 seconds > later, the hosts will activate again. > > > > The strange thing is, when the manager was sitting on the same subnet as > DC2, hosts in DC1 will randomly say “Not Responding” > > > > I have tried going through the logs, but I cannot see anything out of the > ordinary regarding why the hosts would drop connection. I have attached the > engine.log for anybody that would like to do a spot check. > > > > Thanks > > > > *Anton Louw* > > *Cloud Engineer: Storage and Virtualization* at *Vox* > ------------------------------ > > *T:* 087 805 0000 | *D:* 087 805 1572 > *M:* N/A > *E:* anton.l...@voxtelecom.co.za > *A:* Rutherford Estate, 1 Scott Street, Waverley, Johannesburg > www.vox.co.za > > > > [image: F] <https://www.facebook.com/voxtelecomZA> > > > > [image: T] <https://www.twitter.com/voxtelecom> > > > > [image: I] <https://www.instagram.com/voxtelecomza> > > > > [image: L] <https://www.linkedin.com/company/voxtelecom> > > > > [image: Y] <https://www.youtube.com/user/VoxTelecom> > > > > > > [image: #VoxBrand] > <https://www.vox.co.za/fibre/fibre-to-the-home/?prod=HOME> > > > *Disclaimer* > > The contents of this email are confidential to the sender and the intended > recipient. Unless the contents are clearly and entirely of a personal > nature, they are subject to copyright in favour of the holding company of > the Vox group of companies. Any recipient who receives this email in error > should immediately report the error to the sender and permanently delete > this email from all storage devices. > > This email has been scanned for viruses and malware, and may have been > automatically archived by *Mimecast Ltd*, an innovator in Software as a > Service (SaaS) for business. Providing a *safer* and *more useful* place > for your human generated data. Specializing in; Security, archiving and > compliance. To find out more Click Here > <https://www.voxtelecom.co.za/security/mimecast/?prod=Enterprise>. > > > > _______________________________________________ > Users mailing list -- firstname.lastname@example.org > To unsubscribe send an email to users-le...@ovirt.org > Privacy Statement: https://www.ovirt.org/privacy-policy.html > oVirt Code of Conduct: > https://www.ovirt.org/community/about/community-guidelines/ > List Archives: > https://email@example.com/message/EJL246IPBGEHIQ5KUWG2APSTQWFE7VFK/ > -- Artur Socha Senior Software Engineer, RHV Red Hat
_______________________________________________ Users mailing list -- firstname.lastname@example.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://email@example.com/message/QH5PLNZ5KJI7MKV4LMRK3PYGVGWB7E5H/