Hi Bobby,

Can you please share the engine logs as well? They could help us understand what happened.
In the log excerpts you sent, I couldn't spot anything unusual.

Thanks in advance,

On Mon, Jun 29, 2020 at 10:40 PM Bobby <[email protected]> wrote:
> Hello,
>
> All 4 VMs on one of my oVirt cluster nodes shut down almost simultaneously for an unknown reason.
> Please help me find the root cause.
> Thanks.
>
> Please note that the host seems to be doing fine; it never crashed or hung, and I was able to migrate the VMs back to it later.
> Here is the exact timeline of all the related events, combined from the host and the VM(s):
>
> On oVirt host:
> /var/log/vdsm/vdsm.log:
> 2020-06-25 15:25:16,944-0500 WARN (qgapoller/3) [virt.periodic.VmDispatcher] could not run <function <lambda> at 0x7f4ed2f9f5f0> on ['e0257b06-28fd-4d41-83a9-adf1904d3622'] (periodic:289)
> 2020-06-25 15:25:19,203-0500 WARN (libvirt/events) [root] File: /var/lib/libvirt/qemu/channels/e0257b06-28fd-4d41-83a9-adf1904d3622.ovirt-guest-agent.0 already removed (fileutils:54)
> 2020-06-25 15:25:19,203-0500 WARN (libvirt/events) [root] File: /var/lib/libvirt/qemu/channels/e0257b06-28fd-4d41-83a9-adf1904d3622.org.qemu.guest_agent.0 already removed (fileutils:54)
>
> [root@athos log]# journalctl -u NetworkManager --since=today
> -- Logs begin at Wed 2020-05-20 22:07:33 CDT, end at Thu 2020-06-25 16:36:05 CDT. --
> Jun 25 15:25:18 athos NetworkManager[1600]: <info> [1593116718.1136] device (vnet0): state change: disconnected -> unmanaged (reason 'unmanaged', sys-iface-state: 'removed')
> Jun 25 15:25:18 athos NetworkManager[1600]: <info> [1593116718.1146] device (vnet0): released from master device SRV-VL
>
> /var/log/messages:
> Jun 25 15:25:18 athos kernel: SRV-VL: port 2(vnet0) entered disabled state
> Jun 25 15:25:18 athos NetworkManager[1600]: <info> [1593116718.1136] device (vnet0): state change: disconnected -> unmanaged (reason 'unmanaged', sys-iface-state: 'removed')
> Jun 25 15:25:18 athos NetworkManager[1600]: <info> [1593116718.1146] device (vnet0): released from master device SRV-VL
> Jun 25 15:25:18 athos libvirtd: 2020-06-25 20:25:18.122+0000: 2713: error : qemuMonitorIO:718 : internal error: End of file from qemu monitor
>
> /var/log/libvirt/qemu/aries.log:
> 2020-06-25T20:25:28.353975Z qemu-kvm: terminating on signal 15 from pid 2713 (/usr/sbin/libvirtd)
> 2020-06-25 20:25:28.584+0000: shutting down, reason=shutdown
>
>
> =============================================================================================
> On the first VM affected (same thing on the others):
> /var/log/ovirt-guest-agent/ovirt-guest-agent.log:
> MainThread::INFO::2020-06-25 15:25:20,270::ovirt-guest-agent::104::root::Stopping oVirt guest agent
> CredServer::INFO::2020-06-25 15:25:20,626::CredServer::262::root::CredServer has stopped.
> MainThread::INFO::2020-06-25 15:25:21,150::ovirt-guest-agent::78::root::oVirt guest agent is down.
>
>
> =============================================================================================
> Package versions installed:
> Host OS version: CentOS 7.7.1908:
> ovirt-hosted-engine-ha-2.3.5-1.el7.noarch
> ovirt-provider-ovn-driver-1.2.22-1.el7.noarch
> ovirt-release43-4.3.6-1.el7.noarch
> ovirt-imageio-daemon-1.5.2-0.el7.noarch
> ovirt-vmconsole-1.0.7-2.el7.noarch
> ovirt-imageio-common-1.5.2-0.el7.x86_64
> ovirt-engine-sdk-python-3.6.9.1-1.el7.noarch
> ovirt-vmconsole-host-1.0.7-2.el7.noarch
> ovirt-host-4.3.4-1.el7.x86_64
> libvirt-4.5.0-23.el7_7.1.x86_64
> libvirt-daemon-4.5.0-23.el7_7.1.x86_6
> qemu-kvm-ev-2.12.0-33.1.el7.x86_64
> qemu-kvm-common-ev-2.12.0-33.1.el7.x86_64
>
> On guest VM:
> ovirt-guest-agent-1.0.13-1.el6.noarch
> qemu-guest-agent-0.12.1.2-2.491.el6_8.3.x86_64
> _______________________________________________
> Users mailing list -- [email protected]
> To unsubscribe send an email to [email protected]
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
> List Archives: https://lists.ovirt.org/archives/list/[email protected]/message/LGQSLTNG37VZDJM2GYXRVHPSLWOLOKSC/
>

--
Lev Veyde
Senior Software Engineer, RHCE | RHCVA | MCITP
Red Hat Israel <https://www.redhat.com>
[email protected] | [email protected]
<https://red.ht/sig>
TRIED. TESTED. TRUSTED. <https://redhat.com/trusted>

