Hi,

We deployed openvswitch 1.4.6 on CentOS 6.4 in our production environment. OVS is used to bridge KVM VMs.

Recently, we found that some VMs randomly lose their network connections. When this happens, we log in to the affected VMs through the VNC console and find the ARP tables full of incomplete entries. It seems that ARP packets going in and out of the VMs are being dropped.

We checked the ovs-vswitchd logs from the times when the disconnections happened. Most of the messages complain about high CPU usage by ovs-vswitchd and about "dropped log messages". (Samples are attached at the end of this email.)

I'm writing to ask for your help:

1. Did the "dropped log messages" cause the VMs to lose their network connections?
2. Is there any quick fix for this issue other than replacing the OVS kernel module? We cannot reboot the hypervisor.

Many thanks,
Jian Qiu

Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127360|poll_loop|WARN|wakeup due to [POLLIN] on fd 36 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127361|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127362|poll_loop|WARN|Dropped 613 log messages in last 1 seconds (most recently, 1 seconds ago) due to excessive rate
Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127363|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127364|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
Aug 19 18:49:24 db-cloud02 ovs-vswitchd: 3127365|poll_loop|WARN|Dropped 1046 log messages in last 1 seconds (most recently, 1 seconds ago) due to excessive rate
Aug 19 18:49:24 db-cloud02 ovs-vswitchd: 3127366|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
Aug 19 18:49:24 db-cloud02 ovs-vswitchd: 3127367|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)
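
To quantify how many messages were suppressed around each disconnection, the log can be tallied with a small script. This is only a rough sketch: the function name `summarize` and the two regular expressions are my own assumptions, derived solely from the message format in the sample above. It is not part of OVS and may need adjusting for other message variants.

```python
import re

# Assumed patterns, based only on the sample log lines in this email.
# Rate-limit notice: "Dropped 613 log messages in last 1 seconds ..."
DROPPED_RE = re.compile(r"Dropped (\d+) log messages in last (\d+) seconds")
# poll_loop wakeup warning: "... (52% CPU usage)"
CPU_RE = re.compile(r"\((\d+)% CPU usage\)")


def summarize(lines):
    """Count suppressed messages, wakeup warnings, and peak reported CPU usage."""
    dropped = 0
    wakeups = 0
    max_cpu = 0
    for line in lines:
        match = DROPPED_RE.search(line)
        if match:
            dropped += int(match.group(1))
            continue
        match = CPU_RE.search(line)
        if match:
            wakeups += 1
            max_cpu = max(max_cpu, int(match.group(1)))
    return {"dropped": dropped, "wakeups": wakeups, "max_cpu": max_cpu}


# Four lines copied from the sample above.
SAMPLE = [
    "Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127361|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)",
    "Aug 19 18:49:23 db-cloud02 ovs-vswitchd: 3127362|poll_loop|WARN|Dropped 613 log messages in last 1 seconds (most recently, 1 seconds ago) due to excessive rate",
    "Aug 19 18:49:24 db-cloud02 ovs-vswitchd: 3127365|poll_loop|WARN|Dropped 1046 log messages in last 1 seconds (most recently, 1 seconds ago) due to excessive rate",
    "Aug 19 18:49:24 db-cloud02 ovs-vswitchd: 3127366|poll_loop|WARN|wakeup due to [POLLIN] on fd 16 (file anon_inode:[eventpoll]) at lib/dpif-linux.c:1197 (52% CPU usage)",
]

if __name__ == "__main__":
    print(summarize(SAMPLE))
```

Feeding the full syslog file to `summarize` line by line (for example, `summarize(open(path))` with your own log path) would give the totals for a whole incident window. Note that the count covers only suppressed log messages, not dropped packets.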
