CPU utilization with kvm / vhost, differences 3.14 / 4.4 / 4.6

2016-07-27 Thread Patrick Schaaf
Hi, I'm stumped by a weird development in measured CPU utilization when testing an upgrade path from 3.14.70 to 4.4.14. I'm running, on identical hardware (2 4-core Xeon E5420), a HA (active/standby) pair of firewall/loadbalancer VMs. The OS on the host and the VM is identical - openSUSE 13.1

kernel panic with kernel 3.14.70, LVS on keepalived restart

2016-05-24 Thread Patrick Schaaf
Dear LVS users / netdev readers, today I've got a pretty peculiar problem. I've been running 3.14.48 (and some earlier 3.14 kernels) for a long time now in an LVS / keepalived driven loadbalancing cluster. See below for more detail on the setup. Today I started to upgrade to the current 3.14.7

Re: bridge not learning from locally sent gratuitous ARP?

2016-04-21 Thread Patrick Schaaf
On Thu, Apr 21, 2016 at 12:31 PM, Toshiaki Makita wrote: > On 2016/04/21 15:37, Patrick Schaaf wrote: > (I understand the problem happens only if you use macvlan on the bridge > device. If wrong, correct me.) That is my understanding, yes. That macvlan device is created by keepalived

bridge not learning from locally sent gratuitous ARP?

2016-04-20 Thread Patrick Schaaf
Dear netdev, I've got a peculiar issue, and hope for clarification / workarounds here. Scenario: - a bridge interface br0, over some ethernet base - a macvlan interface br0-vrrp on top, set up by keepalived, with VRRP VMAC - keepalived regularly sending gratuitous ARP with that VRRP VMAC - (new)

Re: [PATCH] [BACKPORT] [3.14.56] bnx2x: Don't notify about scratchpad parities

2015-12-10 Thread Patrick Schaaf
On Friday 06 November 2015 09:32:46 Greg KH wrote: > On Thu, Nov 05, 2015 at 11:18:37AM +0100, Patrick Schaaf wrote: > > bnx2x: Don't notify about scratchpad parities > > > > This is a (trivial) "backport" of ad6afbe9578d1fa26680faf78c846bd8c00d1d6e > >

Re: Kernel 4.1 hang, apparently in __inet_lookup_established

2015-11-16 Thread Patrick Schaaf
On Sunday 15 November 2015 16:58:33 Grant Zhang wrote: > > Have you tried the two patches Eric mentioned? One of my 4.1.11 server > just hanged with very similar stack trace and I am wondering whether the > aforementioned patches would help. Sorry, Grant - I'm sticking to 3.14.xx for now. best r

Re: [PATCH] [BACKPORT] [3.14.56] bnx2x: Don't notify about scratchpad parities

2015-11-06 Thread Patrick Schaaf
On Friday 06 November 2015 09:32:46 Greg KH wrote: > On Thu, Nov 05, 2015 at 11:18:37AM +0100, Patrick Schaaf wrote: > > bnx2x: Don't notify about scratchpad parities > > > > This is a (trivial) "backport" of ad6afbe9578d1fa26680faf78c846bd8c00d1d6e > >

[PATCH] [BACKPORT] [3.14.56] bnx2x: Don't notify about scratchpad parities

2015-11-05 Thread Patrick Schaaf
ot even see a parity being logged in the system. Signed-off-by: Patrick Schaaf Tested-by: Patrick Schaaf --- Related discussion + more info in http://marc.info/?l=linux-netdev&m=144663711626469 I experienced a production server network outage where over 1 million kernel messages were produc

Re: kernel 3.14.53 + bnx2x loss of connectivity / parity errors / MCP SCPAD

2015-11-05 Thread Patrick Schaaf
Hi Yuval, thanks for your notes. > 4. The patch you've listed merely removes the MCP SCPAD prints, as they're > unavoidable in certain scenarios; It doesn't actually solve anything. I also thought so, thanks for confirming. Do you know whether the messages might have hidden earlier messages poi

kernel 3.14.53 + bnx2x loss of connectivity / parity errors / MCP SCPAD

2015-11-04 Thread Patrick Schaaf
Dear netdevs, on a production server (HP DL380 Gen9 with HP 10GE dual port card - bnx2x driver), I just encountered a full loss of connectivity through the 10 GE ports. Kernel in use is vanilla 3.14.53. On the console I could see this (timestamps omitted, have to type by hand, damn ILO console

Kernel 4.1 hang, apparently in __inet_lookup_established

2015-09-23 Thread Patrick Schaaf
Dear kernel developers, I recently started to upgrade my production hosts and VMs from the 3.14 series to 4.1 kernels, starting with 4.1.6. Yesterday, for the second time after I started these upgrades, I experienced one of our webserver VMs hanging. The first time this happened, the VM hung co