On 1/27/2015 12:00 AM, Doug Ledford wrote:
However, I didn't get more than 5 minutes into testing before I was able
to livelock the system. In this case, from machine A running my
patchset, I did
ping6 -I mlx4_ib0 -i .25 <machine B address>
On machine B running Erez's patch, I did:
rmmod ib_ipoib; modprobe ib_ipoib mcast_debug_level=1; sleep 2; ping6
-i .25 -c 10 -I mlx4_ib0 <machine A address>
And on the machine rdma-master, where the opensm runs, I did just a few:
systemctl restart opensm
The livelock is in the mcast flushing code. On the machine that livelocked
Doug,
The tests you are running and the issues you are seeing fall well into a
to-be-fixed-in-some-kernel-rc1 category but by NO means as something
which should be an rc6 fix.
You must do the distinction between Erez's patch that fixes the
regressions introduced on 3.19-rc1 to your attempts to fix many more
instabilities in the IPoIB driver, which are seen under whatever nasty
test you are running (and it's good we want to reach there).
Roland, the V3 patch solves the rc1 regression and I think we should
pick it up, by no way we can allow to pick eleven patches @ this point.
Thoughts?
Or.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html