On 1/27/2015 12:00 AM, Doug Ledford wrote:
However, I didn't get more than 5 minutes into testing before I was able
to livelock the system.  In this case, from machine A running my
patchset, I did

ping6 -I mlx4_ib0 -i .25 <machine B address>

On machine B running Erez's patch, I did:

rmmod ib_ipoib; modprobe ib_ipoib mcast_debug_level=1; sleep 2; ping6
-i .25 -c 10 -I mlx4_ib0 <machine A address>

And on the machine rdma-master, where the opensm runs, I did just a few:

systemctl restart opensm

The livelock is in the mcast flushing code.  On the machine that livelocked

Doug,

The tests you are running and the issues you are seeing fall well into a to-be-fixed-in-some-kernel-rc1 category but by NO means as something which should be an rc6 fix.

You must do the distinction between Erez's patch that fixes the regressions introduced on 3.19-rc1 to your attempts to fix many more instabilities in the IPoIB driver, which are seen under whatever nasty test you are running (and it's good we want to reach there).

Roland, the V3 patch solves the rc1 regression and I think we should pick it up, by no way we can allow to pick eleven patches @ this point.

Thoughts?

Or.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to