I'm referring to these:

ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff, status 
-11

The patch in http://lists.openfabrics.org/pipermail/general/2008-May/050551.html is causing them.
The patch creates a state when there is no sm_ah, so all alloc_mad() calls
return -11 (-EAGAIN), this goes back to ipoib multicast join: ipoib asks the sa
to join, it queues work that calls send_join(), this calls 
ib_sa_mcmember_rec_query(),
this one calls alloc_mad() and gets -EAGAIN.

How about lowering the severity of this error in ipoib_mcast_join_complete() 
from warning
to debug?

Index: b/drivers/infiniband/ulp/ipoib/ipoib_multicast.c
===================================================================
--- a/drivers/infiniband/ulp/ipoib/ipoib_multicast.c    2008-10-22 
20:28:06.000000000 +0200
+++ b/drivers/infiniband/ulp/ipoib/ipoib_multicast.c    2008-10-27 
20:13:59.000000000 +0200
@@ -443,7 +443,7 @@ static int ipoib_mcast_join_complete(int
        }

        if (mcast->logcount++ < 20) {
-               if (status == -ETIMEDOUT) {
+               if (status == -ETIMEDOUT || status == -EAGAIN) {
                        ipoib_dbg_mcast(priv, "multicast join failed for " 
IPOIB_GID_FMT
                                        ", status %d\n",
                                        IPOIB_GID_ARG(mcast->mcmember.mgid),

--
--Yossi
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to