> Quoting Sean Hefty <[EMAIL PROTECTED]>:
> Subject: Re: [ofa-general] bug 418: was OFED 1.2 beta blocking bugs
> 
> >     ib0: failed send event (status=1, wrid=35 vend_err 69)
> 
> I believe that this is causing the QP to transition into the error state.
> 
> >     ib_mthca 0000:08:00.0: modify QP 3->3 returned status 10.
> 
> The mthca status of 0x10 indicates a bad QP state.  The transition from 3->3 
> is 
> RTS to RTS, but the QP is not in the RTS state, which makes sense given the 
> previous error.  The other receive side errors in the bug report are a 
> fallout 
> from not recovering from the send error.

Errors on UD QP typically indicates a software problem.
It seems we are posting packets that exceed the MTU size.
But I do not see this problem here at the lab.
How to reproduce this problem?

> I don't know if this causes any problems, but at first glance it appears that 
> the IPoIB CM code begins listening for connection requests before the code 
> has 
> had a chance to join the IPoIB broadcast group.  This allows a connection to 
> form before the broadcast traffic is ready.  Someone more familiar with the 
> code 
> than I am will need to determine if this can lead to any undesirable race 
> conditions.

I don't see why is this a problem - I don't need to be a member of a broadcast 
group
to get incoming packets.

-- 
MST
_______________________________________________
general mailing list
general@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to