> > The current code at least tries to do this: it decrements the recv_count
 > > on every receive completion with unsuccessful status, and moves an rx
 > > countext to the reap list when its last receive completes.  I don't see
 > > any obvious bug here; is it not working in your tests?

 > Right, it doesn't work.

Do you have any further information about what goes wrong?

 > Do the post send work request whenever receiving DREQ, which meant the
 > remote TX QP has already teared down, there is no new post recv
 > completions any more after DREQ.

After a DREQ is received, then the local QP is transitioned to the error
state.  However, we don't know when all the receives queued up have
completed (with flush error status).  Also, we may want to clean up a QP
when we didn't receive a DREQ (remote side crashed, or we just have an
idle connection).

 - R.
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to