Re: [OMPI devel] OpenIB BTL and SRQs

Galen Shipman Thu, 12 Jul 2007 12:38:30 -0400


On Jul 12, 2007, at 10:29 AM, Don Kerr wrote:

Through mca parameters one can select the use of shared receive queues
in the openib btl, other than having fewer queues I am wondering what
are the benefits of using this option. Can anyone eleborate on using
them vs the default?

In the trunk the number of queue pairs is the same, regardless of SRQor NON-SRQ hence forth named PP (per-peer).The difference is that PP receive resources scale with the number ofactive QP connections. SRQ receive resources do not.So the real difference is the memory footprint of the the receiveresources. SRQ is potentially much smaller. This comes at a cost; SRQdoes not have flow control as we cannot reserve resources for aparticular peer, so we do have the possibility of an RNR (receivernot ready) NAK if all the shared receive resources are consumed andsome peer is still transmitting messages. This has a performancepenalty as an RNR NAK stalls the IB pipeline. With PP, we canguarantee that resources are available to the peer and thereby avoidRNR (although there is a bug in the trunk right now in that sometimeswe get RNR even with PP, but this is being worked on).

I have been working on a modification to the OpenIB BTL which allowsthe user to specify SRQ and PP QPs arbitrarily. That is we can use amix of PP and SRQ with a mix of receive sizes for each. This iscoming into the trunk very soon, perhaps tomorrow but we need toverify the branch with some additional testing.

I hope this helps, I have a paper at EuroPVM/MPI that discusses muchof this, I will send you a copy off list.


- Galen

_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] OpenIB BTL and SRQs

Reply via email to