On Mar 3, 2009, at 3:31 PM, Eugene Loh wrote:

First, this behavior is basically what I was proposing and what George didn't feel comfortable with. It is arguably no compromise at all. (Uggh, why must I be so honest?) For eager messages, it favors BTLs with sendi functions, which could lead to those BTLs becoming overloaded. I think favoring BTLs with sendi for short messages is good. George thinks that load balancing BTLs is good.

Second, the implementation can be simpler than you suggest:

*) You don't need a separate list, since testing for a sendi-enabled BTL is relatively cheap (I think... could verify).
*) You don't need to shuffle the list. The mechanism used by ob1 just resumes the BTL search from the last BTL used. E.g., see https://svn.open-mpi.org/source/xref/ompi_1.3/ompi/mca/pml/ob1/pml_ob1_sendreq.h#mca_pml_ob1_send_request_start . You use mca_bml_base_btl_array_get_next(&btl_eager) to round-robin over BTLs in a totally fair manner (it remembers where the last loop left off), and mca_bml_base_btl_array_get_size(&btl_eager) to make sure you don't loop endlessly. (See the sketch below.)
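For concreteness, here is a minimal sketch of that search. It assumes the BTL module hanging off each bml_btl exposes a btl_sendi function pointer that is NULL when sendi isn't implemented; this is an illustration of the idea, not the actual ob1 code:

    #include "ompi/mca/bml/bml.h"

    /* Illustration only -- not the actual ob1 code.  Round-robin over the
     * eager BTLs, preferring one that implements sendi.  Assumes the BTL
     * module's btl_sendi pointer is NULL when sendi is not implemented. */
    static inline mca_bml_base_btl_t *
    find_sendi_btl(mca_bml_base_btl_array_t *btl_eager)
    {
        size_t i;
        size_t num_btls = mca_bml_base_btl_array_get_size(btl_eager);

        for (i = 0; i < num_btls; i++) {
            /* get_next() remembers where the previous call left off, so
             * repeated calls round-robin fairly over all eager BTLs. */
            mca_bml_base_btl_t *bml_btl =
                mca_bml_base_btl_array_get_next(btl_eager);
            if (NULL != bml_btl->btl->btl_sendi) {
                return bml_btl;   /* found a sendi-capable BTL */
            }
        }
        return NULL;              /* no sendi anywhere; use the normal path */
    }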

Cool / fair enough.

How about an MCA parameter to switch between this mechanism (early sendi) and the original behavior (late sendi)?

This is the usual way that we resolve "I want to do X / I want to do Y" disputes. :-)
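Something along these lines might do it, e.g. registering a PML-level switch with the 1.3-era mca_base_param_reg_int_name() API. The parameter name "ob1_use_early_sendi" and the variable holding it are made up here, just to show the shape of the thing:

    #include "opal/mca/base/mca_base_param.h"

    /* Sketch only: the parameter name and variable are hypothetical.
     * 0 = original behavior (late sendi), nonzero = early sendi. */
    static int mca_pml_ob1_use_early_sendi = 0;

    static void register_sendi_param(void)
    {
        mca_base_param_reg_int_name("pml", "ob1_use_early_sendi",
                                    "If nonzero, try each eager BTL's sendi "
                                    "before falling back to the traditional "
                                    "send path",
                                    false, false,
                                    0,   /* default: original (late sendi) */
                                    &mca_pml_ob1_use_early_sendi);
    }

Users could then flip it at run time with something like "mpirun --mca pml_ob1_use_early_sendi 1 ...".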

I've been toying with two implementations. The one I described in San Jose was called FAST, so let's keep calling it that. It tests for sendi early in the PML, falling back to a traditional send only if no BTL provides sendi. To preserve the BTL ordering George favors (always round-robinning over BTLs, looking for sendi only secondarily), I tried another implementation I'll call FAIR. It initializes the send request only minimally: one still makes a number of function calls and goes "deep" into the PML, but send-request initialization is deferred as long as possible. I can't promise that both implementations, FAST and FAIR, are equally rock solid or optimized, but this is where I am so far; a rough sketch of the FAST control flow appears after the list below. The differences are:

*) FAST involves far fewer code changes.
*) FAST produces lower latencies. E.g., for 0-byte OSU latencies, FAST is 8-10% better than stock OMPI while FAIR is only 1-3% (or 2-3%... something like that) better. (The improvements I showed in San Jose for FAST were more dramatic than 8-10%, but that's because there were optimizations on the receive side and in the data convertors as well. In this e-mail, I'm talking only about send-request optimizations.)
*) Theoretically, FAIR reaches more broadly. E.g., if persistent sends can always use a sendi path, they will all potentially benefit. (This is theory; I haven't actually observed such a speed-up yet, and it might just end up getting lost in the noise.)
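For what it's worth, here is a rough sketch of the FAST idea only. It is not Eugene's actual patch: pml_try_early_sendi() is a made-up helper standing in for whatever code hands a small, contiguous, eager-sized message to a sendi-capable BTL, and the header/function names are the 1.3-era ones from memory:

    #include "ompi/communicator/communicator.h"
    #include "ompi/datatype/datatype.h"
    #include "ompi/mca/bml/bml.h"
    #include "ompi/mca/pml/pml.h"
    #include "ompi/mca/pml/ob1/pml_ob1.h"

    /* Hypothetical helper: tries to push the message through a
     * sendi-capable eager BTL; returns OMPI_SUCCESS only if that worked. */
    extern int pml_try_early_sendi(mca_bml_base_endpoint_t *endpoint,
                                   void *buf, size_t count,
                                   ompi_datatype_t *dtype, int dst, int tag,
                                   ompi_communicator_t *comm);

    /* Sketch of the FAST control flow (early sendi) -- illustration only. */
    static int pml_send_fast_sketch(mca_bml_base_endpoint_t *endpoint,
                                    void *buf, size_t count,
                                    ompi_datatype_t *dtype, int dst, int tag,
                                    ompi_communicator_t *comm)
    {
        /* Cheap path first: if an eager BTL offers sendi and the message
         * qualifies, the send completes right here, with no send request
         * ever being allocated or initialized. */
        if (OMPI_SUCCESS == pml_try_early_sendi(endpoint, buf, count,
                                                dtype, dst, tag, comm)) {
            return OMPI_SUCCESS;
        }

        /* Otherwise fall back to the traditional path: build and start a
         * fully initialized ob1 send request, exactly as ob1 does today. */
        return mca_pml_ob1_send(buf, count, dtype, dst, tag,
                                MCA_PML_BASE_SEND_STANDARD, comm);
    }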


--
Jeff Squyres
Cisco Systems
