Re: [OMPI devel] RFC: eliminating "descriptor" argument from sendi function

George Bosilca Mon, 23 Feb 2009 15:26:04 -0500


On Feb 23, 2009, at 12:14 , Eugene Loh wrote:

I'm a newbie and George is a veteran. So, this feels rather likeDavid and Goliath. (Hmm, David won and became king. Gee, I kindalike that.) Anyhow...


That's an old story, we're living in modern times now ;)

George Bosilca wrote:
It doesn't sound reasonable to me. There is a reason for this, andI think it's a good reason. The sendi function work for somedevices as a fast path for sending data, when the network is notflooded. However, in the case sendi cannot do the job we expect,the fact that it return the descriptor save us a call (we don'thave to do the alloc call later).
This does not make any sense to me. In what sense are we "saving acall"? Not in the sense of run-time performance since the BTL isnow having to allocate a descriptor it did not otherwise need. Theamount of work is the same (one descriptor allocation either way),but you're just pushing that work into the BTLs.

The descriptor is a BTL resource. If the sendi doesn't return one, thePML will have to call the BTL alloc function from the BTL again (inthis case the calls will look like this: btl_sendi followed bybtl_alloc followed by btl_send). I'm not looking only at SM, I wantall of the BTL to have the opportunity to get good performance.

If sendi return a descriptor when it fails to send the data the calllist will be shorter: btl_sendi followed by btl_send. I'm trying todecrease the number of jumps between the layers (PML/BTL), not thenumber of lines of code in the BTL.

We are certainly not "saving a call" in the sense of reducing sourcecode. The PML has to have code to allocate a descriptor anywaysince there may not even be any sendi functions. So, the code toallocate the descriptor is already in the PML. By asking sendifunctions to do the same, you're replicating that code in everysendi function... possibly multiple times per BTL since a sendifunction might have multiple "out of resource" return paths.
Therefore, in the PML we already have the descriptor and we canhand it back to the BTL, which give a chance for asynchronousprogress later on. Without this descriptor, the only option thePML have is to put the PML request in a queue, and to try to sendit later, which is very expensive.
This also makes no sense to me. We're not talking about doingwithout the descriptor. The PML is prepared to allocate it anyhow.The issue is where the descriptor is allocated in the case thatsendi functions exist but cannot succeed. One alternative is to usea single allocation point in the PML. The other alternative (whatwe have today) is to replicate that code out to multiple sites,adding unnecessary source code and interface arguments.

As I said previously, this save one jump from the PML to the BTL byadding one more return argument to the sendi function and some linesof code in every BTL. Not a big deal as a correctly written BTL can doit pretty smartly (as an example special return case where everybodyjumps when an error is detected).


  george.

The PML code is in
https://svn.open-mpi.org/source/xref/ompi_1.3/ompi/mca/pml/ob1/pml_ob1_sendreq.c#mca_pml_ob1_send_request_start_copy
Existing BTL sendi functions are at
https://svn.open-mpi.org/source/xref/ompi_1.3/ompi/mca/btl/sm/btl_sm.c#mca_btl_sm_sendihttps://svn.open-mpi.org/source/xref/ompi_1.3/ompi/mca/btl/mx/btl_mx.c#mca_btl_mx_sendihttps://svn.open-mpi.org/source/xref/ompi_1.3/ompi/mca/btl/portals/btl_portals_send.c#mca_btl_portals_sendi
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] RFC: eliminating "descriptor" argument from sendi function

Reply via email to