Re: [OMPI devel] RFC: eliminating "descriptor" argument from sendi function

George Bosilca Tue, 24 Feb 2009 10:43:32 -0500

Here is another way to write the code without having to pay theexpensive initialization of sendreq.

  first_time = 0;
  for ( btl = ... ) {
      if ( SUCCESS == sendi() ) return SUCCESS;
      if( 0 == first_time++)  set_up_expensive_send_request(&sendreq);
      if ( SUCCESS == send(&sendreq) ) return SUCESS;
  }

Anyway, the main problem is not in this code. The main problem is inthe fact that now instead of sharing the load over all available BTLin a round-robin fashion, you overload the BTL(s) providing the sendifunction with small (and eager) messages, and you completely ignoreall the others until something goes wrong.

However, I can see one interesting point in your approach. As the BTLsare indexed in increasing order of their published latency in theeager array, we might benefit from the smallest latency for severalsmall messages before taking the most expensive path. But this is notsomething we should tackle allegedly, as it modify the mostperformance related parts of the PML.


  george.

On Feb 23, 2009, at 18:07 , Eugene Loh wrote:

Eugene Loh wrote:
Actually, there may be a more important issue here.
Currently, the PML chooses the BTL first. Once the BTL choice isestablished, only then does the PML choose between sendi and send.
Currently, it's also the case that we're spending a lot of time inthe PML doing a bunch of stuff that's totally unnecessary if thesendi succeeds. So, we're neutralizing much of the advantage sendiis supposed to provide.
So, I'm changing the PML to invoke sendi much sooner. The way I'mdoing this is to loop over BTLs, looking for a sendi that existsand succeeds. If I find one, I'm done. If I don't, I have to gowith the standard send code path.
The logic, as I just described it, allows that multiple sendifunctions could fail and that the send that is ultimately usedmight be for a different BTL than for any of the failing sendi's.This would suggest that I do NOT want failing sendi's leaving anyside effects (like allocated descriptors).
Is my proposed logic bad? Should I implement things another way?E.g., if I find a sendi function, use that BTL even if the sendifailed and another BTL might have a sendi that could succeed? Or,does my proposed change provide the justification for my pullingdescriptor allocations out of the sendi functions?
Here's another way of looking at it.

The current PML send code does this:

  set_up_expensive_send_request(&sendreq);
  for ( btl = ... ) {
      if ( SUCCESS == sendi() ) return SUCCESS;
      if ( SUCCESS == send(&sendreq) ) return SUCESS;
  }
That is, we try one BTL after another. For each one, we try sendifirst. So, each sendi() that fails is immediately followed by asend() of the same BTL. It's okay for a sendi() to do prep work forthe send() of the same BTL. This scheme does a bunch of expensivesend-request initialization that is unnecessary if the sendi(),which doesn't need the send request, succeeds.
My proposed PML send logic is this:

  for ( btl = ... ) {
      if ( SUCCESS == sendi() ) return SUCCESS;
  }
  set_up_expensive_send_request(&sendreq);
  for ( btl = ... ) {
      if ( SUCCESS == send(&sendreq) ) return SUCCESS;
  }
That is, if I can find a sendi() function, I use it. Only if Ican't find any sendi() do I set up the send request and call send()functions.
This is why I would like sendi() functions to have no sideeffects... e.g., no allocated descriptors.
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] RFC: eliminating "descriptor" argument from sendi function

Reply via email to