George,
When asked about MTL versus BTL, we always suggest that users try both
with their application and determine which is best. I have had
customers report the BTL is better on Solaris (memory registration is
expensive and the BTL can overlap registration and communication when
it fragments a large message) and sometimes better on Linux, but not
always.
The most common issue lately is that users see a failure on high core
count machines (8 or 16) due to the fact that both the MTL and BTL are
opening endpoints. They run into the max number of allowable endpoints
and OMPI aborts. I would suggest that OMPI clearly selects one CM and
only open endpoints for that CM, if possible.
Scott
On Aug 11, 2009, at 3:29 PM, George Bosilca wrote:
Here is an alternative solution. If instead of setting a hard coded
value for the priority of CM, we make it use the priority of the MTL
that get selected, we can solve this problem on a case by case
approach by carefully setting the MTL's priority (bump up the
portals and PSM one and decrease the MX MTL). As a result we can
remove all the extra selection logic and priority management from
the pml_cm_component.c, and still have a satisfactory solution for
everybody.
george.
On Aug 11, 2009, at 15:23 , Brian W. Barrett wrote:
On Tue, 11 Aug 2009, Rainer Keller wrote:
When compiling on systems with MX or Portals, we offer MTLs and
BTLs.
If MTLs are used, the PML/CM is loaded as well as the PML/OB1.
Question 1: Is favoring OB1 over CM required for any MTL (MX,
Portals, PSM)?
George has in the past had srtong feelings on this issue, believing
that for MX, OB1 is prefered over CM. For Portals, it's probably
in the noise, but the BTL had been better tested than the MTL, so
it was left as the default. Obviously, PSM is a much better choice
on InfiniPath than straight OFED, hence the odd priority bump.
At this point, I would have no objection to making CM's priority
higher for Portals.
Question 2: If it is, I would like to reflect this in the default
priorities,
aka have CM have a priority lower than OB1 and in the case of PSM
raising it.
I don't have strong feelings on this one.
Brian
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel