FWIW: this isn't a bug in orte_dpm, but in the MPI binding for comm_join. The 
problem is that both sides in the comm_join are setting "send_first" to true - 
i.e., both sides are trying to be the first to send on the handshake. We got 
away with this before because of a bug in orte_dpm that made the value of 
send_first irrelevant, but that has now been fixed.

So someone needs to figure out how to properly set "send_first" in comm_join so 
that the two sides agree on who does what first. Looking at the code, it isn't 
obvious to me how one would do so as I don't see any rank info passed into the 
function.


On Apr 9, 2012, at 8:31 AM, Josh Hursey wrote:

> This is totally not related to the bug report, but a neat trick in Trac.
> 
> My question was "what were the commits between r26240 and 26249"?
> In the search box type:
>  log:@26240:26249
> Or use the direct url:
>  https://svn.open-mpi.org/trac/ompi/log/?revs=26240-26249
> 
> nifty...
> 
> -- Josh
> 
> On Mon, Apr 9, 2012 at 9:17 AM, TERRY DONTJE <terry.don...@oracle.com> wrote:
>> After looking at Oracles MTT results there seem to be a (some??) regressions
>> between r26240 and 26249 detected by the ibm and intel tests suites.  An
>> example of this is the failures in the comm_join, final and loop_spawn tests
>> of the ibm test suite as seen in
>> http://www.open-mpi.org/mtt/index.php?do_redir=2055.
>> 
>> Note, I've seen similar errors detected by IU runs too.
>> 
>> I'll look further into this but I thought I would post this just in case
>> someone else has seen this.
>> --
>> Terry D. Dontje | Principal Software Engineer
>> Developer Tools Engineering | +1.781.442.2631
>> Oracle - Performance Technologies
>> 95 Network Drive, Burlington, MA 01803
>> Email terry.don...@oracle.com
>> 
>> 
>> 
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 
> 
> -- 
> Joshua Hursey
> Postdoctoral Research Associate
> Oak Ridge National Laboratory
> http://users.nccs.gov/~jjhursey
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel


Reply via email to