Re: [OMPI devel] singleton appears to be broken

2014-02-07 Thread George Bosilca
It is difficult to see it from the stack trace, as it happens in the ORTE threads. But I do have all the output I expect, and as the application I was running is hello_world I’m almost certain it happens during MPI_Finalize. George. On Feb 7, 2014, at 03:38 , Ralph Castain wrote: > Interest

Re: [OMPI devel] singleton appears to be broken

2014-02-07 Thread Ralph Castain
Think I see the code path that causes this - I'll have to play with it a little as the race condition is biased heavily towards success, so (as you noted) it won't happen very often. On Feb 6, 2014, at 6:38 PM, Ralph Castain wrote: > Interesting - does it happen in finalize, or in the middle o

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread Ralph Castain
Interesting - does it happen in finalize, or in the middle of execution? On Feb 6, 2014, at 5:57 PM, George Bosilca wrote: > Out of 150 runs I could reproduce it once. When it failed I got exactly the > same assert: > > hello: ../../../../ompi/orte/mca/rml/base/rml_base_msg_handlers.c:75: >

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread George Bosilca
Out of 150 runs I could reproduce it once. When it failed I got exactly the same assert: hello: ../../../../ompi/orte/mca/rml/base/rml_base_msg_handlers.c:75: orte_rml_base_post_recv: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (recv))->obj_magic_id’ failed. A quic

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread George Bosilca
A rather long configure line: ./configure —enable-picky —enable-debug —enable-coverage —disable-heterogeneous —enable-visibility —enable-contrib-no-build=vt —enable-mpirun-prefix-by-default --disable-mpi-cxx --with-cma --enable-static --enable-mca-no-build=plm-tm,ess-tm,ras-tm,plm-tm,ras-slurm,

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread Ralph Castain
Oh, should have noted: that's on both trunk and 1.7.4 On Feb 6, 2014, at 4:10 PM, Ralph Castain wrote: > Works for me on Mac and Linux/Centos6.2 as well > > > On Feb 6, 2014, at 4:00 PM, Jeff Squyres (jsquyres) > wrote: > >> I'm unable to replicate on Linux/RHEL/64 bit with a trunk build.

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread Ralph Castain
Works for me on Mac and Linux/Centos6.2 as well On Feb 6, 2014, at 4:00 PM, Jeff Squyres (jsquyres) wrote: > I'm unable to replicate on Linux/RHEL/64 bit with a trunk build. How did you > configure? Here's my configure: > > ./configure --prefix=/home/jsquyres/bogus --disable-vt > --enable-

Re: [OMPI devel] singleton appears to be broken

2014-02-06 Thread Jeff Squyres (jsquyres)
I'm unable to replicate on Linux/RHEL/64 bit with a trunk build. How did you configure? Here's my configure: ./configure --prefix=/home/jsquyres/bogus --disable-vt --enable-mpirun-prefix-by-default --disable-mpi-fortran Does this happen with every run? On Feb 6, 2014, at 6:53 PM, George Bos