Just to close my end of this loop: as of trunk r29213, it all works for me. Thanks!

On Sep 18, 2013, at 12:52 PM, Ralph Castain <r...@open-mpi.org> wrote:

> Thanks George - much appreciated
>
> On Sep 18, 2013, at 9:49 AM, George Bosilca <bosi...@icl.utk.edu> wrote:
>
>> The test case was broken. I just pushed a fix.
>>
>> George.
>>
>> On Sep 18, 2013, at 16:49 , Ralph Castain <r...@open-mpi.org> wrote:
>>
>>> Hangs with any np > 1
>>>
>>> However, I'm not sure if that's an issue with the test vs the underlying
>>> implementation
>>>
>>> On Sep 18, 2013, at 7:40 AM, "Jeff Squyres (jsquyres)" <jsquy...@cisco.com>
>>> wrote:
>>>
>>>> Does it hang when you run with -np 4?
>>>>
>>>> Sent from my phone. No type good.
>>>>
>>>> On Sep 18, 2013, at 4:10 PM, "Ralph Castain" <r...@open-mpi.org> wrote:
>>>>
>>>>> Strange - it works fine for me on my Mac. However, I see one difference -
>>>>> I only run it with np=1
>>>>>
>>>>> On Sep 18, 2013, at 2:22 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com>
>>>>> wrote:
>>>>>
>>>>>> On Sep 18, 2013, at 9:33 AM, George Bosilca <bosi...@icl.utk.edu> wrote:
>>>>>>
>>>>>>> 1. sm doesn't work between spawned processes. So you must have another
>>>>>>> network enabled.
>>>>>>
>>>>>> I know :-). I have tcp available as well (OMPI will abort if you only
>>>>>> run with sm,self because the comm_spawn will fail with unreachable
>>>>>> errors -- I just tested/proved this to myself).
>>>>>>
>>>>>>> 2. Don't use the test case attached to my email, I left an xterm based
>>>>>>> spawn and the debugging. It can't work without xterm support. Instead
>>>>>>> try using the test case from the trunk, the one committed by Ralph.
>>>>>>
>>>>>> I didn't see any "xterm" strings in there, but ok.
>>>>>> :-) I ran with orte/test/mpi/intercomm_create.c, and that hangs for me
>>>>>> as well:
>>>>>>
>>>>>> -----
>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 4]
>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 5]
>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 6]
>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 7]
>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 4]
>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 5]
>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 6]
>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 7]
>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>> [hang]
>>>>>> -----
>>>>>>
>>>>>> Similarly, on my Mac, it hangs with no output:
>>>>>>
>>>>>> -----
>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>> [hang]
>>>>>> -----
>>>>>>
>>>>>>> George.
>>>>>>>
>>>>>>> On Sep 18, 2013, at 07:53 , "Jeff Squyres (jsquyres)"
>>>>>>> <jsquy...@cisco.com> wrote:
>>>>>>>
>>>>>>>> George --
>>>>>>>>
>>>>>>>> When I build the SVN trunk (r29201) on 64 bit linux, your attached
>>>>>>>> test case hangs:
>>>>>>>>
>>>>>>>> -----
>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 4]
>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 5]
>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 6]
>>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank 7]
>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0)
>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 4]
>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 5]
>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 6]
>>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 7]
>>>>>>>> [hang]
>>>>>>>> -----
>>>>>>>>
>>>>>>>> On my Mac, it hangs without printing anything:
>>>>>>>>
>>>>>>>> -----
>>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create
>>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create
>>>>>>>> [hang]
>>>>>>>> -----
>>>>>>>>
>>>>>>>>
>>>>>>>> On Sep 18, 2013, at 1:48 AM, George Bosilca <bosi...@icl.utk.edu>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Here is a quick (and definitively not the cleanest) patch that
>>>>>>>>> addresses the MPI_Intercomm issue at the MPI level.
>>>>>>>>> It should be applied after removal of 29166.
>>>>>>>>>
>>>>>>>>> I also added the corrected test case stressing the corner cases by
>>>>>>>>> doing barriers at every inter-comm creation and doing a clean
>>>>>>>>> disconnect.
>>>>>>>>
>>>>>>>>
>>>>>>>> --
>>>>>>>> Jeff Squyres
>>>>>>>> jsquy...@cisco.com
>>>>>>>> For corporate legal information go to:
>>>>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> devel mailing list
>>>>>>>> de...@open-mpi.org
>>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel

--
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/