I’m working on providing the info, guys - just sitting in a branch right now. Too many meetings...sigh.
> On Aug 11, 2016, at 10:09 AM, George Bosilca <bosi...@icl.utk.edu> wrote: > > I just pushed a solution to this problem in 8d0baf140f. If we are unable to > extract the expected information from the RTE, we simply build a > non-reordered communicator and gracefully return. > > That being said, not being able to correctly retrieve OPAL_PMIX_NODEID has > the potential to drastically decrease the performance as no specialized > hierarchies can be built without the RTE information. > > George. > > > On Wed, Aug 10, 2016 at 3:57 AM, Gilles Gouaillardet <gil...@rist.or.jp > <mailto:gil...@rist.or.jp>> wrote: > Ralph, > > > i noticed dist-graph/distgraph_test_4 from the ibm test suite fails when > using a hostfile and running no task on the host running mpirun. > > n0$ mpirun --host n1:1,n2:1 -np 2 ./dist-graph/distgraph_test_4 > > > the root cause is OPAL_PMIX_NODEID is correctly set ( 0, 1, 2) by mpirun, but > for some reasons, orted sets it to -1 everywhere. > > an indirect consequence is a crash of the test (it believes tasks run on zero > distinct nodes instead of 2) > > > this occurs only master, and v2.x is fine. > > > Could you please have a look ? > > > Cheers, > > > Gilles > > _______________________________________________ > devel mailing list > devel@lists.open-mpi.org <mailto:devel@lists.open-mpi.org> > https://rfd.newmexicoconsortium.org/mailman/listinfo/devel > <https://rfd.newmexicoconsortium.org/mailman/listinfo/devel> > > _______________________________________________ > devel mailing list > devel@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/devel
_______________________________________________ devel mailing list devel@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/devel