Hi Adrian and Andrew,

I'm able to reproduce your problem on one of our QLogic clusters. We are using PSM 1.14 and Slurm. I'm noticing that, for some reason, the ORTE_MCA_orte_precondition_transports environment variable is not being set in our setup.
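If it helps, here is a minimal sketch of a script that could be launched in place of the real test to see whether the variable reaches each process. The variable name is copied from this thread and may differ in your build, so it is a parameter; the helper names `check_transport_key` and `report` are made up for illustration and are not part of Open MPI.

```python
import os
import socket

# Name as written in this thread. Treat it as an assumption and
# change it if your build uses a different prefix.
DEFAULT_VAR = "ORTE_MCA_orte_precondition_transports"


def check_transport_key(environ=None, name=DEFAULT_VAR):
    """Return (is_set, value) for `name` in the given environment mapping.

    An empty string is reported as not set, since an empty key is
    as unusable as a missing one.
    """
    env = os.environ if environ is None else environ
    value = env.get(name)
    return (value is not None and value != "", value)


def report(environ=None, name=DEFAULT_VAR):
    """Build a one-line, per-process summary suitable for printing."""
    is_set, value = check_transport_key(environ, name)
    status = "set to " + value if is_set else "NOT set"
    return f"{socket.gethostname()} pid {os.getpid()}: {name} is {status}"


if __name__ == "__main__":
    print(report())
```

Saved as, say, check_env.py (a hypothetical file name) and started with something like `mpirun -np 2 python3 check_env.py`, each process prints one line. This only shows what the launched processes see in their environment; it does not replace the odls_base_verbose output.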
Could you run your test with --mca odls_base_verbose 100 and check to see that in fact that env. variable isn't in the list of passed env. variables?

Would one of you mind opening an issue to track this problem?

Thanks,

Howard

2015-01-09 7:52 GMT-07:00 Friedley, Andrew <[email protected]>:

> No this is not expected behavior.
>
> The PSM MTL code has not changed in 2 months, when I fixed that unused
> variable warning for you. That suggests something above the PSM MTL broke
> things. I see no reason your older software install should suddenly
> stopping working if all you are updating is OMPI master -- at least with
> respect to PSM anyway.
>
> The error message is right, it's not possible to open more than one
> context per process. This hasn't changed. It does indicate that maybe
> something is causing the MTL to be opened twice in each process?
>
> Andrew
>
> > -----Original Message-----
> > From: devel [mailto:[email protected]] On Behalf Of Adrian
> > Reber
> > Sent: Friday, January 9, 2015 4:13 AM
> > To: [email protected]
> > Subject: [OMPI devel] Changed behaviour with PSM on master
> >
> > Running the mpi_test_suite on master used to work with no problems. At
> > some point in time it stopped working however and now I get only error
> > messages from PSM:
> >
> > """
> > n050301:3.0.In PSM version 1.14, it is not possible to open more than one
> > context per process
> >
> > [n050301:26526] Open MPI detected an unexpected PSM error in opening an
> > endpoint: In PSM version 1.14, it is not possible to open more than one
> > context per process
> > """
> >
> > I know that I do not have the newest version of the PSM library and that I
> > need to update the library but as this requires many software packages to be
> > re-compiled we are trying to avoid it on our CentOS6 based system.
> >
> > My main question (probably for Andrew) is if this is an expected behaviour
> > on master. It works on 1.8.x and it used to work on master at least until
> > 2014-12-08.
> >
> > This is the last MTT entry for working PSM (with my older version)
> > http://mtt.open-mpi.org/index.php?do_redir=2226
> >
> > and since a few days it fails on master
> > http://mtt.open-mpi.org/index.php?do_redir=2225
> >
> > On another system (RHEL7) with newer PSM libraries there is no such error.
> >
> > Adrian
> _______________________________________________
> devel mailing list
> [email protected]
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/01/16766.php
