Greg and I are chatting off list; there's something definitely weird going on in his setup.
We'll report back to the list when we figure it out. On Jan 24, 2014, at 1:26 PM, Gus Correa <g...@ldeo.columbia.edu> wrote: > On 01/24/2014 12:50 PM, Fischer, Greg A. wrote: >> Yep. That was the problem. It works beautifully now. >> >> Thanks for prodding me to take another look. >> >> With regards to openmpi-1.6.5, the system that I'm compiling and running on, > SLES10, contains some pretty dated software (e.g. Linux 2.6.x, python 2.4, > gcc 4.1.2). Is it possible there's simply an > incompatibility lurking in there somewhere that would trip > openmpi-1.6.5 but not openmpi-1.4.3? >> >> Greg >> > > Hi Greg > > FWIW, we have OpenMPI 1.6.5 installed > (and we have used OMPI 1.4.5, 1.4.4, 1.4.3, ..., 1.2.8, before) > in our older cluster that has CentOS 5.2, Linux kernel 2.6.18, > gcc 4.1.2, Python 2.4.3, etc. > Parallel programs compile and run with OMPI 1.6.5 without problems. > > I hope this helps, > Gus Correa > >>> -----Original Message----- >>> From: Fischer, Greg A. >>> Sent: Friday, January 24, 2014 11:41 AM >>> To: 'Open MPI Users' >>> Cc: Fischer, Greg A. >>> Subject: RE: [OMPI users] simple test problem hangs on mpi_finalize and >>> consumes all system resources >>> >>> Hmm... It looks like CMAKE was somehow finding openmpi-1.6.5 instead of >>> openmpi-1.4.3, despite the environment variables being set otherwise. This >>> is likely the explanation. I'll try to chase that down. >>> >>>> -----Original Message----- >>>> From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Jeff >>>> Squyres (jsquyres) >>>> Sent: Friday, January 24, 2014 11:39 AM >>>> To: Open MPI Users >>>> Subject: Re: [OMPI users] simple test problem hangs on mpi_finalize and >>>> consumes all system resources >>>> >>>> Ok. I only mention this because the "mca_paffinity_linux.so: undefined >>>> symbol: mca_base_param_reg_int" type of message is almost always an >>>> indicator of two different versions being installed into the same tree. >>>> >>>> >>>> On Jan 24, 2014, at 11:26 AM, "Fischer, Greg A." >>>> <fisch...@westinghouse.com> wrote: >>>> >>>>> Version 1.4.3 and 1.6.5 were and are installed in separate trees: >>>>> >>>>> 1003 fischega@lxlogin2[~]> ls >>>>> /tools/casl_sles10/vera_clean/gcc-4.6.1/toolset/openmpi-1.* >>>>> /tools/casl_sles10/vera_clean/gcc-4.6.1/toolset/openmpi-1.4.3: >>>>> bin etc include lib share >>>>> >>>>> /tools/casl_sles10/vera_clean/gcc-4.6.1/toolset/openmpi-1.6.5: >>>>> bin etc include lib share >>>>> >>>>> I'm fairly sure I was careful to check that the LD_LIBRARY_PATH was >>>>> set >>>> correctly, but I'll check again. >>>>> >>>>>> -----Original Message----- >>>>>> From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Jeff >>>>>> Squyres (jsquyres) >>>>>> Sent: Friday, January 24, 2014 11:07 AM >>>>>> To: Open MPI Users >>>>>> Subject: Re: [OMPI users] simple test problem hangs on mpi_finalize >>>>>> and consumes all system resources >>>>>> >>>>>> On Jan 22, 2014, at 10:21 AM, "Fischer, Greg A." >>>>>> <fisch...@westinghouse.com> wrote: >>>>>> >>>>>>> The reason for deleting the openmpi-1.6.5 installation was that I >>>>>>> went back >>>>>> and installed openmpi-1.4.3 and the problem (mostly) went away. >>>>>> Openmpi- >>>>>> 1.4.3 can run the simple tests without issue, but on my "real" >>>>>> program, I'm getting symbol lookup errors: >>>>>>> >>>>>>> mca_paffinity_linux.so: undefined symbol: mca_base_param_reg_int >>>>>> >>>>>> This sounds like you are mixing 1.6.x and 1.4.x in the same >>>>>> installation >>>> tree. >>>>>> This can definitely lead to sadness. >>>>>> >>>>>> More specifically: installing 1.6 over an existing 1.4 installation >>>>>> (and vice >>>>>> versa) is definitely NOT supported. The set of plugins that the two >>>>>> install are different, and can lead to all manner of weird/undefined >>>> behavior. >>>>>> >>>>>> FWIW: I typically install Open MPI into a tree by itself. And if I >>>>>> later want to remove that installation, I just "rm -rf" that tree. >>>>>> Then I can install a different version of OMPI into that same tree >>>>>> (because the prior tree is completely gone). >>>>>> >>>>>> However, if you can't install OMPI into a tree by itself, you can >>>>>> "make uninstall" from the source tree, and that should surgically >>>>>> completely remove OMPI from the installation tree. Then it is safe >>>>>> to install a different version of OMPI into that same tree. >>>>>> >>>>>> Can you verify that you had installed OMPI into completely clean >>>>>> trees? If you didn't, I can imagine that causing the kinds of >>>>>> errors that you >>>> described. >>>>>> >>>>>> -- >>>>>> Jeff Squyres >>>>>> jsquy...@cisco.com >>>>>> For corporate legal information go to: >>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>>>> >>>>>> _______________________________________________ >>>>>> users mailing list >>>>>> us...@open-mpi.org >>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> users mailing list >>>>> us...@open-mpi.org >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>>> >>>> >>>> -- >>>> Jeff Squyres >>>> jsquy...@cisco.com >>>> For corporate legal information go to: >>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>> >>>> _______________________________________________ >>>> users mailing list >>>> us...@open-mpi.org >>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>>> >> >> >> _______________________________________________ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/