If you wouldn't mind, could you try it again after applying the attached patch? This looks like a problem we encountered on another release where something in the runtime didn't get initialized early enough. It only shows up in certain circumstances, but this seems to fix it.
You can apply the patch by going to the top directory where you installed the code, saving the patch, and then "patch -p0 < patch.txt". You will then just need to do another "make install". If this solves your problem, I will have the patch incorporated into the 1.1.2 release code.

Thanks
Ralph

On 10/26/06 1:54 PM, "Daniel Vollmer" <ma...@maven.de> wrote:

> Hi all,
>
> I've compiled open-mpi 1.1.2 in 64bit mode (using XCode 2.4 /
> i686-apple-darwin8-gcc-4.0.1 (GCC) 4.0.1 (Apple Computer, Inc. build
> 5363)) with
> ./configure --prefix=/usr/local/openmpi-1.1.2 --enable-debug CFLAGS=-m64 CXXFLAGS=-m64 OBJCFLAGS=-m64 LDLFLAGS=-m64
> on an Intel Mac Pro (with Xeon 51XX processors) on Mac OS 10.4.8.
> Everything builds fine and results in proper 64bit libraries and
> executables. Unfortunately, when attempting to run something as
> simple as
> /usr/local/openmpi-1.1.2/bin/mpirun ls
> it crashes (and hangs) with a NULL pointer dereference after outputting
> [Sonnenblume.local:25036] opal_ifinit: unable to find network interfaces.
>
> gdb shows the following:
> Sonnenblume:~/Development/tau/openmpi-1.1.2 maven$ gdb /usr/local/openmpi-1.1.2/bin/mpirun
> GNU gdb 6.3.50-20050815 (Apple version gdb-563) (Wed Jul 19 05:10:58 GMT 2006)
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you are
> welcome to change it and/or distribute copies of it under certain conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB. Type "show warranty" for details.
> This GDB was configured as "i386-apple-darwin"...Reading symbols for shared libraries .... done
>
> (gdb) run ls
> Starting program: /usr/local/openmpi-1.1.2/bin/mpirun ls
> Reading symbols for shared libraries .+++ done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> [Sonnenblume.local:25051] opal_ifinit: unable to find network interfaces.
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
> Reading symbols for shared libraries . done
>
> Program received signal EXC_BAD_ACCESS, Could not access memory.
> Reason: KERN_INVALID_ADDRESS at address: 0x0000000000000000
> 0x0000000000000000 in ?? ()
> (gdb) bt
> #0 0x0000000000000000 in ?? ()
> #1 0x000000010040851c in orte_init_stage1 (infrastructure=true) at runtime/orte_init_stage1.c:267
> #2 0x000000010040c727 in orte_system_init (infrastructure=true) at runtime/orte_system_init.c:41
> #3 0x0000000100407eea in orte_init (infrastructure=true) at runtime/orte_init.c:48
> #4 0x0000000100000e20 in orterun (argc=2, argv=0x7fff5fbffbc0) at orterun.c:329
> #5 0x0000000100000cc1 in main (argc=2, argv=0x7fff5fbffbc0) at main.c:13
>
>
> Any ideas / advice?
>
> Thanks,
> Daniel.
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
patch.txt
Description: Binary data