On Dec 13, 2010, at 6:08 PM, David Fiala wrote:

> I noticed that I can get the openib transport to work successfully
> under version 1.4.3 when configured with: --with-openib
> --enable-openib-ibcm

I'm surprised; the IBCM connection manager support in OMPI is not complete.  
You probably shouldn't be using it.

> When I configure 1.5 or 1.5.1 I used: --with-openib (noting the
> absence of the ibmc flag)

Good.

> However, when I actually try to use openib on a basic MPI program I
> get a segfault such as the one copied below.
> 
> Our IB hardware is:
> InfiniBand: Mellanox Technologies MT26428 [ConnectX VPI PCIe 2.0 5GT/s
> - IB QDR / 10GigE] (rev b0)

Ick.  Mellanox, can you reply?

> dfiala@compute-0-2 ~]$ mpirun -mca btl openib,self  ./mpitest/mpitest
> [compute-0-2:07582] *** Process received signal ***
> [compute-0-2:07582] Signal: Segmentation fault (11)
> [compute-0-2:07582] Signal code: Address not mapped (1)
> [compute-0-2:07582] Failing at address: 0x2
> [compute-0-2:07582] [ 0] /lib64/libpthread.so.0 [0x3ed2e0eb10]
> [compute-0-2:07582] [ 1] /usr/lib64/libmlx4-rdmav2.so [0x2aaaab0de5d1]
> [compute-0-2:07582] [ 2]
> /home/dfiala/openmpi/install-1.5.1/lib/openmpi/mca_btl_openib.so
> [0x2b1637155f15]
> [compute-0-2:07582] [ 3]
> /home/dfiala/openmpi/install-1.5.1/lib/openmpi/mca_bml_r2.so
> [0x2b163691b4b2]
> [compute-0-2:07582] [ 4]
> /home/dfiala/openmpi/install-1.5.1/lib/openmpi/mca_pml_ob1.so
> [0x2b1636d3844f]
> [compute-0-2:07582] [ 5] /home/dfiala/openmpi/install/lib/libmpi.so.1
> [0x2b16347afe37]
> [compute-0-2:07582] [ 6]
> /home/dfiala/openmpi/install/lib/libmpi.so.1(MPI_Init+0xf0)
> [0x2b16347c46d0]
> [compute-0-2:07582] [ 7] ./mpitest/mpitest(main+0x2b) [0x4008d3]
> [compute-0-2:07582] [ 8] /lib64/libc.so.6(__libc_start_main+0xf4) 
> [0x3ed261d994]
> [compute-0-2:07582] [ 9] ./mpitest/mpitest [0x4007f9]
> [compute-0-2:07582] *** End of error message ***
> --------------------------------------------------------------------------
> mpirun noticed that process rank 0 with PID 7582 on node
> compute-0-2.local exited on signal 11 (Segmentation fault).
> --------------------------------------------------------------------------
> 
> Any ideas?
> 
> Thanks for your help,
> David Fiala
> 
> North Carolina State University
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/


Reply via email to