Re: [OMPI users] Ompi failing on mx only

2007-01-09 Thread Grobe, Gary L. (JSC-EV)[ESCG]
> I need it's the backtrace on the process which generate the > segfault. Second, in order to understand the backtrace, it's > better to have run debug version of Open MPI. Without the > debug version we only see the address where the fault occur > without having access to the line number ...

Re: [OMPI users] Ompi failing on mx only

2007-01-08 Thread Grobe, Gary L. (JSC-EV)[ESCG]
> >> PS: Is there any way you can attach to the processes with gdb ? I > >> would like to see the backtrace as showed by gdb in order > to be able > >> to figure out what's wrong there. > > I found out that all processes on the 2nd node crash so I just put a 30 second wait before MPI_Init in

Re: [OMPI users] Ompi failing on mx only

2007-01-08 Thread Grobe, Gary L. (JSC-EV)[ESCG]
> >> PS: Is there any way you can attach to the processes with gdb ? I > >> would like to see the backtrace as showed by gdb in order > to be able > >> to figure out what's wrong there. > > > > When I can get more detailed dbg, I'll send. Though I'm not > clear on > > what executable is being

Re: [OMPI users] Ompi failing on mx only

2007-01-08 Thread Grobe, Gary L. (JSC-EV)[ESCG]
I was wondering if someone could send me the HACKING file so I can do a bit more with debugging on the snapshots. Our web proxy has webdav methods turned off (request methods fail) so that I can't get to the latest of the svn repos. > Second thing. From one of your previous emails, I see that MX

Re: [OMPI users] Ompi failing on mx only

2007-01-05 Thread Grobe, Gary L. (JSC-EV)[ESCG]
Ok, sorry about that last. I think someone just bumped up the required version of Automake. -Original Message- From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Grobe, Gary L. (JSC-EV)[ESCG] Sent: Friday, January 05, 2007 2:29 PM To: Open MPI Users Subject

Re: [OMPI users] Ompi failing on mx only

2007-01-05 Thread Grobe, Gary L. (JSC-EV)[ESCG]
This is just an FYI of the Jan 5th snapshot. I'll send a backtrace of the processes as soon as I get a b3 running. Between my filtered webdav svn access problems and the latest nightly snapshots, my builds are currently failing where the same config lines worked on previous snapshots ...

Re: [OMPI users] Ompi failing on mx only

2007-01-04 Thread Grobe, Gary L. (JSC-EV)[ESCG]
abilities are reduced. Further, the MTL is not well suited to interconnects like TCP or InfiniBand, so we will continue supporting the BTL interface as well. Brian On Jan 2, 2007, at 2:44 PM, Grobe, Gary L. ((JSC-EV))[ESCG] wrote: > About the -x, I've been trying it both ways and pre

Re: [OMPI users] Ompi failing on mx only

2007-01-03 Thread Grobe, Gary L. (JSC-EV)[ESCG]
the BTL interface as well. Brian On Jan 2, 2007, at 2:44 PM, Grobe, Gary L. ((JSC-EV))[ESCG] wrote: > About the -x, I've been trying it both ways and prefer the latter, and > results for either are the same. But it's value is correct. > I've attached the ompi_info from node-1 and node

Re: [OMPI users] Ompi failing on mx only

2007-01-02 Thread Grobe, Gary L. (JSC-EV)[ESCG]
I'm losing it today, I just now noticed I sent mx_info for the wrong nodes ... // node-1 $ mx_info MX Version: 1.1.6 MX Build: ggrobe@juggernaut:/home/ggrobe/Tools/mx-1.1.6 Thu Nov 30 14:17:44 GMT 2006 1 Myrinet board installed. The MX driver is configured to support up to 4 instances and 1024

Re: [OMPI users] Ompi failing on mx only

2007-01-02 Thread Grobe, Gary L. (JSC-EV)[ESCG]
Ah, sorry about that ... $ ./mx_info MX Version: 1.1.6 MX Build: ggrobe@juggernaut:/home/ggrobe/Tools/mx-1.1.6 Thu Nov 30 14:17:44 GMT 2006 1 Myrinet board installed. The MX driver is configured to support up to 4 instances and 1024 nodes.

[OMPI users] Ompi failing on mx only

2007-01-02 Thread Grobe, Gary L. (JSC-EV)[ESCG]
I was initially using 1.1.2 and moved to 1.2b2 because of a hang on MPI_Bcast() which 1.2b2 reports to fix, and seemed to have done so. My compute nodes are 2 dual core xeons on myrinet with mx. The problem is trying to get ompi running on mx only. My machine file is as follows ... node-1