Re: [OMPI users] "Connection to lifeline" with openmpi-1.4.5

2012-03-03 Thread Barnet Wagman
Is the job completing? Usually this message appears because mpirun terminates before everything else does. Only concern I have is that the process that issued your example message is an application process, but I'm assuming it was running local to mpirun - yes? No the job is not

Re: [OMPI users] "Connection to lifeline" with openmpi-1.4.5

2012-03-03 Thread Ralph Castain
On Mar 1, 2012, at 10:47 PM, Barnet Wagman wrote: > I've run into a problem upgrading from 1.4.3 to 1.4.4 or 1.4.5 > > With 1.4.4 and 1.4.5, I'm getting error messages like > > [[59597,1],0] routed:binomial: Connection to lifeline [[59597,0],0] lost > > The error does not occur if I restrict

[OMPI users] "Connection to lifeline" with openmpi-1.4.5

2012-03-02 Thread Barnet Wagman
I've run into a problem upgrading from 1.4.3 to 1.4.4 or 1.4.5 With 1.4.4 and 1.4.5, I'm getting error messages like [[59597,1],0] routed:binomial: Connection to lifeline [[59597,0],0] lost The error does not occur if I restrict the host list to localhost. Basic tests like 'mpirun hello_c'