Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Jeff Squyres
I'm assuming that these are Linux hosts. If so, errno 111 is "connection refused" possibly meaning that there is still some firewall active or the wrong interface is being used to establish connections between these machines. Can you send the output of "ifconfig" (might be /sbin/ifconfig o

Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Matteo Guglielmi
This is the ifconfig output from the machine I'm used to submit the parallel job: ### ifconfig output - master node ### [root@lcbcpc02 ~]# ifconfig eth0 Link encap:Ethernet HWaddr 00:15:17:10:53:C8 inet addr:128.178.54.74 Bcast:128.178.54.255 Mask:255.255.255.0 inet6

Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Jeff Squyres
On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote: This is the ifconfig output from the machine I'm used to submit the parallel job: It looks like both of your nodes share an IP address: [root@lcbcpc02 ~]# ifconfig eth1 Link encap:Ethernet HWaddr 00:15:17:10:53:C9 inet add

Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Matteo Guglielmi
Jeff Squyres wrote: > On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote: > > >> This is the ifconfig output from the machine I'm used to submit the >> parallel job: >> > > It looks like both of your nodes share an IP address: > > >> [root@lcbcpc02 ~]# ifconfig >> eth1 Link encap:

Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Jeff Squyres
On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote: Those nic "eth1" are not connected at all... all the machines use only the eth0 interface which have different IP for each PC. Gotcha. But, FWIW, OMPI doesn't know that because they have valid IP addresses. So it thinks they're on the

Re: [OMPI users] openMPI 1.1.4 - connect() failed with errno=111

2007-02-12 Thread Matteo Guglielmi
Jeff Squyres wrote: > On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote: > > >> Those nic "eth1" are not connected at all... all the machines use >> only the eth0 >> interface which have different IP for each PC. >> > > Gotcha. But, FWIW, OMPI doesn't know that because they have valid

Re: [OMPI users] NetPipe benchmark & spanning multiple interconnects

2007-02-12 Thread Alex Tumanov
Anyone else who may provide some feedback/comments on this issue? How typical/widespread is the use of multiple interconnects in the HPC community? Judging from the feedback I'm getting in this thread, it appears that this is fairly uncommon? Thanks for your attention to this thread. Alex. On 2