I'm assuming that these are Linux hosts. If so, errno 111 is
"connection refused" possibly meaning that there is still some
firewall active or the wrong interface is being used to establish
connections between these machines.
Can you send the output of "ifconfig" (might be /sbin/ifconfig o
This is the ifconfig output from the machine I'm used to submit the
parallel job:
### ifconfig output - master node ###
[root@lcbcpc02 ~]# ifconfig
eth0 Link encap:Ethernet HWaddr 00:15:17:10:53:C8
inet addr:128.178.54.74 Bcast:128.178.54.255 Mask:255.255.255.0
inet6
On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote:
This is the ifconfig output from the machine I'm used to submit the
parallel job:
It looks like both of your nodes share an IP address:
[root@lcbcpc02 ~]# ifconfig
eth1 Link encap:Ethernet HWaddr 00:15:17:10:53:C9
inet add
Jeff Squyres wrote:
> On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote:
>
>
>> This is the ifconfig output from the machine I'm used to submit the
>> parallel job:
>>
>
> It looks like both of your nodes share an IP address:
>
>
>> [root@lcbcpc02 ~]# ifconfig
>> eth1 Link encap:
On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote:
Those nic "eth1" are not connected at all... all the machines use
only the eth0
interface which have different IP for each PC.
Gotcha. But, FWIW, OMPI doesn't know that because they have valid IP
addresses. So it thinks they're on the
Jeff Squyres wrote:
> On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote:
>
>
>> Those nic "eth1" are not connected at all... all the machines use
>> only the eth0
>> interface which have different IP for each PC.
>>
>
> Gotcha. But, FWIW, OMPI doesn't know that because they have valid
Anyone else who may provide some feedback/comments on this issue? How
typical/widespread is the use of multiple interconnects in the HPC
community? Judging from the feedback I'm getting in this thread, it
appears that this is fairly uncommon?
Thanks for your attention to this thread.
Alex.
On 2