Jeff Squyres wrote:
> On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote:
>
>
>> Those nic "eth1" are not connected at all... all the machines use
>> only the eth0
>> interface which have different IP for each PC.
>>
>
> Gotcha. But, FWIW, OMPI doesn't know that because they have valid
On Feb 12, 2007, at 2:34 PM, Matteo Guglielmi wrote:
Those nic "eth1" are not connected at all... all the machines use
only the eth0
interface which have different IP for each PC.
Gotcha. But, FWIW, OMPI doesn't know that because they have valid IP
addresses. So it thinks they're on the
Jeff Squyres wrote:
> On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote:
>
>
>> This is the ifconfig output from the machine I'm used to submit the
>> parallel job:
>>
>
> It looks like both of your nodes share an IP address:
>
>
>> [root@lcbcpc02 ~]# ifconfig
>> eth1 Link encap:
On Feb 12, 2007, at 12:54 PM, Matteo Guglielmi wrote:
This is the ifconfig output from the machine I'm used to submit the
parallel job:
It looks like both of your nodes share an IP address:
[root@lcbcpc02 ~]# ifconfig
eth1 Link encap:Ethernet HWaddr 00:15:17:10:53:C9
inet add
This is the ifconfig output from the machine I'm used to submit the
parallel job:
### ifconfig output - master node ###
[root@lcbcpc02 ~]# ifconfig
eth0 Link encap:Ethernet HWaddr 00:15:17:10:53:C8
inet addr:128.178.54.74 Bcast:128.178.54.255 Mask:255.255.255.0
inet6
I'm assuming that these are Linux hosts. If so, errno 111 is
"connection refused" possibly meaning that there is still some
firewall active or the wrong interface is being used to establish
connections between these machines.
Can you send the output of "ifconfig" (might be /sbin/ifconfig o
Since I've installed openmpi I cannot submit any job that uses cpus from
different machines.
### hostfile ###
lcbcpc02.epfl.ch slots=4 max-slots=4
lcbcpc04.epfl.ch slots=4 max-slots=4
### error message ###
[matteo@lcbcpc02 TEST]$ mpirun --hostfile ~matteo/hostfile -np 8
/home/mat