As the error message says, one possible cause is the connection being blocked by a firewall.

Another possible cause is a ssh passwordless access problem:

https://stackoverflow.com/questions/19565795/unable-to-execute-mpich2-on-multiple-machines-on-ubuntu-12-04-hydu-sock-connect

Yet, another possible cause is a problem resolving the DNS hostname:

https://forums.suse.com/archive/index.php/t-6057.html
https://www.slothparadise.com/running-mpi-common-mpi-troubleshooting-problems/

Since /etc/hosts usually cannot be edited by a user, the cluster administrator would have to fix the hosts file if that happens to be the source of the problem.

On 7/18/2018 6:07 PM, karima Physique wrote:
Dear wien2k users:

Using the folowing machines files :
lapw0:master:12
dstart:master:12
1:master:12
1:node1:12
1:node2:12
......
the calculation works very well, but using the following machines file:
lapw0:master:12 node1:12 node2:12
dstart:master:12 node1:12 node2:12
1:master:12
1:node1:12
1:node2:12
.......
I got the following error:

unable to get host adress calcul.local for (1)
unable to connect to server calcul.local at port 44295 (chek for firewalls!)
we note that calcul.local is the host to connect to w2web.
I ask you any suggestions to solve this problem
_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at:  
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

Reply via email to