Dear EasyBuilders,
I am facing a problem with the OpenMPI/1.4.5-GCC-4.6.3-no-OFED module. It 
doesn't work with some software packages (NWChem, for example) and gives an 
error message similar to this one:

[comp023.local][[32496,1],72][btl_tcp_endpoint.c:638:mca_btl_tcp_endpoint_complete_connect]
 connect() to 192.168.30.24 failed: Connection refused (111)

I have installed the same OpenMPI version manually and it worked fine. Also, I 
have installed another version of OpenMPI using EasyBuild and it worked fine. 
This module matters because it is the one included in the goalf-1.1 module, 
which has become very popular on our system at BA, and we always use it to 
build our software.

What I understand from this error is that some network interfaces are not 
reachable by MPI. On the other hand, those interfaces work fine with the other 
versions of MPI and don't give any errors. I don't know if this is relevant or 
not, but the error is reported on the InfiniBand network (192.168.30.0 subnet).
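
To make the subnet observation concrete, here is a minimal sketch of how the 
failing addresses could be pulled out of a job log and checked against the 
InfiniBand subnet. The /24 prefix length and the helper names 
(failed_addresses, on_ib_subnet) are my assumptions for illustration; only the 
error text and the 192.168.30.0 subnet come from our system.

```python
import ipaddress
import re

# Assumption: the InfiniBand subnet is a /24. Only "192.168.30.0 subnet"
# is known for certain; adjust the prefix length to match the real netmask.
IB_SUBNET = ipaddress.ip_network("192.168.30.0/24")

# Matches the address in TCP BTL errors of the form shown above.
CONNECT_RE = re.compile(r"connect\(\) to (\d+\.\d+\.\d+\.\d+) failed")


def failed_addresses(log_text):
    """Return the IP addresses named in 'connect() to ... failed' messages."""
    return [ipaddress.ip_address(a) for a in CONNECT_RE.findall(log_text)]


def on_ib_subnet(addr):
    """Return True if addr lies inside the assumed InfiniBand subnet."""
    return addr in IB_SUBNET


sample = (
    "[comp023.local][[32496,1],72][btl_tcp_endpoint.c:638:"
    "mca_btl_tcp_endpoint_complete_connect]\n"
    " connect() to 192.168.30.24 failed: Connection refused (111)\n"
)

for addr in failed_addresses(sample):
    print(addr, "on InfiniBand subnet:", on_ib_subnet(addr))
```

Running this over a full job log would show whether every refused connection 
targets the InfiniBand subnet or whether other interfaces are affected too.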

Any ideas regarding troubleshooting or solutions to this problem?

Best Regards,
Mohammed Gaafar
HPC System Administrator
Supercomputer Project
International School of Information Science
Bibliotheca Alexandrina
Tel: +20 3 4839999 Ext.: 1453
Cell: +201061822670 / +201117223299
