Hi,
On my sl51 (32-bit) boxes mpif90 objects to the 'integer :: ierror'
line. Maybe my compiler is feeling odd...
Replacing it with 'integer ierror' lets it compile for me.
I have no problem with the compilation. The problems occur only when
trying to run the resulting executable file.
If I run the result I get no hang, but then I don't get much useful
output either :-)
$ mpirun ./x
libibverbs: Fatal: couldn't read uverbs ABI version.
--------------------------------------------------------------------------
[0,1,0]: OpenIB on host unfair.damtp.cam.ac.uk was unable to find any
HCAs.
Another transport will be used instead, although this may result in
lower performance.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
[0,1,0]: uDAPL on host unfair.damtp.cam.ac.uk was unable to find any
NICs.
Another transport will be used instead, although this may result in
lower performance.
--------------------------------------------------------------------------
The piece of code I posted in my previous message doesn't output
anything. If the executable runs successfully, it prints no data, but
it shouldn't hang. It just makes use of the MPI library.
In case it matters that box has:
$ rpm -q sl-release gcc-gfortran openmpi openmpi-devel
sl-release-5.1-2.i386
gcc-gfortran-4.1.2-14.el5.i386
openmpi-1.2.3-4.el5.i386
openmpi-devel-1.2.3-4.el5.i386
The problem seems to be with the OpenIB support. I compiled OpenMPI from
source without OpenIB support, and I also built a new RPM without it;
in both cases I have no problems when executing the file.
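
For anyone who would rather not rebuild, it may also be possible to skip
those transports at run time through Open MPI's MCA parameters. This is
only a sketch: I am assuming the stock packages accept the usual
"--mca btl" exclusion syntax, and I have not confirmed that it avoids the
hang in the same way a build without OpenIB does. The MCA_ARGS variable
and "-np 2" (for the dual-core machine) are my own choices; ./x is the
executable from the mpirun example above.

```shell
# Assumption: excluding the openib and udapl BTL components at run time
# has a similar effect to building Open MPI without OpenIB support.
# The leading "^" negates the whole comma-separated list that follows.
MCA_ARGS="--mca btl ^openib,udapl"

# Only launch if mpirun and the test executable are actually present.
if command -v mpirun >/dev/null 2>&1 && [ -x ./x ]; then
    mpirun $MCA_ARGS -np 2 ./x
else
    echo "would run: mpirun $MCA_ARGS -np 2 ./x"
fi
```

If that works, the same setting can be made persistent with the
OMPI_MCA_btl environment variable instead of passing it on every run.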
Debugging mpi problems is always a bit of a nightmare. BTW how many
processors were you running it on?
I'm still worrying about updating our systems to the (newer) openmpi
from sl52 as the package maintainers have switched from alternatives
(which I sort of understand) to mpi-selector (which I don't)...
I ran it on just a dual-core desktop machine. I tried the SL5.2
packages as well, but I had the same problems.
Cheers,
Panos