I have experienced problems running many MPI processes concurrently.
Some of the MPI processes run fine (the first started) while the others
hang or have very very slow progress.

I have dual socket quad core SUN 2200 nodes and Mellanox cards.
Se below. 

I have tried the OFED 1.2.5 stack and the OFED 1.4rc3 stack.


Any suggestions about settings or increments of buffers, tokens etc is
welcome.

An example :
Barrier benchmark :
Barrier size  9 iterations 32768 [8 procs - Resolution 0.95us]
9 nodes 12186.93 us

A barrier using 9 nodes should not take 12 milliseconds.
One barrier normally takes 11.20 microseconds using 9 nodes.


Some background information :

Stack: OFED 1.4rc3
Card : InfiniBand: Mellanox Technologies: Unknown device 634a (rev a0)


Best regards,
Ole W. Saastad




-- 
Ole W. Saastad, dr. scient.
Scientific Computing Group, USIT, University of Oslo
http://hpc.uio.no

_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to