Dear All

I have just been trying out SCALI MPI with Valgrind and have observed that
when running across multiple nodes the processes seem to end up spinning in
MPI_Init. Smaller problems running with shared memory appear fine.

The configuration is: 

8 nodes connected by InfiniBand
2 sockets per node with 1 MPI process each
Plenty of spare memory
No competing processes; memcheck is at 100% CPU

My hunch is that there is some weird packet-dropping effect, but I am hoping
someone out there has an idea. Is there a way to tell Valgrind to skip
instrumenting libmpi.so? Either I didn't read the manual well enough or it
breaks a fundamental principle (which I suspect it might).
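
(The nearest mechanism I could find is a suppression file. As far as I can tell it only filters the errors Memcheck reports from a library and does not stop that library being instrumented, so it would not help with the spinning. A rough sketch for illustration only: the suppression name is made up, and the obj: pattern assumes the library is loaded from a path ending in libmpi.so followed by an optional version suffix.)

```
# mpi.supp (hypothetical file name)
{
   hypothetical_libmpi_cond
   Memcheck:Cond
   obj:*/libmpi.so*
}
```

This would be passed to Valgrind with --suppressions=mpi.supp. One such entry is needed for each error kind to be hidden (Memcheck:Cond covers only conditional jumps on uninitialised values).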

Thanks

Dominic

Schlumberger
Abingdon Technology Center
Direct Tel:               +44 (0) 1235 857869
Switchboard Tel:  +44 (0) 1235 559595


