Hello,
 
I try to profile a parallel code on a large cluster
(8cores per node, Intel_MPI & Open_MPI available)
with mpirun ... valgrind --tool=callgrind --base=$PWD/callgrind.out ./exe
 
++ Everything seems fine when I profile from 1 to 8 cores
 (1 node) and CPU times is ok (10min with 8cores) 
- But, I get some empty callgrind.out.* files when I profile from 16 cores 
to XXX (and computing on different nodes) ;
- And CPU times dramatically increase to ~1day since 16cores
 
What would you suggest?
Exclude all warnings with --gen-suppressions?
Network options?
Switch-off mpi-profiling? How?
Try to reinstall valgrind?
 
Thanks for your attention,
 
Fabien



------------------------------------------------------------------------------
Free Software Download: Index, Search & Analyze Logs and other IT data in 
Real-Time with Splunk. Collect, index and harness all the fast moving IT data 
generated by your applications, servers and devices whether physical, virtual
or in the cloud. Deliver compliance at lower cost and gain new business 
insights. http://p.sf.net/sfu/splunk-dev2dev 
_______________________________________________
Valgrind-users mailing list
Valgrind-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/valgrind-users

Reply via email to