This helped me track down a leaked file descriptor, but I think the order of events is not desirable. If an MPIIO file descriptor is not closed before MPI_Finalize, I get the following.
*** An error occurred in MPI_Barrier *** after MPI was finalized *** MPI_ERRORS_ARE_FATAL (your MPI job will now abort) [brakk:1193] Abort after MPI_FINALIZE completed successfully; not able to guarantee that all other processes were killed! [Switching to Thread 0x7fa523b78710 (LWP 1193)] Breakpoint 2, 0x00007fa51ed39a20 in exit () from /lib/libc.so.6 (gdb) bt #0 0x00007fa51ed39a20 in exit () from /lib/libc.so.6 #1 0x00007fa520ff6613 in ompi_mpi_abort () from /usr/lib/libmpi.so.0 #2 0x00007fa520fe59b7 in ompi_mpi_errors_are_fatal_comm_handler () from /usr/lib/libmpi.so.0 #3 0x00007fa52100acb2 in PMPI_Barrier () from /usr/lib/libmpi.so.0 #4 0x00007fa52106638a in mca_io_romio_dist_MPI_File_close () from /usr/lib/libmpi.so.0 #5 0x00007fa520feaa2e in file_destructor () from /usr/lib/libmpi.so.0 #6 0x00007fa520fea7c1 in ompi_file_finalize () from /usr/lib/libmpi.so.0 #7 0x00007fa520ff7496 in ompi_mpi_finalize () from /usr/lib/libmpi.so.0 #8 0x00007fa5233bc2d1 in PetscFinalize () at pinit.c:897 #9 0x0000000000402091 in main (argc=1, args=0x7fff70f1f498) at ex5.c:72 Open MPI 1.3.3, GCC-4.4.0 Linux brakk 2.6.30-ARCH #1 SMP PREEMPT Fri Jun 19 20:44:03 UTC 2009 x86_64 Intel(R) Core(TM)2 Duo CPU T9300 @ 2.50GHz GenuineIntel GNU/Linux Jed
signature.asc
Description: OpenPGP digital signature