This helped me track down a leaked file descriptor, but I think the
order of events is not desirable.  If an MPIIO file descriptor is not
closed before MPI_Finalize, I get the following.


*** An error occurred in MPI_Barrier
*** after MPI was finalized
*** MPI_ERRORS_ARE_FATAL (your MPI job will now abort)
[brakk:1193] Abort after MPI_FINALIZE completed successfully; not able to 
guarantee that all other processes were killed!
[Switching to Thread 0x7fa523b78710 (LWP 1193)]

Breakpoint 2, 0x00007fa51ed39a20 in exit () from /lib/libc.so.6
(gdb) bt
#0  0x00007fa51ed39a20 in exit () from /lib/libc.so.6
#1  0x00007fa520ff6613 in ompi_mpi_abort () from /usr/lib/libmpi.so.0
#2  0x00007fa520fe59b7 in ompi_mpi_errors_are_fatal_comm_handler () from 
/usr/lib/libmpi.so.0
#3  0x00007fa52100acb2 in PMPI_Barrier () from /usr/lib/libmpi.so.0
#4  0x00007fa52106638a in mca_io_romio_dist_MPI_File_close () from 
/usr/lib/libmpi.so.0
#5  0x00007fa520feaa2e in file_destructor () from /usr/lib/libmpi.so.0
#6  0x00007fa520fea7c1 in ompi_file_finalize () from /usr/lib/libmpi.so.0
#7  0x00007fa520ff7496 in ompi_mpi_finalize () from /usr/lib/libmpi.so.0
#8  0x00007fa5233bc2d1 in PetscFinalize () at pinit.c:897
#9  0x0000000000402091 in main (argc=1, args=0x7fff70f1f498) at ex5.c:72



Open MPI 1.3.3, GCC-4.4.0
Linux brakk 2.6.30-ARCH #1 SMP PREEMPT Fri Jun 19 20:44:03 UTC 2009 x86_64 
Intel(R) Core(TM)2 Duo CPU T9300 @ 2.50GHz GenuineIntel GNU/Linux

Jed

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to