Hi Mondrian, On Mon, 02 Nov 2009 13:22:11 +0100 Mondrian Nuessle <nues...@uni-hd.de> wrote: > > If I turn on mpi_leave_pinned (and thus the registration cache is > actually used), I see occasional memory corruption issues for example > when I call MPI_Allreduce often. > > Debugging with valgrind did not lead to any clues, since OMPI refuses > to run in that case. If I turn off mpi_leave_pinned, everything seems > to be fine. > > I tested on version 1.3.3 and 1.3.4rc1. > > Do you have any suggestions how to investigate this situation?
Have you got OMPI_ENABLE_DEBUG defined? The symptoms of what you are seeing sound like what might happen if debug is off and you trigger an issue I posted about here related to thread safety of mpool. http://www.open-mpi.org/community/lists/devel/2009/10/7042.php If OMPI_ENABLE_DEBUG is defined it will abort if pthread_mutex_lock returns EDEADLK (see opal_mutex_lock), but if not, the code proceeds without a lock which could cause memory corruption. Chris -- cy...@au.ibm.com