Folks,

some time ago, a few crashes were reported when using the Java bindings.
one of them was caused by mca_mtl_psm.so.
the root cause is that the libinfinipath.so initializer sets its own signal handler, which
conflicts with the signal handler set by the JVM.
the only workaround is to disable the psm mtl
(e.g. mpirun --mca mtl ^psm ...)
since mpirun --mca mtl_psm_priority 0 ... does not work
(libinfinipath.so is still loaded, so the initializer is run and the signal handlers are set).
for the same reason, the psm mtl cannot be disabled from the Java MPI_Init()
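
to illustrate the mechanism, here is a minimal sketch in C. it is not taken from libinfinipath or the JVM: all function names are made up, SIGUSR1 is just an example signal, and fake_library_init() is a stand-in for whatever a library initializer does at load time. it only shows that the last sigaction() call wins, so a handler installed earlier is silently replaced.

```c
#define _POSIX_C_SOURCE 200809L
#include <signal.h>
#include <stddef.h>

/* Hypothetical handlers: "first" plays the role of the handler that was
 * installed earlier, "library" the one installed by a library initializer. */
static void first_handler(int signo)   { (void)signo; }
static void library_handler(int signo) { (void)signo; }

/* Install `handler` for `signo`; returns 0 on success, -1 on error. */
static int install_handler(int signo, void (*handler)(int))
{
    struct sigaction sa;
    sa.sa_handler = handler;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = 0;
    return sigaction(signo, &sa, NULL);
}

/* Stand-in for a shared library initializer. In a real shared library this
 * kind of code runs as soon as the library is loaded, before the
 * application gets a chance to say it does not want it. */
static int fake_library_init(int signo)
{
    return install_handler(signo, library_handler);
}

/* Returns 1 if `expected` is the handler currently installed for `signo`,
 * 0 if another handler is installed, -1 on error. */
static int handler_is(int signo, void (*expected)(int))
{
    struct sigaction cur;
    if (sigaction(signo, NULL, &cur) != 0) {
        return -1;
    }
    return cur.sa_handler == expected;
}
```

calling install_handler(SIGUSR1, first_handler) and then fake_library_init(SIGUSR1) leaves library_handler installed, and nothing notifies whoever installed the first handler.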

one option is to document this.
another option is not to build the psm mtl if the Java bindings are built.
and yet another option is to revamp mca_mtl_psm.so so it does not link with libinfinipath.so
(use an intermediate component, or dlopen libinfinipath)
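
a rough sketch of what the dlopen approach could look like is below. this is only an illustration under assumptions, not existing Open MPI code: struct lazy_lib and the lazy_lib_* helpers are hypothetical names, and the exact library name to pass would have to be checked. the point is that the library (and therefore its initializer) is only loaded once the component is actually selected.

```c
#include <dlfcn.h>
#include <stddef.h>

/* Hypothetical wrapper state for a library that is loaded on demand
 * instead of being linked into the component. */
struct lazy_lib {
    const char *soname;   /* e.g. "libinfinipath.so" (assumed name) */
    void       *handle;   /* NULL until the library has been loaded */
};

/* Load the library only if the component is enabled. When `enabled` is 0,
 * dlopen() is never called, so the library initializer never runs.
 * Returns the handle, or NULL if disabled or if loading failed. */
static void *lazy_lib_open(struct lazy_lib *lib, int enabled)
{
    if (!enabled) {
        return NULL;
    }
    if (lib->handle == NULL) {
        lib->handle = dlopen(lib->soname, RTLD_NOW | RTLD_LOCAL);
    }
    return lib->handle;
}

/* Look up a symbol in the library; returns NULL if it is not loaded. */
static void *lazy_lib_sym(struct lazy_lib *lib, const char *name)
{
    if (lib->handle == NULL) {
        return NULL;
    }
    return dlsym(lib->handle, name);
}

/* Unload the library if it was loaded. */
static void lazy_lib_close(struct lazy_lib *lib)
{
    if (lib->handle != NULL) {
        dlclose(lib->handle);
        lib->handle = NULL;
    }
}
```

with something like this, a priority of 0 (or a decision taken in the Java MPI_Init()) could be turned into enabled == 0 before anything from libinfinipath is touched. the price is that every libinfinipath function the mtl uses has to be resolved through dlsym, and older glibc versions need -ldl at link time.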

any thoughts ?

Cheers,

Gilles
