which off-list are we talking about?
very annoying.

2015-08-25 10:38 GMT-06:00 Ralph Castain <r...@open-mpi.org>:

> We’re looking at this off-list. It would be preferable not to disable PSM
> if we can avoid it
>
> On Aug 25, 2015, at 9:32 AM, Nathaniel Graham <nrgraha...@gmail.com>
> wrote:
>
> What if we modify the mpirun script to include the --mca mtl ^psm tag if
> java is in the run string?
>
> -Nathan
>
> On Tue, Aug 25, 2015 at 9:47 AM, Howard Pritchard <hpprit...@gmail.com>
> wrote:
>
>> I'll update the java FAQ.
>>
>> 2015-08-25 8:36 GMT-06:00 Jeff Squyres (jsquyres) <jsquy...@cisco.com>:
>>
>>> On Aug 25, 2015, at 10:00 AM, Howard Pritchard <hpprit...@gmail.com>
>>> wrote:
>>> >
>>> > I think rather than trying workarounds of dubious robustness inside
>>> open mpi we
>>> >
>>> > - dicument the issue on either the somewhat aged open mpi website faq
>>> or add it to a wiki page on github
>>>
>>> It should probably be documented in the README and the FAQ.
>>>
>>> I'd be against adding user documentation to the wiki -- this would be a
>>> 3rd place for users to look for information.
>>>
>>> > - file a bug against  intel psm
>>>
>>> I'd like to hear what they have to say first... :-)
>>>
>>> >
>>> > ----------
>>> >
>>> > sent from my smart phonr so no good type.
>>> >
>>> > Howard
>>> >
>>> > On Aug 25, 2015 6:02 AM, "Gilles Gouaillardet" <
>>> gilles.gouaillar...@gmail.com> wrote:
>>> > i do not know if this can be runtime detected ...
>>> > note we should report this to intel folks and ask them to advise.
>>> > ideally, they would provide a way to make sure libinfinipath.so does
>>> not conflict with the jvm signal handlers.
>>> >
>>> > my idea is to dlopen libinfinipath only if java bindings are not used.
>>> >
>>> > On Tuesday, August 25, 2015, Jeff Squyres (jsquyres) <
>>> jsquy...@cisco.com> wrote:
>>> > Is it possible to run-time detect this situation?  E.g., probe the
>>> signal handler, or somesuch.
>>> >
>>> > Rationale: I'd rather have something run-time disabled than not built.
>>> >
>>> > Would dlopen'ing libinfinipath change actually change its signal
>>> handler behavior?
>>> >
>>> >
>>> > > On Aug 25, 2015, at 4:27 AM, Gilles Gouaillardet <gil...@rist.or.jp>
>>> wrote:
>>> > >
>>> > > Folks,
>>> > >
>>> > > some time ago, some crashes were reported when using java bindings.
>>> > > one of them was caused was caused by mca_mtl_psm.so.
>>> > > the root cause is libinfinipath.so initializer sets its own signal
>>> handler, which
>>> > > conflicts with the signal handler sets by the jvm.
>>> > > the only workaround is to disable the psm mtl
>>> > > (e.g. mpirun --mca mtl ^psm ...)
>>> > > since mpirun --mca mtl_psm_priority 0 ... does not work
>>> > > (libinfinipath.so is loaded, so the initializer is ran and the
>>> signal handlers are set)
>>> > > so the psm mtl cannot be disabled by the Java MPI_Init()
>>> > >
>>> > > one option is to document this
>>> > > an other option is not to build the psm mtl if java bindings are
>>> built
>>> > > and an other option is to revamp mca_mtl_psm.so so it does not link
>>> with libinfinipath.so
>>> > > (use an intermediate component, or dlopen libinfinipath)
>>> > >
>>> > > any thoughts ?
>>> > >
>>> > > Cheers,
>>> > >
>>> > > Gilles
>>> > > _______________________________________________
>>> > > devel mailing list
>>> > > de...@open-mpi.org
>>> > > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> > > Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2015/08/17838.php
>>> >
>>> >
>>> > --
>>> > Jeff Squyres
>>> > jsquy...@cisco.com
>>> > For corporate legal information go to:
>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>> >
>>> > _______________________________________________
>>> > devel mailing list
>>> > de...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2015/08/17840.php
>>> >
>>> > _______________________________________________
>>> > devel mailing list
>>> > de...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2015/08/17841.php
>>> > _______________________________________________
>>> > devel mailing list
>>> > de...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2015/08/17845.php
>>>
>>>
>>> --
>>> Jeff Squyres
>>> jsquy...@cisco.com
>>> For corporate legal information go to:
>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>>
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2015/08/17847.php
>>>
>>
>>
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2015/08/17849.php
>>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/08/17851.php
>
>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/08/17852.php
>

Reply via email to