I noticed that when btl_remove_procs is invoked on the openib BTL
(e.g., when you "mpirun --mca btl self,openib ...", an openib endpoint
will be removed because self's exclusivity will edge it out), the
openib remove_procs() function will not remove the corresponding
endpoint on mca_btl_ope
We now use the errmgr.
Aurelien
Le 6 mars 08 à 13:38, Aurélien Bouteiller a écrit :
Aside of what Josh said, we are working right know at UTK on orted/MPI
recovery (without killing/respawning all). For now we had no use of
the errgmr, but I'm quite sure this would be the smartest place to
put
Looks like it works.
Aurelien
Le 6 mars 08 à 10:36, Ralph Castain a écrit :
I believe I have at least helped reduce this with r17761. I added the
ability for procs to detect that their "lifeline" connection (either
the HNP
for unity routed, or their local daemon for tree) has been lost and
I filed this as https://svn.open-mpi.org/trac/ompi/ticket/1233 so that
it would not be forgotten.
On Feb 18, 2008, at 10:53 AM, Tim Prins wrote:
Hi all,
This is a bit strange, so I thought I'd ping the group before digging
any further.
The onesided test 't_win' is failing for us (specifica