On Thu, 30 Apr 2009, Ralph Castain wrote:
well, that's only because the code's doing something it shouldn't.
 Have a look at comm_cid.c:185 - there's the check we added to the
multi-threaded case (which was the only case when we added it).
 The cid generation should never generate a number larger than
mca_pml.pml_max_contextid. I'm actually somewhat amazed this fails
gracefully, as OB1 doesn't appear to check it got a valid cid in
add_comm, which it should probably do.

Actually, as an FYI: it doesn't fail gracefully. It just hangs...ick.

Ah.  Patch to change the hang into an abort coming RSN.

Brian

Reply via email to