Does this include a fix for the problem I reported with mpirun-hosted processes?
If not I would ask that we holding off on putting it into the trunk until that particular bug is addressed. From my experience tackling this particular issues requires some code refactoring, which should probably be done once in the trunk instead of two possibly disruptive commits. -- Josh On Fri, Jun 17, 2011 at 5:18 PM, Wesley Bland <wbl...@eecs.utk.edu> wrote: > This is a reminder that the Resilient ORTE RFC is set to go into the trunk > on Monday at COB. > I've updated the code with a few of the changes that were mentioned on and > off the list (moved code out of orted_comm.c, errmgr_set_callback returns > previous callback, post_startup function, corrected normal termination > issues). Please take another look at it if you have any interest. The code > can be found here: > https://bitbucket.org/wesbland/resilient-orte/ > Thanks, > Wesley Bland -- Joshua Hursey Postdoctoral Research Associate Oak Ridge National Laboratory http://users.nccs.gov/~jjhursey