What Ken put in is what is needed for the limited multi-cluster capabilities we need, just one additional string. I don't think there is a need for any discussion of such a small change.
Rich On 9/22/08 1:32 PM, "Ralph Castain" <r...@lanl.gov> wrote: > We really should discuss that as a group first - there is quite a bit > of code required to actually support multi-clusters that has been > removed. > > Our operational model that was agreed to quite a while ago is that > mpirun can -only- extend over a single "cell". You can connect/accept > multiple mpiruns that are sitting on different cells, but you cannot > execute a single mpirun across multiple cells. > > Please keep this on your own development branch for now. Bringing it > into the trunk will require discussion as this changes the operating > model, and has significant code consequences when we look at abnormal > terminations, comm_spawn, etc. > > Thanks > Ralph > > On Sep 22, 2008, at 11:26 AM, Richard Graham wrote: > >> This check in was in error - I had not realized that the checkout >> was from >> the 1.3 branch, so we will fix this, and put these into the trunk >> (1.4). We >> are going to bring in some limited multi-cluster support - limited >> is the >> operative word. >> >> Rich >> >> >> On 9/22/08 12:50 PM, "Jeff Squyres" <jsquy...@cisco.com> wrote: >> >>> I notice that Ken Matney (the committer) is not on the devel list; I >>> added him explicitly to the CC line. >>> >>> Ken: please see below. >>> >>> >>> On Sep 22, 2008, at 12:46 PM, Ralph Castain wrote: >>> >>>> Whoa! We made a decision NOT to support multi-cluster apps in OMPI >>>> over a year ago! >>>> >>>> Please remove this from 1.3 - we should discuss if/when this would >>>> even be allowed in the trunk. >>>> >>>> Thanks >>>> Ralph >>>> >>>> On Sep 22, 2008, at 10:35 AM, mat...@osl.iu.edu wrote: >>>> >>>>> Author: matney >>>>> Date: 2008-09-22 12:35:54 EDT (Mon, 22 Sep 2008) >>>>> New Revision: 19600 >>>>> URL: https://svn.open-mpi.org/trac/ompi/changeset/19600 >>>>> >>>>> Log: >>>>> Added member to orte_node_t to enable multi-cluster jobs in ALPS >>>>> scheduled systems (like Cray XT). >>>>> >>>>> Text files modified: >>>>> branches/v1.3/orte/runtime/orte_globals.h | 4 ++++ >>>>> 1 files changed, 4 insertions(+), 0 deletions(-) >>>>> >>>>> Modified: branches/v1.3/orte/runtime/orte_globals.h >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> = >>>>> =================================================================== >>>>> --- branches/v1.3/orte/runtime/orte_globals.h (original) >>>>> +++ branches/v1.3/orte/runtime/orte_globals.h 2008-09-22 12:35:54 >>>>> EDT (Mon, 22 Sep 2008) >>>>> @@ -222,6 +222,10 @@ >>>>> /** Username on this node, if specified */ >>>>> char *username; >>>>> char *slot_list; >>>>> + /** Clustername (machine name of cluster) on which this node >>>>> + resides. ALPS scheduled systems need this to enable >>>>> + multi-cluster support. */ >>>>> + char *clustername; >>>>> } orte_node_t; >>>>> ORTE_DECLSPEC OBJ_CLASS_DECLARATION(orte_node_t); >>>>> >>>>> _______________________________________________ >>>>> svn mailing list >>>>> s...@open-mpi.org >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/svn >>>> >>>> _______________________________________________ >>>> devel mailing list >>>> de...@open-mpi.org >>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>> >> >> _______________________________________________ >> devel mailing list >> de...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > > _______________________________________________ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel