Re: [OMPI devel] [devel-core] [RFC] Runtime Services Layer

Tim Prins Fri, 24 Aug 2007 09:50:02 -0400

George Bosilca wrote:

Looks like I'm the only one barely excited about this idea. The system that you described is well known. It has been around for about 10 years, and it's called PMI. The interface you have in the tmp branch as well as the description you gave in your email are more than similar to what they sketch in the following two documents:
http://www-unix.mcs.anl.gov/mpi/mpich/developer/design/pmiv2draft.htm
http://www-unix.mcs.anl.gov/mpi/mpich/developer/design/pmiv2.htm

Yes, I am well acquainted with these documents, and the PMI did provide a lot of inspiration for the RSL.

Now, there is something wrong with reinventing the wheel if there are no improvements. And so far I'm unable to notice any major improvement compared either with PMI or with what we have today (except maybe being able to use PMI inside Open MPI).

This is true. The RSL is designed to handle exactly what we need right now. This does not mean that the interface cannot be extended later. The current RSL is a starting point.

Again, my main concern is about fault tolerance. There is nothing in PMI (and nothing in RSL so far) that allows any kind of fault tolerance [And believe me re-writing the MPICH mpirun to allow checkpoint/restart is a hassle].

I am open to any extensions that are needed. Again, the current version is designed as a starting point. Also, I have been talking a lot with Josh and the current RSL is more than enough to support checkpoint/restart as currently implemented. I would be interested in talking about any additions that are needed.

Moreover, your approach seems to open the possibility of having heterogeneous RTEs (in terms of features), which in my view is definitely the wrong approach.

Do you mean having different RTEs that support different features? Personally I do not see this as a horrible thing. In fact, we already deal with this problem, since different systems support different things. For instance, we support comm_spawn on most systems, but not all.

I do not understand why a user should have to use an RTE which supports every system ever imagined, and provides every possible fault-tolerant feature, when all they want is a thin RTE.

Tim

   george.

On Aug 16, 2007, at 9:47 PM, Tim Prins wrote:
WHAT: Solicitation of feedback on the possibility of adding a runtime
services layer to Open MPI to abstract out the runtime.

WHY: To solidify the interface between OMPI and the runtime environment,
and to allow the use of different runtime systems, including different
versions of ORTE.

WHERE: Addition of a new framework to OMPI, and changes to many of the
files in OMPI to funnel all runtime requests through this framework. Few
changes should be required in OPAL and ORTE.

WHEN: Development has started in tmp/rsl, but is still in its infancy. We hope
to have a working system in the next month.

TIMEOUT: 8/29/07

------
Short version:
I am working on creating an interface between OMPI and the runtime system.
This would add an RSL framework to OMPI through which all runtime services
would be accessed. Attached is a graphic depicting this.

This change would be invasive to the OMPI layer. Few (if any) changes
will be required of the ORTE and OPAL layers.

At this point I am soliciting feedback as to whether people are
supportive or not of this change both in general and for v1.3.


Long version:

The current model used in Open MPI assumes that one runtime system is
the best for all environments. However, in many environments it may be
beneficial to have specialized runtime systems. With our current system this
is not easy to do.

With this in mind, the idea of creating a 'runtime services layer' was
hatched. This would take the form of a framework within OMPI, through which
all runtime functionality would be accessed. This would allow new or
different runtime systems to be used with Open MPI. Additionally, with such a
system it would be possible to have multiple versions of Open RTE coexisting,
which may facilitate development and testing. Finally, this would solidify the
interface between OMPI and the runtime system, as well as document the
behavior and side effects of each interface function.

However, such a change would be fairly invasive to the OMPI layer, and
needs buy-in from everyone for it to be possible.

Here is a summary of the changes required for the RSL (at least how it is
currently envisioned):

1. Add a framework to ompi for the rsl, and a component to support orte.
2. Change ompi so that it uses the new interface. This involves:
         a. Moving runtime specific code into the orte rsl component.
         b. Changing the process names in ompi to an opaque object.
         c. Changing all references to orte in ompi to be to the rsl.
3. Change the configuration code so that open-rte is only linked where needed.

Of course, all this would happen on a tmp branch.
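
For illustration only, the summary above can be pictured as a table of function pointers that each runtime component fills in, plus a process-name type that the OMPI layer treats as opaque. The sketch below is not taken from the tmp/rsl branch: every identifier in it (rsl_module_t, rsl_proc_name_t, rsl_select, the "toy" component) is a hypothetical name chosen for this example, and a real ORTE component would call into ORTE where the toy functions return fixed values.

```c
#include <stddef.h>

/* Hypothetical opaque process name: upper-layer code is meant to handle
 * only pointers to it and never look at the fields. (In a real tree the
 * struct body would live in the component's private header.) */
typedef struct rsl_proc_name rsl_proc_name_t;

/* Hypothetical function table that each runtime component provides. */
typedef struct {
    const char *component_name;
    int (*init)(void);
    int (*finalize)(void);
    const rsl_proc_name_t *(*get_my_name)(void);
    int (*name_compare)(const rsl_proc_name_t *a, const rsl_proc_name_t *b);
} rsl_module_t;

/* ---- Toy component: stands in for an "orte" component in this sketch ---- */
struct rsl_proc_name { int job; int rank; };

static struct rsl_proc_name toy_self = { 0, 0 };
static int toy_initialized = 0;

static int toy_init(void)     { toy_initialized = 1; return 0; }
static int toy_finalize(void) { toy_initialized = 0; return 0; }

/* Returns NULL until the component has been initialized. */
static const rsl_proc_name_t *toy_get_my_name(void)
{
    return toy_initialized ? &toy_self : NULL;
}

/* Orders names by job first, then by rank; returns -1, 0, or 1. */
static int toy_name_compare(const rsl_proc_name_t *a, const rsl_proc_name_t *b)
{
    if (a->job != b->job) {
        return (a->job > b->job) - (a->job < b->job);
    }
    return (a->rank > b->rank) - (a->rank < b->rank);
}

const rsl_module_t rsl_toy_module = {
    "toy", toy_init, toy_finalize, toy_get_my_name, toy_name_compare
};

/* ---- Upper layer: all runtime requests go through the selected module ---- */
static const rsl_module_t *rsl_selected = NULL;

/* Selects a component and initializes it; returns -1 if none was given. */
int rsl_select(const rsl_module_t *module)
{
    if (module == NULL) {
        return -1;
    }
    rsl_selected = module;
    return module->init();
}
```

After a call such as rsl_select(&rsl_toy_module), the upper layer would obtain its own name with rsl_selected->get_my_name() and compare names only through rsl_selected->name_compare(), so swapping in a different runtime means supplying a different table rather than editing the callers.
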
The design of the rsl is not solidified. I have been playing in a tmp branch
(located at https://svn.open-mpi.org/svn/ompi/tmp/rsl) which everyone is
welcome to look at and comment on, but be advised that things here are
subject to change (I don't think it even compiles right now). There are
some fairly large open questions on this, including:

1. How to handle mpirun (that is, when a user types 'mpirun', do they
always get ORTE, or do they sometimes get a system specific runtime). Most
likely mpirun will always use ORTE, and alternative launching programs would
be used for other runtimes.
2. Whether there will be any performance implications. My guess is not,
but I am not quite sure of this yet.

Again, I am interested in people's comments on whether they think adding
such abstraction is good or not, and whether it is reasonable to do such a
thing for v1.3.

Thanks,
Tim Prins

<RSL-Diagram.pdf>
_______________________________________________
devel-core mailing list
devel-c...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel-core
