Re: [OMPI devel] BTL move - the notion

Jeff Squyres Thu, 11 Dec 2008 13:54:38 -0500

(chiming in a bit after the fact)

In general, I agree with most of what has been stated.

1. The BTLs should remain "owned" by Open MPI. There are OMPI member organizations in multiple projects that want to use the BTLs, but the BTLs are primarily for the Open MPI project.

2. An incremental patch approach would likely be best; my definition of that would be "small branch and merge". I strongly endorse hg or git for this; they are *VERY* good at exactly this kind of thing. Much, much larger code bases than OMPI pervasively use hg/git for the branch/patch/merge model with very good success. If you "grew up" on CVS/SVN (and earlier), this may seem counter-intuitive -- but please realize that tools have evolved significantly since then.

3. Moving the BTL code to different parts of the source tree won't matter much in terms of performance and (mostly) abstractions. But we should check, just to make sure we didn't muck something up. This is a complex code base, after all.

4. Adding new functionality to the BTL (e.g., bootstrapping) is subject to #1.

5. Ralph outlined the case for tighter integration with the RTE and the BTLs. I think it's reasonable, and I agree with his case. We can add abstractions to ensure that nothing is ORTE-specific and to ensure that we can safely handle the case where some other underlying RTE doesn't have the same capabilities (none of this stuff is likely to be in the performance-critical code path, so it's not too much of an issue). But allowing other RTEs under the OMPI MPI layer shouldn't restrict what we want/can do with our own OMPI-specific RTE.


Just my $0.00000000000002....



On Dec 5, 2008, at 11:10 AM, Richard Graham wrote:

> I think we all agree that STCI and OMPI have different objectives and requirements. OMPI is facing the need to launch and operate at extreme scales by next summer, has received a lot of interest in having it report errors into various systems, etc. We don't have all the answers as to what will be necessary to meet these requirements, but indications so far are that tighter integration, not deeper abstraction, between the various layers will be needed. By that, I don't mean we will violate abstraction layers, but rather that the various layers need to work more as a tightly tuned instrument, with each layer operating based on a clear knowledge of how the other layers are functioning.
OMPI and STCI are two different things altogether, and I have a vested interest in both, and have no desire to have either go south. You have a set of requirements at LANL which are important, and we also have a set of requirements at ORNL, and as such we need to compromise on these in the code base. We have MPI level goals, which will be accomplished in the OMPI code base, and tools and other related goals that will be accomplished in other code bases. We both have the need to function well at the high end, so have the same set of goals there.

>
> For example, for modex-less operations, the MPI/BTLs have to know that the RTE/OS will be providing certain information. This means that they don't have to go out and discover it themselves every time. Yes, we will leave that as the default behavior so that small and/or unmanaged clusters can operate, but we have to also introduce logic that can detect when we are utilizing this alternative capability and exploit it. While we are trying our best to avoid introducing RTE-like calls into the code, the fact is that we may well have to do so (we have already identified one btl that will definitely need to). It is simply too early to make the decision to cut that off now - we don't know what the long-term impacts of such a decision will be.
This is where discussions will need to go both ways. Your changes also can impact us, and we need to agree to those changes, just as much as you need to agree with the changes we are proposing. This is not a code base focused on a single institution's requirements, and we all do our best (and I believe tend to succeed) at helping meet all of our needs.

>
> Finally, although I don't do much on the MPI layer, I am concerned about performance. I would tend to oppose any additional abstraction until we can measure the performance impact. Thus, I would like to see the BTL move done on a tmp branch (technology to branch up to the implementer - I don't care) so we can verify that it isn't hurting us in some unforeseeable manner.
Agreed - at least for the last phase of what we are suggesting, but we can talk about this. I am a bit confused about how the location of the source code has anything to do with how it performs at run-time. At this stage we have said nothing about changing the way the btl works, just cosmetic things. When it comes to enabling the use of stci with ompi, then these issues will come up, and need to be addressed very carefully. To be honest, since we don't want to change the btl's (aside from adding some attributes) I don't expect this to be an issue, UNLESS we end up needing to change some data structures for abstraction purposes. This is where we need to be very careful. If you look at what has happened with the btl's (actually first the PTL's) historically, I have been one of the ones pushing hard for improved performance - why would this change now ?

>
>
> So I guess my concerns really boil down to dealing with conflicting schedules and requirements, how to support multiple possibly competing groups that want to share one or more parts of our code base, and retaining an OMPI-first philosophy when it comes to what changes get made. My proposed solution is:
This is the problem we face all the time, and on a regular basis we as a community do our best to help each other out. This is one of the reasons 1.3 is as late as it is, and this is a good thing that will continue as long as this is a community project.

>
> 1. shift our repository to a technical solution that supports broader code sharing
>
> 2. have the non-OMPI groups access our code base via that technology. They can "pull" changes at will, subject to the licensing agreement. It is true that they may have to do some local editing if the change hits a spot where they have local mods to support their system, but both Hg and GIT are very good at handling this - much better than svn ever has been.
>
> 3. if there are minor mods required to make the BTL code area easier to share via the above methods, then we should explore and implement them. Certainly, renaming #define values would seem a no-brainer. I suspect there are other similar things that could be done. Removing orte/opal dependencies is more controversial and would need to be thoroughly examined.
>
> 4. OMPI decides what changes get made to its code base. We are polite about it and talk to the other groups to try and minimize impact, but ultimately we do what is best for OMPI, and send out notifications (perhaps a new mailing list specifically for that purpose) when changes occur. Note that this would have helped the Eclipse group enormously as otherwise they drown in the devel list trying to spot the changes.
I don't see that anything else is being proposed. The emerging STCI community and the OMPI community are not two non-overlapping groups, and the run-time support we want to bring into OMPI is to support new functionality. The main point is that this is not STCI vs. OMPI at all.
Rich

>
> My $0.0002 - hope it helps
> Ralph
>
>
> On Dec 4, 2008, at 6:00 PM, Richard Graham wrote:
>
> Let me start the e-mail conversation, and see how far we get.
>
> Goal: The goal several of us have is to be able to use the btl's outside of the MPI layer in Open MPI. The layer itself is generic, w/o specific knowledge of Upper Level Protocols, so is well suited for this sort of use.
>
> Technical Approach: What we have suggested is to start the process with the Open MPI code base, and make it independent of the mpi-layer (which it is now), and the run-time layer.
>
> Before we get into any specific technical details, the first question I have is: are people totally opposed to the notion of making the btl's independent of MPI and the run-time ?
> This does not mean that it can't be used by it, but that there are well defined abstraction layers, i.e., are people against the goal in the first place ?
>
> What are alternative suggestions to the technical approach ?
>
> One suggestion has been to branch and patch. To me this is a long-term maintenance nightmare.
>
> What are people's thoughts here ?
>
> Rich
>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel



--
Jeff Squyres
Cisco Systems
