[OMPI devel] Fwd: OpenMPI changes

Greg Watson Tue, 4 Mar 2008 15:24:24 -0500

Hi all,

Ralph informs me that significant functionality has been removed from
ORTE in 1.3. Unfortunately this functionality was being used by PTP to
provide support for OMPI, and without it, it seems unlikely that PTP
will be able to work with 1.3. Apparently restoring this lost
functionality is an "enhancement" of 1.3, and so is something that
will not necessarily be done. Having worked with OMPI from a very
early stage to ensure that we were able to provide robust support, I
must say it is a bit disappointing that this approach is being taken.
I hope that the community will view this "enhancement" as worthwhile.


Regards,

Greg

Begin forwarded message:

On 2/29/08 7:13 AM, "Gregory R Watson" <g...@us.ibm.com> wrote:

>
>
> Ralph Castain <r...@lanl.gov> wrote on 02/29/2008 12:18:39 AM:
>
>> Ralph Castain <r...@lanl.gov>
>> 02/29/08 12:18 AM
>>
>> To
>>
>> Gregory R Watson/Watson/IBM@IBMUS
>>
>> cc
>>
>> Subject
>>
>> Re: OpenMPI changes
>>
>> Hi Greg
>>
>> All of the prior options (and some new ones) for spawning a job are fully
>> supported in the new interface. Instead of setting them with "attributes",
>> you create an orte_job_t object and just fill them in. This is precisely how
>> mpirun does it - you can look at that code if you want an example, though it
>> is somewhat complex. Alternatively, you can look at the way it is done for
>> comm_spawn, which may be more analogous to your situation - that code is in
>> ompi/mca/dpm/orte.
>>
>> All the tools library does is communicate the job object to the target
>> persistent daemon so it can do the work. This way, you don't have to open
>> all the frameworks, deal directly with the plm interface, etc.
>>
>> Alternatively, you are welcome to do a full orte_init and use the frameworks
>> yourself - there is no requirement to use the library. I only offer it as an
>> alternative.
>
> As far as I can tell, neither API provides the same functionality as that
> available in 1.2. While this might be beneficial for OMPI-specific activities,
> the changes appear to severely limit the interaction of tools with the
> runtime. At this point, I can't see either interface supporting PTP.

I went ahead and added a notification capability to the system - took about
30 minutes. I can provide notice of job and process state changes since I
see those. Node state changes, however, are different - I can notify on
them, but we have no way of seeing them. None of the environments we support
tell us when a node fails.

>
>>
>> I know that the tool library works because it uses the identical APIs as
>> comm_spawn and mpirun. I have also tested them by building my own tools.
>
> There's a big difference being on a code path that *must* work because it is
> used by core components, to one that is provided as an add-on for external
> tools. I may be worrying needlessly if this new interface becomes an
> "officially supported" API. Is that planned? At a minimum, it seems like it's
> going to complicate your testing process, since you're going to need to
> provide a separate set of tests that exercise this interface independent of
> the rest of OMPI.

It is an officially supported API. Testing is not as big a problem as you
might expect since the library exercises the same code paths as mpirun and
comm_spawn. Like I said, I have written my own tools that exercise the
library - no problem using them as tests.

>
>>
>> We do not launch an orted for any tool-library query. All we do is
>> communicate the query to the target persistent daemon or mpirun. Those
>> entities have recv's posted to catch any incoming messages and execute the
>> request.
>>
>> You are correct that we no longer have event driven notification in the
>> system. I repeatedly asked the community (on both devel and core lists) for
>> input on that question, and received no indications that anyone wanted it
>> supported. It can be added back into the system, but would require the
>> approval of the OMPI community. I don't know how problematic that would be -
>> there is a lot of concern over the amount of memory, overhead, and potential
>> reliability issues that surround event notification. If you want that
>> capability, I suggest we discuss it, come up with a plan that deals with
>> those issues, and then take a proposal to the devel list for discussion.
>>
>> As for reliability, the objectives of the last year's effort were precisely
>> scalability and reliability. We did a lot of work to eliminate recursive
>> deadlocks and improve the reliability of the code. Our current testing
>> indicates we had considerable success in that regard, particularly with the
>> recursion elimination commit earlier today.
>>
>> I would be happy to work with you to meet the PTP's needs - we'll just need
>> to work with the OMPI community to ensure everyone buys into the plan. If it
>> would help, I could come and review the new arch with the team (I already
>> gave a presentation on it to IBM Rochester MN) and discuss required
>> enhancements.
>
> PTP's needs have not changed since 1.0. From our perspective, the 1.3 branch
> simply removes functionality that is required for PTP to support OMPI. It
> seems strange that we need "approval of the OMPI community" to continue to use
> functionality that has been available since 1.0. In any case, there are
> unfortunately no resources to work on the kind of re-engineering that appears
> to be required to support 1.3, even if it did provide the functionality we
> need.

Afraid I have to be driven by the OMPI community's requirements since they
pay my salary :-) What they need is a "lean, mean, OMPI machine" as they
say, and (for some reason) they view the debugger community as consisting of
folks like totalview, vampirtrace, etc. - all of whom get involved (either
directly or via one of the OMPI members) in the requirements discussions.

Can't argue with business decisions, though. I gather there was some mention
of PTP at the recent LANL/IBM RR meeting, so I'll let people know that PTP
won't be an option on RR.

And I'll see if there is any interest here in adding 1.3 support to PTP
ourselves - from looking at your code, I think it would take about a day,
assuming someone more familiar with PTP will work with me.

Take care
Ralph

>
> Greg
>
