I think this sounds reasonable, if (and only if) MPI_Accumulate is properly handled. The interface for calling the op functions was broken in some fairly obvious way for accumulate when I was writing the one-sided code. I think I had to call some supposedly internal bits of the interface to make accumulate work. I can't remember what they are now, but I do remember it being a problem.

Of course, unless it makes MPI_Allreduce on one double-sized floating point number using sum go faster, I'm not entirely sure a change is helpful ;).

Brian

On Mon, 5 Jan 2009, Jeff Squyres wrote:

WHAT: Converting the back-end of MPI_Op's to use components instead of hard-coded C functions.

WHY: To support specialized hardware (such as GPUs).

WHERE: Changes most of the MPI_Op code, adds a new ompi/mca/op framework.

WHEN: Work has started in an hg branch (http://www.open-mpi.org/hg/hgwebdir.cgi/jsquyres/cuda/).

TIMEOUT: Next Tuesday's teleconference, Jan 13 2009.

---------------------------------------

Note: I don't plan to finish the work by Jan 13; I just want to get a yea/nay from the community on the concept. Final review of the code before coming into the trunk can come later when I have more work to show / review.

Background: Today, the back-end MPI_Op functionality for (MPI_Op, MPI_Datatype) tuples is implemented as function pointers to a series of hard-coded C functions in the ompi/op/ directory.

*** NOTE: Since we already implement MPI_Op functionality via function pointers, this proposed extension is not expected to cause any performance difference in terms of OMPI's infrastructure.

Proposal: Extend the current implementation by creating a new framework ("op") that allows components to provide back-end MPI_Op functions instead of/in addition to the hard-coded C functions (we've talked about this idea before, but never done it).

The "op" framework will be similar to the MPI coll framework in that individual function pointers from multiple different modules can be mixed-n-matched. For example, if you want to write a new coll component that implements *only* a new MPI_BCAST algorithm, that coll component can be mixed-n-matched with other coll components at run time to get a full set of collective implementations on a communicator. A similar concept will be applied to the "op" framework. Case in point: some specialized hardware is only good at *some* operations on *some* datatypes; we'll need to fall back to the hard-coded C versions for all other tuples.

It is likely that the "op" framework base will contain all the hard-coded C "basic" MPI_Op functions, which will always be available as a fallback when no component is used at run-time for a specialized implementation. Specifically: the intent is that components will be reserved for specialized implementations.
