Hi devels,

In fact the communicator's group was already retained in the window structure. So everything was already in place. I pushed the last modifications, and everything seems ready to be merged in PR#4527.

Jeff, the fixup commits are squashed :)

Clément


On 11/30/2017 12:00 AM, Barrett, Brian via devel wrote:
The group is the easiest way to do the mapping from rank in window to ompi_proc_t, so it’s safe to say every window will have one (also, as a way of holding a reference to the ompi_proc_t).  So I think it’s safe to say that every OSC module has a group handle somewhere (directly or through the communicator).

Remember that in some implementations of the MTL, a communicator ID is a precious resource.  I don’t know where Portals 4 falls right now, but in various of the 64 bit tag matching implementations, it’s been as low as 4k communicators.  There’s no need for a cid if all you hold is a group reference.  Plus, a communicator has a bunch of other state (collective modules handles, etc.) that aren’t necessarily needed by a window.

Brian

On Nov 29, 2017, at 5:57 AM, Clement FOYER <clement.fo...@gmail.com <mailto:clement.fo...@gmail.com>> wrote:

Hi Brian,

Even if I see your point, I don't think a user request de free the communicator should necesserily lead to the communicator being deleted, only released from one hold, and available to be disposed by the library. I don't see objection to have the library keep a grab on these communicators, as the user give a handle to the actual object.

I do agree the point of asking if we want to keep only information relevant to all OSC components. Nevertheless, what would the difference be between holding the complete communicator and holding the group only? Is group the smallest part common to every component?

Clément


On 11/28/2017 07:46 PM, Barrett, Brian via devel wrote:
The following is perfectly legal:

MPI_Comm_dup(some_comm, &tmp_comm);
MPI_Win_create(…., tmp_comm, &window);
MPI_Comm_free(tmp_comm);

<use window>

So I don’t think stashing away a communicator is the solution.  Is a group sufficient?  I think any rational reading of the standard would lead to windows needing to hold a group reference for the life of the window.  I’d be ok putting a group pointer in the base window, if that would work?

Brian

On Nov 28, 2017, at 10:19 AM, George Bosilca <bosi...@icl.utk.edu <mailto:bosi...@icl.utk.edu>> wrote:

Hi Brian,

Let me first start with explaining why we need the communicator. We need to translate local to global rank (aka. rank in your MPI_COMM_WORLD), so that the communication map we provide make sense. The only way today is to go back to a communicator and then basically translate a rank between this communicator and MPI_COMM_WORLD. We could use the gid, but then we have a hash table lookup for every operation.

While a communicator is not needed internally by an OSC, in MPI world all windows start with a communicator. This is the reason why I was proposing the change, not to force a window to create or hold a communicator, but simply because the existence of a communicator linked to the window is more of less enforced by the MPI standard.

  George.



On Tue, Nov 28, 2017 at 1:02 PM, Barrett, Brian via devel <devel@lists.open-mpi.org <mailto:devel@lists.open-mpi.org>> wrote:

    The objection I have to this is that it forces an
    implementation where every one-sided component is backed by a
    communicator.  While that’s the case today, it’s certainly not
    required.  If you look at Portal 4, for example, there’s one
    collective call outside of initialization, and that’s a barrier
    in MPI_FENCE.  The SM component is the same way and given some
    of the use cases for shared memory allocation using the SM
    component, it’s very possible that we’ll be faced with a
    situation where creating a communicator per SM region is too
    expensive in terms of overall communicator count.

    I guess a different question would be what you need the
    communicator for.  It shouldn’t have any useful semantic
    meaning, so why isn’t a silent implementation detail for the
    monitoring component?

    Brian


    On Nov 28, 2017, at 8:45 AM, George Bosilca
    <bosi...@icl.utk.edu <mailto:bosi...@icl.utk.edu>> wrote:

    Devels,

    We would like to change the definition of the OSC module to
    move the communicator one level up from the different module
    structures into the base OSC module. The reason for this, as
    well as a lengthy discussion on other possible solutions can
    be found in https://github.com/open-mpi/ompi/pull/4527
    <https://github.com/open-mpi/ompi/pull/4527>.

    We need to take a decision on this asap, to prepare the PR for
    the 3.1. Please comment asap.

      George.

    _______________________________________________
    devel mailing list
    devel@lists.open-mpi.org <mailto:devel@lists.open-mpi.org>
    https://lists.open-mpi.org/mailman/listinfo/devel
    <https://lists.open-mpi.org/mailman/listinfo/devel>


    _______________________________________________
    devel mailing list
    devel@lists.open-mpi.org <mailto:devel@lists.open-mpi.org>
    https://lists.open-mpi.org/mailman/listinfo/devel
    <https://lists.open-mpi.org/mailman/listinfo/devel>





_______________________________________________
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

_______________________________________________
devel mailing list
devel@lists.open-mpi.org <mailto:devel@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/devel



_______________________________________________
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

_______________________________________________
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

Reply via email to