Hi Joseph,

Firstly, I could argue that the implementation problem is caused by an 
implementation choice: too-lazy initialisation. The MPI Standard requires that 
an answer be given at a certain point in the execution of the program, so the 
implementation must do whatever work is needed to provide the answer at the 
time it is needed. The initialisation work could be lazily delayed until the 
user actually gets the INFO object associated with the communicator (if they 
ever do). I don’t particularly like that argument because I don’t like 
excluding this implementation route without deliberate intent to do so (I 
assert that no such deliberate intent exists).

Secondly, I could argue that the implementation should simply document its 
handling of INFO keys in this scenario: it is implementation-specific 
behaviour. There are lots of ways to phrase that documentation. a) For example, 
an INFO key will not be ignored if any module reports during communicator 
creation that it understands that key, even if that module is never actually 
selected or used. b) For example, when non-standard INFO keys are supplied to a 
communicator creation procedure, module selection for all operation types will 
occur during communicator creation instead of being delayed until yada yada ya. 
c) For example, calling MPI_COMM_GET_INFO will cause module selection to occur 
if non-standard INFO keys were associated with the communicator during 
communicator creation or subsequently via MPI_COMM_SET_INFO.

Thirdly, I would quite like the answer to be: INFO keys that only apply to some 
instances of MPI operations (that covers, but is not identical to, “are 
understood/used by some modules”) should be applied to the MPI operation rather 
than to the communicator. This is possible with a persistent scatter operation 
(for example) but, through an oversight in the MPI Standard, it is not possible 
for a blocking or nonblocking scatter operation. Thus, unfortunately, this 
answer is somewhat flawed. Despite that flaw, implementation-specific 
documentation could encourage the user to replace blocking and nonblocking 
collective operations with persistent operations and to supply the INFO keys to 
the persistent collective initialization procedure(s) instead of applying them 
to the whole communicator. Similar advice could encourage a switch from 
point-to-point operations to partitioned communication with one partition.

Cheers,
Dan.
—
Dr Daniel Holmes PhD
Executive Director
Chief Technology Officer
CHI Ltd
danhol...@chi.scot



> On 20 Oct 2021, at 20:24, Joseph Schuchart via mpi-forum 
> <mpi-forum@lists.mpi-forum.org> wrote:
> 
> I am working on the fix for Open MPI. The modular infrastructure and lazy 
> initialization there makes this challenging in some places. For example, 
> during communicator creation we don't yet know which collective module will 
> serve which collective operation. Assume we have a hypothetical custom info 
> key "ompi_scatter_blocksize" that is understood by a subset of possible 
> implementations of scatter that the library can choose from at runtime. That 
> choice might not be made until the user actually calls MPI_Scatter. We can 
> track which info keys are supported by the different OMPI modules but we 
> don't know whether the implementation understanding this key will be chosen 
> at the time the user calls MPI_Comm_get_info.
> 
> The question that came up: what is the meaning of "ignored" in this context? 
> Is a key ignored if no part of the implementation cares about it or is it 
> "ignored" if some part understands it but the key may not impact the 
> execution of an operation in a particular instance? Obviously, we lean 
> towards the first because otherwise we'd be forced to eagerly set up 
> everything during communicator creation just to get the info keys right... In 
> any case, we can definitely drop unknown keys like "foo" in Ken's example.
> 
> Thanks
> Joseph
> 
> On 10/19/21 9:44 AM, Raffenetti, Ken via mpi-forum wrote:
>> Thanks Martin. This is my recollection of the discussion, too. I was just 
>> surprised to see this behavior in both MPICH and Open MPI. We did come 
>> across this sentence in the text:
>> 
>>   An implementation must support info objects as caches for arbitrary 
>> (key,value) pairs, regardless of whether it recognizes the key.
>> 
>> But I think we can reasonably understand that to mean an info object that 
>> the user creates. I.e. *not* the info object associated with an MPI 
>> {comm,win,file} handle. MPICH will fix the MPI_FILE_GET_INFO behavior to do 
>> what is stated below.
>> 
>> Thanks,
>> Ken
>> 
>> On 10/18/21, 3:26 PM, "Martin Schulz" <schu...@in.tum.de> wrote:
>> 
>>     Hi Ken,
>> 
>>     To my understanding, this is actually a bug in the MPI library or an 
>> incorrect implementation according to the standard. We had a very lengthy 
>> discussion about this during MPI 4.0 - here is a link to the issue:
>> 
>>     https://github.com/mpi-forum/mpi-issues/issues/63
>> 
>>     We had in there at some point the following principles that guided the 
>> new text:
>> 
>>     Principles:
>>     a) the key/value pairs returned by MPI_GET_INFO must always have been 
>> supplied by the user and not ignored by the MPI library
>>     b) the semantics of supplying an INFO during object creation and during 
>> MPI_SET_INFO should be identically defined
>>     c) the MPI is permitted, but not required, to 'ignore' any INFO 
>> key/value pair supplied by the user
>>     d) if an INFO key/value pair is ignored by MPI, then it will not appear 
>> in subsequent MPI_GET_INFO calls
>>     e) if an INFO key/value pair was not ignored by MPI, then it must appear 
>> in subsequent MPI_GET_INFO calls
>> 
>>     (the same was then applied to all get infos)
>> 
>>     d) seems to be the key aspect here. From what I remember, this was a 
>> deliberate choice, as the assumption back then was that the user wanted to 
>> know what the library actually did with the info (and a library was allowed 
>> to not store other values that are not used). Also, we allowed the return of 
>> a different value - if a library knows and uses the key "foo", but only uses 
>> the first character, the value returned would be "b" - again with the idea 
>> that the user wants to know what is used.
>> 
>>     We had actually discussed a second version of MPI_Get_info that would 
>> return the actual user supplied values, but we never pursued that - perhaps 
>> it is worth reconsidering this again.
>> 
>>     Martin
>> 
>> 
>>     --
>>     Prof. Dr. Martin Schulz, Chair of Computer Architecture and Parallel 
>> Systems
>>     Department of Informatics, TU-Munich, Boltzmannstraße 3, D-85748 Garching
>>     Member of the Board of Directors at the Leibniz Supercomputing Centre 
>> (LRZ)
>>     Email: schu...@in.tum.de
>> 
>> 
>> 
>>     On 18.10.21, 18:12, "mpi-forum on behalf of Raffenetti, Ken via 
>> mpi-forum" <mpi-forum-boun...@lists.mpi-forum.org on behalf of 
>> mpi-forum@lists.mpi-forum.org> wrote:
>> 
>>         Here's an example program that illustrates the current file info 
>> behavior with MPICH and Open MPI. Neither library will abort the application 
>> in my tests. MPICH does not recognize the "foo" key, and I would guess the 
>> same for Open MPI.
>> 
>>         So is this correct behavior according to the standard? Or a bug? 
>> There are users who prefer the current behavior, I suspect in part because 
>> there is no attribute caching interface for MPI file objects.
>> 
>>         #include <mpi.h>
>>         #include <stdlib.h>
>>         #include <stdio.h>
>>         #include <string.h>
>> 
>>         int main(void) {
>>             MPI_Init(NULL, NULL);
>> 
>>             MPI_Info info;
>>             MPI_Info_create(&info);
>>             MPI_Info_set(info, "foo", "bar");
>> 
>>             MPI_File fh;
>>             MPI_File_open(MPI_COMM_SELF, "testfile", MPI_MODE_RDWR | 
>> MPI_MODE_CREATE, info, &fh);
>> 
>>             MPI_Info info_used;
>>             MPI_File_get_info(fh, &info_used);
>> 
>>             char value[4] = "";
>>             int flag;
>>             MPI_Info_get(info_used, "foo", 3, value, &flag);
>> 
>>             if (!flag) {
>>                 printf("key not found\n");
>>                 MPI_Abort(MPI_COMM_WORLD, 1);
>>             }
>> 
>>             if (strcmp(value, "bar")) {
>>                 printf("value does not match\n");
>>                 MPI_Abort(MPI_COMM_WORLD, 1);
>>             }
>> 
>>             MPI_File_close(&fh);
>>             MPI_Info_free(&info);
>>             MPI_Info_free(&info_used);
>> 
>>             MPI_Finalize();
>>             return 0;
>>         }
>> 
>>         On 10/15/21, 2:47 PM, "mpi-forum on behalf of Raffenetti, Ken via 
>> mpi-forum" <mpi-forum-boun...@lists.mpi-forum.org on behalf of 
>> mpi-forum@lists.mpi-forum.org> wrote:
>> 
>>             We brought this to the forum because during our internal 
>> discussion is was shown that ROMIO will return any "ignored" key/value pairs 
>> set by the user in MPI_FILE_GET_INFO. Essentially, ROMIO duplicates the 
>> users info object, and only adjusts settings for keys it recognizes. Perhaps 
>> this is wrong behavior according to MPI-4.0?
>> 
>>             This behavior was introduced into ROMIO somewhat recently in May 
>> 2020. There was a user request for it based on how OMPI-IO behaved. 
>> https://github.com/pmodels/mpich/pull/3954
>> 
>>             Ken
>> 
>>             On 10/13/21, 3:46 PM, "mpi-forum on behalf of Jim Dinan via 
>> mpi-forum" <mpi-forum-boun...@lists.mpi-forum.org on behalf of 
>> mpi-forum@lists.mpi-forum.org> wrote:
>> 
>>                 Hi Rob,
>>                 This seems reasonable to me. I think you can draw an analogy 
>> to an application setting "mpi_assert_no_any_tag = true" and checking the 
>> value returned by MPI_Comm_get_info to decide whether to use MPI_ANY_TAG on 
>> the communicator. I would suggest info values like "true" and "false" so 
>> that ROMIO can return back the same value passed by the user if the info key 
>> is accepted.
>> 
>>                 As of MPI 4.0, MPI_File_get_info must return "all hints that 
>> are supported by the implementation and have default values specified; any 
>> user-supplied hints that were not ignored by the implementation; and any 
>> additional hints that were set by the implementation." So, true/false would 
>> also allow ROMIO to return a default value for the info key if the user 
>> hasn't set it.
>> 
>>                 One thing you can't do with info today is fail the operation 
>> because of a key/value pair in the info argument. If there is an issue with 
>> a particular key/value pair, it is supposed to be ignored.
>> 
>>                  ~Jim.
>> 
>> 
>>                 On Mon, Oct 11, 2021 at 2:13 PM Latham, Robert J. via 
>> mpi-forum <mpi-forum@lists.mpi-forum.org> wrote:
>> 
>> 
>>                 I'd like a way to tell users programmatically about features 
>> ROMIO supports.
>> 
>>                 One way I imagined doing so was through the info object, 
>> though we're having some internal discussion about how kosher that is.  Any 
>> thoughts from the broader MPI community about an approach like this:
>> 
>> 
>>                 ```
>>                 MPI_Info_create(&info);
>>                 MPI_Info_set(info, "romio_feature_xyz", "requested");
>>                 MPI_File_open(...)
>>                 MPI_File_get_info(fh, &info_used)
>>                 MPI_File_get_info(info_used, "romio_feature_xyz", 
>> MPI_MAX_INFO_VAL-1,
>>                     value, &flag);
>> 
>>                 ```
>> 
>>                 Possible points of contention:
>> 
>>                 - we have examples of using hints for clients to pass 
>> information to implementations (striping_factor,  cb_buffer_size) and 
>> examples of implementations passing information to users (cb_nodes on Blue 
>> Gene), but we don't have too many examples of two-way hints.
>> 
>>                 - will MPI_File_get_info return all info keys or just the 
>> ones the implementation understands?  The standard dictates what is required 
>> to be returned.  ROMIO (so de facto standard) will return the union of its 
>> internal keys and any keys (known or unknown) passed in by the user.
>> 
>>                 - The use of "requested" as a hint value that the 
>> implementation can then either set to "enabled" or "disabled"  guards the 
>> situation where a user code is passing these infos to an older MPI IO 
>> implementation.  If the hint comes back exactly as given, then the 
>> implementation ignored this feature request and it is not supported. If the 
>> hint comes back "enabled" then the caller knows something about the 
>> implementation.
>> 
>>                 Thanks
>>                 ==rob
>> 
>> 
>>                 _______________________________________________
>>                 mpi-forum mailing list
>>                 mpi-forum@lists.mpi-forum.org
>>                 https://lists.mpi-forum.org/mailman/listinfo/mpi-forum
>> 
>>             _______________________________________________
>>             mpi-forum mailing list
>>             mpi-forum@lists.mpi-forum.org
>>             https://lists.mpi-forum.org/mailman/listinfo/mpi-forum
>> 
>>         _______________________________________________
>>         mpi-forum mailing list
>>         mpi-forum@lists.mpi-forum.org
>>         https://lists.mpi-forum.org/mailman/listinfo/mpi-forum
>> 
>> 
>> 
>> _______________________________________________
>> mpi-forum mailing list
>> mpi-forum@lists.mpi-forum.org
>> https://lists.mpi-forum.org/mailman/listinfo/mpi-forum
> 
> _______________________________________________
> mpi-forum mailing list
> mpi-forum@lists.mpi-forum.org
> https://lists.mpi-forum.org/mailman/listinfo/mpi-forum

_______________________________________________
mpi-forum mailing list
mpi-forum@lists.mpi-forum.org
https://lists.mpi-forum.org/mailman/listinfo/mpi-forum

Reply via email to