After some discussion on the devel list, I opened 
https://svn.open-mpi.org/trac/ompi/ticket/2904 to track the issue.


On Oct 25, 2011, at 12:08 PM, Ralph Castain wrote:

> FWIW: I have tracked this problem down. The fix is a little more complicated 
> than I'd like, so I'm going to have to ping some other folks to ensure we 
> concur on the approach before doing something.
> 
> On Oct 25, 2011, at 8:20 AM, Ralph Castain wrote:
> 
>> I still see it failing the test George provided on the trunk. I'm unaware of 
>> anyone looking further into it, though, as the prior discussion seemed to 
>> just end.
>> 
>> On Oct 25, 2011, at 7:01 AM, orel wrote:
>> 
>>> Dear all,
>>> 
>>> For several days I have been trying to use advanced MPI-2 features in the 
>>> following scenario:
>>> 
>>> 1) a master code A (of size NPA) spawns (MPI_Comm_spawn()) two slave
>>>    codes B (of size NPB) and C (of size NPC), giving intercomms A-B and A-C;
>>> 2) I create intracomms AB and AC by merging these intercomms (a rough
>>>    sketch of steps 1 and 2 is given below);
>>> 3) then I create intercomm AB-C by calling MPI_Intercomm_create(), using
>>>    AC as the bridge communicator.
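>>> 
>>> For steps 1 and 2, the calls look roughly like this (a minimal sketch; the
>>> executable names "./slaveB" and "./slaveC" and the use of MPI_COMM_WORLD as
>>> the spawning communicator are placeholders, not my actual code):
>>> 
>>> MPI_Comm interAB, interAC;          /* intercomms returned by the spawns */
>>> MPI_Comm intracommAB, intracommAC;
>>> 
>>> /* 1) A spawns B and C, obtaining intercomms A-B and A-C */
>>> MPI_Comm_spawn("./slaveB", MPI_ARGV_NULL, NPB, MPI_INFO_NULL, 0,
>>>                MPI_COMM_WORLD, &interAB, MPI_ERRCODES_IGNORE);
>>> MPI_Comm_spawn("./slaveC", MPI_ARGV_NULL, NPC, MPI_INFO_NULL, 0,
>>>                MPI_COMM_WORLD, &interAC, MPI_ERRCODES_IGNORE);
>>> 
>>> /* 2) merge each intercomm into an intracomm; the spawned side gets the
>>>       intercomm from MPI_Comm_get_parent() and merges with high = 1 */
>>> MPI_Intercomm_merge(interAB, 0, &intracommAB);
>>> MPI_Intercomm_merge(interAC, 0, &intracommAC);
>>> 
>>> For step 3, the calls are: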
>>> 
>>>  MPI_Comm intercommABC;
>>>  A: MPI_Intercomm_create(intracommAB, 0, intracommAC, NPA, TAG, &intercommABC);
>>>  B: MPI_Intercomm_create(intracommAB, 0, MPI_COMM_NULL, 0,   TAG, &intercommABC);
>>>  C: MPI_Intercomm_create(intracommC,  0, intracommAC,  0,   TAG, &intercommABC);
>>> 
>>>    In these calls, A0 and C0 play the role of local leader for AB and C 
>>> respectively; C0 and A0 play the role of remote leader within the bridge 
>>> intracomm AC.
>>> 
>>> 4)  MPI_Barrier(intercommABC);
>>> 5)  I merge intercomm AB-C into intracomm ABC with MPI_Intercomm_merge()
>>>     (sketched below);
>>> 6)  MPI_Barrier(intracommABC);
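>>> 
>>> In code, these last steps are roughly the following (a sketch, executed on
>>> all ranks of A, B and C; the high values passed to MPI_Intercomm_merge()
>>> are just one possible choice):
>>> 
>>> /* 4) barrier across the new intercomm AB-C */
>>> MPI_Barrier(intercommABC);
>>> 
>>> /* 5) merge intercomm AB-C into a single intracomm ABC
>>>       (e.g. high = 0 on the AB side, high = 1 on the C side) */
>>> MPI_Comm intracommABC;
>>> MPI_Intercomm_merge(intercommABC, 0, &intracommABC);
>>> 
>>> /* 6) barrier across the merged intracomm -- this is the call that fails */
>>> MPI_Barrier(intracommABC);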
>>> 
>>> My BUG: all these calls succeed, but when I try to use intracommABC for a 
>>> collective communication like MPI_Barrier(), I get the following error:
>>> 
>>> *** An error occurred in MPI_Barrier
>>> *** on communicator
>>> *** MPI_ERR_INTERN: internal error
>>> *** MPI_ERRORS_ARE_FATAL: your MPI job will now abort
>>> 
>>> 
>>> I have tried with the Open MPI trunk, 1.5.3, 1.5.4, and MPICH2 1.4.1p1.
>>> 
>>> My code works perfectly if intracomms A, B and C are obtained with 
>>> MPI_Comm_split() instead of MPI_Comm_spawn() (see the sketch below)!
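>>> 
>>> In that working variant, all NPA+NPB+NPC processes are started in a single 
>>> MPI_COMM_WORLD and the intracomms are obtained roughly like this (a sketch, 
>>> not my exact code; the rank-based coloring is just one possible layout):
>>> 
>>> int rank, color;
>>> MPI_Comm_rank(MPI_COMM_WORLD, &rank);
>>> color = (rank < NPA) ? 0 : (rank < NPA + NPB) ? 1 : 2;   /* A, B or C */
>>> 
>>> MPI_Comm intracomm;   /* intracomm A, B or C, depending on color */
>>> MPI_Comm_split(MPI_COMM_WORLD, color, rank, &intracomm);
>>> 
>>> /* intracommAB, intracommAC and intercommABC are then built exactly as above */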
>>> 
>>> 
>>> I found the same problem in a previous thread on the OMPI users mailing list:
>>> 
>>> => http://www.open-mpi.org/community/lists/users/2011/06/16711.php
>>> 
>>> Is that bug/problem currently under investigation? :-)
>>> 
>>> I can provide detailed code, but the example given by George Bosilca in that 
>>> previous thread produces the same error...
>>> 
>>> Thank you for your help...
>>> 
>>> -- 
>>> Aurélien Esnard
>>> University Bordeaux 1 / LaBRI / INRIA (France)
>> 
> 
> 


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/

