On 8/6/07 1:51 PM, "Jeff Squyres" <jsquy...@cisco.com> wrote:
> On Aug 6, 2007, at 11:49 AM, Ralph H Castain wrote:
>
>> 1. If everything is being done on localhost, I do not see any of the IO
>> from the child process. Mpirun executes and completes cleanly, however.
>> Because the spawn'd child terminates so quickly, I haven't been able to
>> positively confirm it is actually running - though I have some indication
>> that it is.
>
> This is probably my fault somehow;
Isn't everything?? :-)
> I can look into this, but not
> immediately. I'm guessing this is related to the IOF fix that I put
> in sometime last week. If you can do without IO from the
> COMM_SPAWN children for a little while, I can look at it in a few
> days...
No problem, really - just wanted to ensure someone was aware of it.
>
>> 2. If running on multiple hosts, I see the output from the child processes,
>> but mpirun "hangs" in MPI_Comm_disconnect. A ctrl-C is able to kill the
>> entire job.
>
> I can't comment on this one...
Could be related - let's fix the first and see if the second goes away.
Thanks
Ralph
>
>> Any ideas on what might have happened? This was all working not that long
>> ago... can't swear to an r-level at the moment, but am hoping someone has
>> an idea before I start having to blindly work backwards to find out what
>> broke it.
>>
>> Thanks
>> Ralph