On 8/6/07 1:51 PM, "Jeff Squyres" <jsquy...@cisco.com> wrote:

> On Aug 6, 2007, at 11:49 AM, Ralph H Castain wrote:
> 
>> 1. If everything is being done on localhost, I do not see any of the
>> IO from the child process. Mpirun executes and completes cleanly,
>> however. Because the spawn'd child terminates so quickly, I haven't
>> been able to positively confirm it is actually running - though I
>> have some indication that it is.
> 
> This is probably my fault somehow;

Isn't everything??  :-)

> I can look into this but not
> immediately.  I'm guessing this is related to the IOF fix that I put
> in last week sometime.  If you can do without IO from the
> COMM_SPAWN children for a little while, I can look at it in a few
> days...

No problem, really - just wanted to ensure someone was aware of it.

> 
>> 2. If running on multiple hosts, I see the output from the child
>> processes, but mpirun "hangs" in MPI_Comm_disconnect. A ctrl-C is
>> able to kill the entire job.
> 
> I can't comment on this one...

Could be related - let's fix the first and see if the second goes away.

Thanks
Ralph

> 
>> Any ideas on what might have happened? This was all working not that
>> long ago...can't swear to an r-level at the moment, but am hoping
>> someone has an idea before I start having to blindly work backwards
>> to find out what broke it.
>> 
>> Thanks
>> Ralph
>> 
>> 
>> _______________________________________________
>> devel-core mailing list
>> devel-c...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel-core
> 

