Here's a real simple diagnostic you can do: set -mca plm_base_verbose 1 and
look at the cmd line being executed (send it here). It will look like:

[[xxx,1],0] plm:rsh: executing: jjkljks;jldfsaj;

If the cmd line has --daemonize on it, then the ssh will close and xterm
won't work.

Ralph



On 4/2/08 3:14 PM, "Jeff Squyres" <jsquy...@cisco.com> wrote:

> Can you diagnose a little further:
> 
> 1. in the case where it works, can you verify that the ssh to launch
> the orteds is still running?
> 
> 2. in the case where it doesn't work, can you verify that the ssh to
> launch the orteds has actually died?
> 
> 
> On Apr 2, 2008, at 4:58 PM, Jon Mason wrote:
>> On Wednesday 02 April 2008 01:21:31 pm Jon Mason wrote:
>>> On Wednesday 02 April 2008 11:54:50 am Ralph H Castain wrote:
>>>> I remember that someone had found a bug that caused
>>>> orte_debug_flag to not
>>>> get properly set (local var covering over a global one) - could be
>>>> that
>>>> your tmp-public branch doesn't have that patch in it.
>>>> 
>>>> You might try updating to the latest trunk
>>> 
>>> I updated my ompi-trunk tree, did a clean build, and I still seem
>>> the same
>>> problem.  I regressed trunk to rev 17589 and everything works as I
>>> expect.
>>> So I think the problem is still there in the top of trunk.
>> 
>> 
>> I stepped through the revs of trunk and found the first failing rev
>> to be
>> 17632.  Its a big patch, so I'll defer to those more in the know to
>> determine
>> what is breaking in there.
>> 
>> 
>>> I don't discount user error, but I don't think I am doing anyting
>>> different.
>>> Did some setting change that perhaps I did not modify?
>>> 
>>> Thanks,
>>> Jon
>>> 
>>>> On 4/2/08 10:41 AM, "George Bosilca" <bosi...@eecs.utk.edu> wrote:
>>>>> I'm using this feature on the trunk with the version from
>>>>> yesterday.
>>>>> It works without problems ...
>>>>> 
>>>>>   george.
>>>>> 
>>>>> On Apr 2, 2008, at 12:14 PM, Jon Mason wrote:
>>>>>> On Wednesday 02 April 2008 11:07:18 am Jeff Squyres wrote:
>>>>>>> Are these r numbers relevant on the /tmp-public branch, or the
>>>>>>> trunk?
>>>>>> 
>>>>>> I pulled it out of the command used to update the branch, which
>>>>>> was:
>>>>>> svn merge -r 17590:17917 https://svn.open-mpi.org/svn/ompi/trunk .
>>>>>> 
>>>>>> In the cpc tmp branch, it happened at r17920.
>>>>>> 
>>>>>> Thanks,
>>>>>> Jon
>>>>>> 
>>>>>>> On Apr 2, 2008, at 11:59 AM, Jon Mason wrote:
>>>>>>>> I regressed my tree and it looks like it happened between
>>>>>>>> 17590:17917
>>>>>>>> 
>>>>>>>> On Wednesday 02 April 2008 10:22:52 am Jon Mason wrote:
>>>>>>>>> I am noticing that ssh seems to be broken on trunk (and my cpc
>>>>>>>>> branch, as
>>>>>>>>> it is based on trunk).  When I try to use xterm and gdb to
>>>>>>>>> debug, I
>>>>>>>>> only
>>>>>>>>> successfully get 1 xterm.  I have tried this on 2 different
>>>>>>>>> setups.  I can
>>>>>>>>> successfully get the xterm's on the 1.2 svn branch.
>>>>>>>>> 
>>>>>>>>> I am running the following command:
>>>>>>>>> mpirun --n 2 --host vic12,vic20 -mca btl tcp,self -d xterm -e
>>>>>>>>> gdb /usr/mpi/gcc/openmpi-1.2-svn/tests/IMB-3.0/IMB-MPI1
>>>>>>>>> 
>>>>>>>>> Is anyone else seeing this problem?
>>>>>>>>> 
>>>>>>>>> Thanks,
>>>>>>>>> Jon
>>>>>>>>> _______________________________________________
>>>>>>>>> devel mailing list
>>>>>>>>> de...@open-mpi.org
>>>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>>>>>> 
>>>>>>>> _______________________________________________
>>>>>>>> devel mailing list
>>>>>>>> de...@open-mpi.org
>>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>>>> 
>>>>>> _______________________________________________
>>>>>> devel mailing list
>>>>>> de...@open-mpi.org
>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>>> 
>>>>> _______________________________________________
>>>>> devel mailing list
>>>>> de...@open-mpi.org
>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>> 
>>>> _______________________________________________
>>>> devel mailing list
>>>> de...@open-mpi.org
>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> 
>>> 
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> 
>> 
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 


Reply via email to