OK--rebuilt everything and trying ssh CLI . Sorry for the noise
(flight delay!) Ron
---
Ronald Cohen
Geophysical Laboratory
Carnegie Institution
5251 Broad Branch Rd., N.W.
Washington, D.C. 20015
[email protected]
office: 202-478-8937
skype: ronaldcohen
https://twitter.com/recohen3
https://www.linkedin.com/profile/view?id=163327727


On Thu, Aug 20, 2015 at 8:31 AM, Cohen, Ronald
<[email protected]> wrote:
> Sorry--while the IT group tried to trace down this problem they
> reinstalled or deleted some library and now xtalopt fails all the
> time. I am getting on a plane--will have to try to fix later. Sorry to
> bother you. Ron
> ---
> Ronald Cohen
> Geophysical Laboratory
> Carnegie Institution
> 5251 Broad Branch Rd., N.W.
> Washington, D.C. 20015
> [email protected]
> office: 202-478-8937
> skype: ronaldcohen
> https://twitter.com/recohen3
> https://www.linkedin.com/profile/view?id=163327727
>
>
> On Thu, Aug 20, 2015 at 7:57 AM, Cohen, Ronald
> <[email protected]> wrote:
>> I can't get a job to work for more than a few hours when it fails with:
>>
>> SSHConnectionLibSSH::isConnected(): server timeout.
>>
>> SSH error:  Failed to resolve hostname legion.rc.ucl.ac.uk (Name or
>> service not known)
>>
>> "Cannot connect to ssh server [email protected]:22"
>>
>> Warning:  "Cannot connect to ssh server"
>>
>> SSHConnectionLibSSH::isConnected(): server timeout.
>>
>> SSH error:  Failed to resolve hostname legion.rc.ucl.ac.uk (Name or
>> service not known)
>>
>> "Cannot connect to ssh server [email protected]:22"
>>
>> Warning:  "Cannot connect to ssh server"
>>
>> Meanwhile we had ping running in another window and it showed no
>> errors and loss of network or nameservice.
>>
>> I think the host just didn't respond to the ssh call immediately and
>> the call timed out and xtalopt then dies.
>> I know how to fix this but have to find time.
>>
>> Ron
>>
>> ---
>> Ronald Cohen
>> Geophysical Laboratory
>> Carnegie Institution
>> 5251 Broad Branch Rd., N.W.
>> Washington, D.C. 20015
>> [email protected]
>> office: 202-478-8937
>> skype: ronaldcohen
>> https://twitter.com/recohen3
>> https://www.linkedin.com/profile/view?id=163327727
>>
>>
>> On Sun, Aug 16, 2015 at 4:35 PM, Patrick Avery <[email protected]> wrote:
>>> Hey Ron,
>>>
>>> So, we have been making several updates for a new release that is coming out
>>> soon. We MIGHT have already fixed this issue (although I don't recall
>>> explicitly fixing it). But I ran a test today to see what would happen. Let
>>> me know if you think this test adequately mimics your glitch that you found:
>>>
>>> I submitted a couple of jobs with XtalOpt, then disconnected my wifi for
>>> about 20 seconds (so the connection to the remote cluster would fail). Then,
>>> I reconnected it, and it read the output from the runs and updated
>>> successfully - no job restarts.
>>>
>>> I tried it again for a longer period of time (I disconnected the wifi for
>>> about 3 minutes). After several server timeouts (and it mentioned "Warning:
>>> "Cannot connect to ssh server"" three times in that time period), I
>>> reconnected the wifi. Unfortunately, the run did not continue - it appeared
>>> to be frozen (something we may want to fix). But after exiting out and
>>> resuming the run, it took it a while, but it updated the structures
>>> successfully from the output - no job restarts.
>>>
>>> Thanks,
>>> Patrick
>>>
>>> On Fri, Aug 14, 2015 at 4:35 PM, Cohen, Ronald <[email protected]>
>>> wrote:
>>>>
>>>> I had fixed this in an earlier version but don't remember how.
>>>> Sometimes the connection to the server or nameserver goes down (about
>>>> once a day) and I see an error like:
>>>>
>>>> SSHConnectionLibSSH::isConnected(): server timeout.
>>>> SSH error:  Failed to resolve hostname legion.rc.ucl.ac.uk (Name or
>>>> service not known)
>>>> "Cannot connect to ssh server [email protected]:22"
>>>> Warning:  "Cannot connect to ssh server"
>>>>
>>>> However, jobs are still running on the server and it comes back, but
>>>> in the meantime xtalopt hangs and never recovers without a restart,
>>>> and loss of the running jobs. It should just wait until the server
>>>> connection comes back.
>>>>
>>>> Ron
>>>>
>>>> ---
>>>> Ronald Cohen
>>>> Geophysical Laboratory
>>>> Carnegie Institution
>>>> 5251 Broad Branch Rd., N.W.
>>>> Washington, D.C. 20015
>>>> [email protected]
>>>> office: 202-478-8937
>>>> skype: ronaldcohen
>>>> https://twitter.com/recohen3
>>>> https://www.linkedin.com/profile/view?id=163327727
>>>
>>>

------------------------------------------------------------------------------
_______________________________________________
Avogadro-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/avogadro-devel

Reply via email to