Excellent idea, Andrea.

Thanks, -Brian


Andrea Righi wrote:
> On Mon, Jan 12, 2009 at 8:08 PM, Brian Elliott Finley <fin...@anl.gov> wrote:
>> Thanks, Ti,
>>
>> Will do.
>>
>> -Brian
>>
>>
>> Ti Leggett wrote:
>>> The machine's been rebooted. In the future, please email
>>> supp...@ci.uchicago.edu instead of Greg or me individually. Thanks.
>>>
>>> On Jan 12, 2009, at 2:51 AM, Andrea Righi wrote:
>>>
>>>> Greg,
>>>> systemimager.ci.uchicago.edu seems to be down: it responds to ping and
>>>> telnet, but nothing else.
>>>> When you have a minute could you try to reset the server?
>>>> Many thanks for your time,
>>>> -Andrea
> 
> Thanks Ti,
> 
> everything's working fine now. We'll write to the support list next time.
> 
> For the other admins/developers (Brian, Bernard, ...): in addition to
> the check-oom.pl script, I've configured the kernel with:
>   kernel.panic = 60
>   vm.panic_on_oom = 2
> 
> If a future OOM occurs that the script does not prevent, the kernel
> will panic and the system will reboot after 60 seconds. Hopefully this
> will finally prevent the hangs caused by OOM.
> 
> -Andrea
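
A minimal sketch, not part of the original thread, of how one might confirm that the two sysctl values quoted above are active by reading their files under /proc/sys. The function name `check_sysctls` and its `proc_root` parameter are hypothetical, introduced only for illustration; the expected values are the ones given in the message.

```python
from pathlib import Path

# Values quoted in the message above.
EXPECTED = {"kernel.panic": "60", "vm.panic_on_oom": "2"}


def check_sysctls(expected=EXPECTED, proc_root="/proc/sys"):
    """Return {name: (current, wanted)} for every sysctl that does not match."""
    mismatches = {}
    for name, wanted in expected.items():
        # A sysctl name maps to a path: kernel.panic -> <proc_root>/kernel/panic
        path = Path(proc_root).joinpath(*name.split("."))
        try:
            current = path.read_text().strip()
        except OSError:
            current = None  # file missing or unreadable
        if current != wanted:
            mismatches[name] = (current, wanted)
    return mismatches


if __name__ == "__main__":
    for name, (current, wanted) in check_sysctls().items():
        print(f"{name}: current={current!r}, expected={wanted!r}")
```

Run on the server, this prints nothing when both values are in effect. It only reads the current values; whether they survive a reboot depends on how they were set, which the message does not say.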

-- 
Brian Elliott Finley
CIS / Argonne National Laboratory
Office: 630.252.4742
Mobile: 630.631.6621

_______________________________________________
sisuite-devel mailing list
sisuite-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sisuite-devel
