Re: FreeBSD 6.2-REL, system lockup, recovers when keyboard pressed

2008-03-12 Thread Dale Shaw
Hi again all,

Just an update on my problem (see below).

I upgraded the box to 6.3-REL and the problem persisted -- exactly the
same behaviour.

I've narrowed the problem down to nfdump though -- without the NetFlow
collectors (nfcapd) running, the box is rock solid.

If anyone out there happens to have seen this problem (with nfdump and
friends) before, or has some general advice for troubleshooting
something like this (I suspect some system resource tuning may be
required), please drop me a line.

In the meantime, I'll head on over to the nfdump list.

cheers,
Dale

On Thu, Feb 28, 2008 at 10:57 PM, Kris Kennaway [EMAIL PROTECTED] wrote:
> Dale Shaw wrote:
> > Hi all,
> >
> > [...]
> > I have a vanilla 6.2-RELEASE system running a bunch of network
> > management type tools like RANCID, nfcapd, cacti and so on.
> >
> > After a few days of normal operation, the system (locked away in a
> > data centre) falls off the network. Can't SSH to it, can't ping it. No
> > ARP -- gone! I have no OOB access to this machine (it's a test
> > box/play pen).
>
> I have a vague memory of something like this but cannot point to a
> specific commit that resolved it.
>
> Kris
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


FreeBSD 6.0-REL, system lockup, recovers when keyboard pressed

2008-02-28 Thread Dale Shaw
Hi all,

I'm sorry if this is a FAQ. I searched but couldn't find a direct
match yada yada..

I have a vanilla 6.0-RELEASE system running a bunch of network
management type tools like RANCID, nfcapd, cacti and so on.

After a few days of normal operation, the system (locked away in a
data centre) falls off the network. Can't SSH to it, can't ping it. No
ARP -- gone! I have no OOB access to this machine (it's a test
box/play pen).

Strangely, when I drive out and visit the machine (an HP DL320) in
person and press enter a couple of times on the keyboard, it springs
back to life like nothing ever happened. I literally see (for example)
log entries dated Feb 07 immediately followed by entries dated Feb
28. Some processes lose the plot and need to be restarted, but others
just continue on their merry way.
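
As an aside for anyone chasing a similar stall: the jump in log dates
described above is a handy way to pin down when the box actually stopped.
Below is a rough sketch, not something from the original report. It
assumes traditional syslog lines that start with a 15-character
timestamp such as "Feb  7 03:12:45", and since those lines carry no
year, one has to be supplied. The function name find_gaps and the
one-hour threshold are made up for illustration.

```python
from datetime import datetime, timedelta


def find_gaps(lines, year=2008, threshold=timedelta(hours=1)):
    """Yield (previous, current) timestamps that are more than `threshold` apart.

    Assumes each log line begins with a syslog-style stamp such as
    "Feb  7 03:12:45". Lines that do not parse are skipped.
    """
    prev = None
    for line in lines:
        try:
            # Syslog omits the year, so prepend the assumed one before parsing.
            stamp = datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
        except ValueError:
            continue  # not a timestamped line
        if prev is not None and stamp - prev > threshold:
            yield prev, stamp
        prev = stamp


if __name__ == "__main__":
    import sys

    # Usage: python find_gaps.py /path/to/logfile
    with open(sys.argv[1], errors="replace") as fh:
        for before, after in find_gaps(fh):
            print(f"no entries between {before} and {after}")
```

Pointing it at a syslog file (for example /var/log/messages) lists each
silent stretch, which can then be lined up against whatever the
collectors or cron jobs were doing just before the last entry.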

In my searching I have found a couple of references to dodgy keyboard
drivers, problems with systems on KVM switches (like this one is), and
power management issues.

Any clues? Unfortunately tracking -STABLE is not really an option for
me on this box, but I'm more than happy to update and build a new
kernel on a once-off basis if someone says it's a known bug/problem
sorted out in a post-6.0-RELEASE fix.

cheers,
Dale


Re: FreeBSD 6.2-REL, system lockup, recovers when keyboard pressed

2008-02-28 Thread Dale Shaw
Sorry all, I typo'd -- the system is 6.2-REL, not 6.0-REL.

Does that make the answer any clearer? (maybe it's fresher in people's minds?)

Is confidence high that an update to 6.2-STABLE would sort this out?
(I'd really love a bug fix reference).

cheers,
Dale

On Thu, Feb 28, 2008 at 10:57 PM, Kris Kennaway [EMAIL PROTECTED] wrote:
> Dale Shaw wrote:
> > Hi all,
> >
> > [...]
> > I have a vanilla 6.2-RELEASE system running a bunch of network
> > management type tools like RANCID, nfcapd, cacti and so on.
> >
> > After a few days of normal operation, the system (locked away in a
> > data centre) falls off the network. Can't SSH to it, can't ping it. No
> > ARP -- gone! I have no OOB access to this machine (it's a test
> > box/play pen).
>
> I have a vague memory of something like this but cannot point to a
> specific commit that resolved it.
>
> Kris


Re: FreeBSD 6.2-REL, system lockup, recovers when keyboard pressed

2008-02-28 Thread Dale Shaw
Hi Sean,

On Fri, Feb 29, 2008 at 8:57 AM, Sean Cavanaugh
[EMAIL PROTECTED] wrote:

> You look at upgrading to 6.3-REL or 7.0-REL?

Well, I could, but that's a sledgehammer approach and while likely to
work, in the absence of a bug report/fix (I'm not saying there isn't
one), it is not guaranteed to work. For example, it might be freezing
up because of something I can control with configuration (loader.conf
stuff). I'll have to stop the processes for a while and see if I can
reproduce the behaviour while the system is essentially idle.

I'm certainly willing to upgrade but it would be good to go into that
process with more confidence of success.

cheers,
Dale