Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-22 Thread ken
On 10/21/2009 10:21 PM Philip Gwyn wrote: On 20-Oct-2009 Michael Schumacher wrote: I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-22 Thread William L. Maltby
On Thu, 2009-10-22 at 04:20 -0400, ken wrote: snip cat /boot/grub/menu.lst ... title CentOS (2.6.18-164.2.1.el5.plus) root (hd0,2) kernel /vmlinuz-2.6.18-164.2.1.el5.plus ro root=/dev/mapper/luks-3d723b4f-0184-438d-9cb9-9ebff16e683a rhgb quiet initrd

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-21 Thread Philip Gwyn
On 20-Oct-2009 Michael Schumacher wrote: I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally caught the panic message : EDAC MC0:

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-20 Thread Michael Schumacher
Chris, I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally caught the panic message : EDAC MC0: INTERNAL ERROR: channel-b out of

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-20 Thread Chris Miller
nate wrote: Check your bios/system event log for any indication that it is logging memory errors? Most modern server class motherboards (past 5 years) do this, though not always reliably. Nothing in the logs, it's a Supermicro X7DVL-E (fyi). I've also had trouble with memtest86 myself, I

[CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-19 Thread Chris Miller
I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally caught the panic message : EDAC MC0: INTERNAL ERROR: channel-b out of range (4 = 4)

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-19 Thread nate
Chris Miller wrote: Thoughts? Check your bios/system event log for any indication that it is logging memory errors? Most modern server class motherboards (past 5 years) do this, though not always reliably. I've also had trouble with memtest86 myself, I prefer to run ctcs: