Re: [CentOS] Hard I/O lockup with EL6

2011-09-27 Thread Emmanuel Noobadmin
On 9/27/11, Benjamin Smith li...@benjamindsmith.com wrote: I wish you the best of luck! Fortunately (or unfortunately depending on how one looks at it), mine appears to be just bad sectors developing on one of the newest drive I added to the machine as part of a mdadm RAID 1 array. After I

[CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
I'm trying to figure out why 2 machines have a hard I/O lock on the HDD when running EL6. I have 4 identical machines, all were stable with EL5. 2 work great with EL6, 2 do not. I've checked momtherboard BIOS versions and settings, SAS controller BIOS versions and settings, they are the same

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread m . roth
Benjamin Smith wrote: I'm trying to figure out why 2 machines have a hard I/O lock on the HDD when running EL6. I have 4 identical machines, all were stable with EL5. 2 work great with EL6, 2 do not. I've checked momtherboard BIOS versions and settings, SAS controller BIOS versions and

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 12:36:19 PM m.r...@5-cent.us wrote: a) have you checked /var/log/message for memory or drive errors? Looked through the logs, there's *nothing* I can find that's out of sorts. When the IO problem happens, nothing can be written. Maybe memtest86? I replaced

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Brian McKerr
Have you checked the cables you are using ? On Tue, Sep 27, 2011 at 6:09 AM, Benjamin Smith li...@benjamindsmith.comwrote: On Monday, September 26, 2011 12:36:19 PM m.r...@5-cent.us wrote: a) have you checked /var/log/message for memory or drive errors? Looked through the logs, there's

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 02:00:52 PM Brian McKerr wrote: Have you checked the cables you are using ? There are none - it's a front-loaded hot-swap rackmount. The systems are stable under EL5. -- This message has been scanned for viruses and dangerous content by MailScanner, and is

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 02:42:18 PM Devin Reade wrote: --On Monday, September 26, 2011 12:11:47 PM -0700 Benjamin Smith Unfortunately in trying to use C6 on the old machine I wound up with far too many changed variables to figure out where the problem was. Despite that, my gut tells me

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Scott Silva
on 9/26/2011 3:13 PM Benjamin Smith spake the following: On Monday, September 26, 2011 02:42:18 PM Devin Reade wrote: --On Monday, September 26, 2011 12:11:47 PM -0700 Benjamin Smith Unfortunately in trying to use C6 on the old machine I wound up with far too many changed variables to figure

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Devin Reade
--On Monday, September 26, 2011 03:13:09 PM -0700 Benjamin Smith li...@benjamindsmith.com wrote: Thanks for the feedback. Unfortunately, these aren't ancient 686 systems, they are 1-ish year old 8-core Intel Xeons with 32 GB of ECC RAM apiece. I can't justify replacing them, especially since

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Emmanuel Noobadmin
On 9/27/11, Benjamin Smith li...@benjamindsmith.com wrote: When booting a non-working system, it boots straight up to the boot prompt (runlevel 3) without issue, and everything works fine. When the machine sits idle for a period of time (ranging from 15 minutes or so and up) the HDD becomes

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 10:16:14 PM Emmanuel Noobadmin wrote: On 9/27/11, Benjamin Smith li...@benjamindsmith.com wrote: When booting a non-working system, it boots straight up to the boot prompt (runlevel 3) without issue, and everything works fine. When the machine sits idle for a