Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-07 Thread Mattias Wadenstein
On Fri, 7 Jul 2006, Neil Brown wrote: On Thursday July 6, [EMAIL PROTECTED] wrote: I suggest you find a SATA related mailing list to post this to (Look in the MAINTAINERS file maybe) or post it to linux-kernel. linux-ide couldn't help much, aside from recommending a bleeding-edge patchset

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-07 Thread Doug Ledford
On Fri, 2006-07-07 at 00:29 +0200, Christian Pernegger wrote: I don't know exactly how the driver was responding to the bad cable, but it clearly wasn't returning an error, so md didn't fail it. There were a lot of errors in dmesg -- seems like they did not get passed up to md? I find it

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-06 Thread Christian Pernegger
I suggest you find a SATA related mailing list to post this to (Look in the MAINTAINERS file maybe) or post it to linux-kernel. linux-ide couldn't help much, aside from recommending a bleeding-edge patchset which should fix a lot of things SATA: http://home-tj.org/files/libata-tj-stable/

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-06 Thread Neil Brown
On Thursday July 6, [EMAIL PROTECTED] wrote: I suggest you find a SATA related mailing list to post this to (Look in the MAINTAINERS file maybe) or post it to linux-kernel. linux-ide couldn't help much, aside from recommending a bleeding-edge patchset which should fix a lot of things

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-06 Thread Christian Pernegger
md is very dependant on the driver doing the right thing. It doesn't do any timeouts or anything like that - it assumes the driver will. md simply trusts the return status from the drive, and fails a drive if and only if a write to the drive is reported as failing (if a read fails, md trys to

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-07-01 Thread Christian Pernegger
Looks very much like a problem with the SATA controller. If the repeat look you have shown there is an infinite loop, then presumably some failure is not being handled properly. I agree, even though the AHCI driver was supposed to be stable. The loop is not quite infinite btw., it does time out

Re: Strange intermittant errors + RAID doesn't fail the disk.

2006-06-30 Thread Neil Brown
On Friday June 30, [EMAIL PROTECTED] wrote: More problems ... As reported I have 4x WD5000YS (Caviar RE2 500 GB) in a md RAID5 array. I've been benchmarking and otherwise testing the new array these last few days, and apart from the fact that the md doesn't shut down properly I've had no