On Mon, Feb 27, 2006 at 11:33:19PM +0100, Juan PiƱeros wrote:
1 Raw_Read_Error_Rate 0x000d 100 100 050
Pre-fail Offline
- 51
195 Hardware_ECC_Recovered 0x001a 100 100 000
Old_age Always
- 2
199 UDMA_CRC_Error_Count0x003e 200 200 000
Old_age Always
- 9
This is where your issue seems to live. I have never seen the read
error and ecc corrected number not matching. It would mean that an error
occurs but there has been no way to make it right so I would expect the
read to be garbage... Did you see any corruption in your files? I mean
data corrupted instead of metadata?
Also, you say that sata does not support smart. That is not true, with
one of the very recent kernels (2.6.15.4), you can get them. I have not
much experience with the kernels shipped with debian. I always recompile
my own. But some problems I had with an nfs server (in an HPC system)
vanished when I upgraded from 2.6.12 to 2.6.14. There was a bug with the
futex, and I think that was the source of my problems (race conditions
are always nasty).
As for the udma crc? That usually means that your controller/cable is
going bad. Each time I have seen that, the whole system crashed
corrupting files everywhere... That is pretty odd that you see the thig
on two different system though.
jacques
PS: With development kernels, always try to use the latest. Especially
when you see a problem. (And I still consider the 2.6 as being a
development version)
signature.asc
Description: Digital signature