> From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-
> boun...@opensolaris.org] On Behalf Of Stephan Budach
> 
> You are implying that the issues resulted from the H/W raid(s) and I
> don't think that this is appropriate.

Please quote originals when you reply.  If you don't - then it's easy to
follow the thread on the web forum, but not in email.  So if you don't
quote, you'll be losing a lot of the people following the thread.  

I think it's entirely appropriate to imply that your problem this time stems
from hardware.  I'll say it outright.  You have a hardware problem.  Because
if there is a repeatable checksum failure (bad disk) then if anything can
find it, scrub can.  And scrub is the best way to find it.

If you have a nonrepeatable checksum failure (such as you have) then there
is only one possibility.  You are experiencing a hardware problem.

One possibility is that there's a failing disk in your hardware raid set,
and your hardware raid controller is unable to detect it, because hardware
raid doesn't do checksumming.  Sometimes ZFS reads the device, and gets an
error.  Sometimes the hardware raid controller reads the other side of the
mirror, and there is no error.

This is not the only possibility.  There could be some other piece of
hardware yielding your intermittent checksum errors.  But there's one
absolute conclusion:  Your intermittent checksum errors are caused by
hardware.

If scrub didn't find an error, then there was no error at the time of scrub.

If scrub didn't find an error, and then something else *did* find an error,
it means one of two things.  (a) Maybe the error only occurred after the
scrub.  or (b) the hardware raid controller or some other piece of hardware
didn't produce corrupted data during the scrub, but will produce corrupted
data at some other time.

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to