Clearing SMART errors I don't care about?

2008-12-19 Thread Kirk Strauser
I have a 250GB drive that I use exclusively for unimportant data.  I  
run smartd to watch the status of my drives and get regular output like:


smartd[1409]: Device: /dev/ad8, 4 Offline uncorrectable sectors

I've been seeing that exact message for a couple of years now and  
don't particularly care.  I'd like to know if the drive's about to  
roll over and die, but I really don't need to know about the same 4  
offline uncorrectable errors.


Is there a way to clear that message while leaving SMART enabled on  
the drive?  I didn't see anything specifically answering it in  
smartctl(8) but wondered if I missed something.

--
Kirk Strauser



___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Wojciech Puchar

> smartd to watch the status of my drives and get regular output like:
>
> smartd[1409]: Device: /dev/ad8, 4 Offline uncorrectable sectors
>
> I've been seeing that exact message for a couple of years now and don't
> particularly care.  I'd like to know if the drive's about to roll over and
> die, but I really don't need to know about the same 4 offline uncorrectable
> errors.
>
> Is there a way to clear that message while leaving SMART enabled on the
> drive?  I didn't see anything specifically answering it in smartctl(8) but
> wondered if I missed something.


maybe at some point there was an error while writing something, or maybe
the media failed in that place.


by rewriting whole drive you should be able to remove them


Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Kirk Strauser

On Dec 19, 2008, at 10:03 AM, Wojciech Puchar wrote:


> by rewriting whole drive you should be able to remove them



I tried running dd if=/dev/zero of=/dev/ad8 bs=1m a while back but  
I'm still getting them.  At this point, I imagine that I really do  
have bad sectors.  I'm fine with that.  I just don't want to hear  
about them any more.  :-)

--
Kirk Strauser







Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Tom Worster
On 12/19/08 11:46 AM, Kirk Strauser k...@strauser.com wrote:

> I just don't want to hear about them any more.  :-)

reconfigure smartd?

i'm not convinced that smart monitoring is of much value even on disks with
important data.




Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Kirk Strauser

On Dec 19, 2008, at 11:10 AM, James Tanis wrote:

> To answer your question, I don't believe smartd is sophisticated
> enough to filter out specific errors. It's meant to warn you at the
> first sign of drive failure so you will have time to replace the
> drive. It doesn't exactly provide a meter of how imminent drive
> failure is. If you're going to start ignoring its advice it isn't
> going to be very useful to you at all.



I beg to differ.  smartctl -H /dev/ad8 says that it passes its self- 
assessment and doesn't expect the drive to flat-out die any day soon.   
I'd still like to know if the error count increased, or if it started  
to detect imminent failure.
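
One way to get that behaviour (silence about the known 4 sectors, a warning if the count grows) is to post-process the smartd log rather than change smartd itself. The following is a minimal Python sketch under stated assumptions: the log line format is taken from the message quoted in this thread, while the function name new_warnings and the baseline dictionary are hypothetical and not part of smartmontools.

```python
import re

# Matches smartd lines such as:
#   smartd[1409]: Device: /dev/ad8, 4 Offline uncorrectable sectors
LINE_RE = re.compile(
    r"smartd\[\d+\]: Device: (?P<dev>\S+), "
    r"(?P<count>\d+) Offline uncorrectable sectors"
)


def new_warnings(lines, baseline):
    """Yield (device, old_count, new_count) only when a count rises.

    baseline maps device -> count that is already known and accepted
    (hypothetical bookkeeping; it is updated in place as counts grow).
    """
    for line in lines:
        match = LINE_RE.search(line)
        if match is None:
            continue  # not an offline-uncorrectable message
        dev = match.group("dev")
        count = int(match.group("count"))
        old = baseline.get(dev, 0)
        if count > old:
            baseline[dev] = count
            yield dev, old, count


if __name__ == "__main__":
    accepted = {"/dev/ad8": 4}  # the 4 sectors already known about
    sample = [
        "smartd[1409]: Device: /dev/ad8, 4 Offline uncorrectable sectors",
        "smartd[1409]: Device: /dev/ad8, 6 Offline uncorrectable sectors",
    ]
    for dev, old, new in new_warnings(sample, accepted):
        print(f"{dev}: offline uncorrectable sectors rose from {old} to {new}")
```

Repeats of the accepted count are dropped and only an increase is reported. This does not replace the smartctl -H health check, which would still have to be watched separately.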

--
Kirk Strauser





Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Zbigniew Szalbot
Hi,

> i'm not convinced that smart monitoring is of much value even on disks with
> important data.

It is if taken seriously. A smartd error was once ignored; the rest
(including the data) is history.

Thank you for FreeBSD!

-- 
Zbigniew Szalbot
www.fairtrade.net.pl
www.slowo.pl


Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Tom Worster
On 12/19/08 12:38 PM, Kirk Strauser k...@strauser.com wrote:

 
> I beg to differ.  smartctl -H /dev/ad8 says that it passes its self-
> assessment and doesn't expect the drive to flat-out die any day soon.
> I'd still like to know if the error count increased, or if it started
> to detect imminent failure.

there are plenty of hdd failure modes that smart can't predict. google's
monitoring of over 100,000 hdds over 9 months showed more than a third of
failures had no warning from smart.

http://storagemojo.com/2007/02/19/googles-disk-failure-experience/

so if my data matters, i need a robust hdd failure-tolerant system anyway,
i.e. raid (even if it's just gmirror, which i use for non-critical servers)
plus data snapshots to a remote site.

now, with that in place, what do i do with a smart warning? given that smart
algorithms are also prone to false positives, is there any benefit in
replacing the hdd now rather than waiting for it to fail and replacing it
then? not a great deal in my view.

but perhaps my raid array can't tolerate more than one hdd failure. i'd be
exposed to a second disk failure during the time to repair. if hdd failures
are independent (which i guess might not always be true) this isn't a big
concern. less of a concern than, for example, the chance of raid controller
failure, which i've seen happen. one time when that happened, the controller
corrupted all the disks in the array and when it was replaced rebuild was
impossible.
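
The independence argument above can be put into rough numbers. This Python sketch assumes a constant failure rate per disk (an exponential model) and independent failures, which may not always hold; the 5% annualized failure rate, the single remaining disk, and the 2-day repair window are illustrative values, not figures from this thread or from the google paper.

```python
import math


def second_failure_probability(afr, remaining_disks, repair_days):
    """Chance that at least one remaining disk fails during the repair window.

    afr             -- annualized failure rate per disk, e.g. 0.05 (assumed)
    remaining_disks -- disks still carrying the array
    repair_days     -- time until the failed disk is replaced and rebuilt

    Assumes independent failures at a constant rate.
    """
    rate_per_day = -math.log(1.0 - afr) / 365.0
    return 1.0 - math.exp(-rate_per_day * remaining_disks * repair_days)


if __name__ == "__main__":
    # Illustrative inputs only: a two-disk mirror with one disk lost.
    p = second_failure_probability(afr=0.05, remaining_disks=1, repair_days=2)
    print(f"chance of a second failure during repair: {p:.4%}")
```

With these illustrative inputs the result is well under 1%. Correlated failures (same batch, same enclosure, same controller) would push the real figure higher.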





Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Wojciech Puchar

> failures had no warning from smart.
>
> http://storagemojo.com/2007/02/19/googles-disk-failure-experience/


which means that in 2 cases of 3 you ARE warned.



Re: Clearing SMART errors I don't care about?

2008-12-19 Thread Tom Worster
On 12/19/08 2:51 PM, Wojciech Puchar woj...@wojtek.tensor.gdynia.pl
wrote:

> which means that in 2 cases of 3 you ARE warned.

yes. but what do i do with a smart warning? the google paper indicates that
even they haven't figured it out yet, although they express some hope.

"Despite those strong correlations, we find that failure prediction models
based on SMART parameters alone are likely to be severely limited in their
prediction accuracy, given that a large fraction of our failed drives have
shown no SMART error signals whatsoever. This result suggests that SMART
models are more useful in predicting trends for large aggregate populations
than for individual components."

in managing my servers, it's the failure of individual components that
counts.

considering also that 1) there are false positives and 2) replacement comes
with a finite chance of causing a service outage, it's not clear to me that
replacing a disk on the basis of a smart warning is prudent.

i'm not proposing any firm conclusions should be drawn. i started this by
saying i'm not convinced that smart monitoring is of much value... 

