Re[2]: [zfs-discuss] Google paper on disk reliability

2007-02-21 Thread Robert Milkowski
Hello Jesus,

Wednesday, February 21, 2007, 5:54:35 AM, you wrote:

JC> Joerg Schilling wrote:
JC>> What they failed to mention is that you need to access the whole disk
JC>> frequently enough to give SMART a chance to work.

JC> I thought modern disks could be instructed to do offline scanning,
JC> using any idle time available.

That was also mentioned in the paper.

-- 
Best regards,
 Robert    mailto:[EMAIL PROTECTED]
   http://milek.blogspot.com

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Google paper on disk reliability

2007-02-20 Thread Joerg Schilling
Richard Elling [EMAIL PROTECTED] wrote:

  
> > Link to the paper is http://labs.google.com/papers/disk_failures.pdf
>
> As for the spares debate, that is easy: use spares :-)

What they failed to mention is that you need to access the whole disk
frequently enough to give SMART a chance to work.

Jörg

-- 
 EMail:[EMAIL PROTECTED] (home) Jörg Schilling D-13353 Berlin
   [EMAIL PROTECTED](uni)  
   [EMAIL PROTECTED] (work) Blog: http://schily.blogspot.com/
 URL:  http://cdrecord.berlios.de/old/private/ ftp://ftp.berlios.de/pub/schily


Re: [zfs-discuss] Google paper on disk reliability

2007-02-20 Thread Jesus Cea
Joerg Schilling wrote:
> What they failed to mention is that you need to access the whole disk
> frequently enough to give SMART a chance to work.

I thought modern disks could be instructed to do offline scanning,
using any idle time available.


[EMAIL PROTECTED] video]# smartctl -a /dev/hda
...
General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
...
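
The check above can also be scripted. What follows is only a minimal sketch in
Python, assuming the text of a `smartctl -a` report has already been captured
into a string. The function name `auto_offline_enabled` and the sample text are
illustrative assumptions, not part of smartmontools, and the pattern only covers
the wording shown in the report above.

```python
import re


def auto_offline_enabled(smartctl_output):
    """Return True or False if the report states the setting, else None."""
    # \s* also matches a line break, so the wrapped report layout still parses.
    match = re.search(
        r"Auto Offline Data Collection:\s*(Enabled|Disabled)",
        smartctl_output,
    )
    if match is None:
        return None
    return match.group(1) == "Enabled"


# Sample text copied from the report excerpt above (hypothetical input).
SAMPLE = """General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection:
Enabled.
"""

if __name__ == "__main__":
    print(auto_offline_enabled(SAMPLE))
```

If the function reports False, smartctl's `-o on` option (`--offlineauto=on`) is
the documented way to ask the drive to run its offline scan automatically.
Whether a given drive honours that request is up to its firmware.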


-- 
Jesus Cea Avion _/_/  _/_/_/_/_/_/
[EMAIL PROTECTED] http://www.argo.es/~jcea/ _/_/_/_/  _/_/_/_/  _/_/
jabber / xmpp:[EMAIL PROTECTED] _/_/_/_/  _/_/_/_/_/
   _/_/  _/_/_/_/  _/_/  _/_/
Things are not so easy  _/_/  _/_/_/_/  _/_/_/_/  _/_/
My name is Dump, Core Dump   _/_/_/_/_/_/  _/_/  _/_/
El amor es poner tu felicidad en la felicidad de otro - Leibniz


Re: [zfs-discuss] Google paper on disk reliability

2007-02-19 Thread Richard Elling

Akhilesh Mritunjai wrote:

> I believe word has already gone around: Google engineers have published a
> paper on disk reliability. It might supplement the ZFS FMA integration and,
> well, all the numerous debates on spares etc. over here.


Good paper.  They validate the old saying, complex systems fail in complex
ways.  We've also done some internal (Sun) studies which cast doubt on the
ability of SMART to predict failures.


> To quote /.
>
> The Google engineers just published a paper on Failure Trends in a Large Disk
> Drive Population. Based on a study of 100,000 disk drives over 5 years they
> find some interesting stuff. To quote from the abstract: 'Our analysis
> identifies several parameters from the drive's self monitoring facility
> (SMART) that correlate highly with failures. Despite this high correlation, we
> conclude that models based on SMART parameters alone are unlikely to be useful
> for predicting individual drive failures. Surprisingly, we found that
> temperature and activity levels were much less correlated with drive failures
> than previously reported.'
>
> Link to the paper is http://labs.google.com/papers/disk_failures.pdf


As for the spares debate, that is easy: use spares :-)
 -- richard


Re: [zfs-discuss] Google paper on disk reliability

2007-02-19 Thread Torrey McMahon

Richard Elling wrote:

> Akhilesh Mritunjai wrote:
>> I believe word has already gone around: Google engineers have published
>> a paper on disk reliability. It might supplement the ZFS FMA integration
>> and, well, all the numerous debates on spares etc. over here.
>
> Good paper.  They validate the old saying, complex systems fail in complex
> ways.  We've also done some internal (Sun) studies which cast doubt on the
> ability of SMART to predict failures.


... which is why we were never really fans of turning SMART on.


Re: [zfs-discuss] Google paper on disk reliability

2007-02-18 Thread Chris Ridd
On 18/2/07 4:56, Akhilesh Mritunjai [EMAIL PROTECTED] wrote:

> Hi Folks
>
> I believe word has already gone around: Google engineers have published a
> paper on disk reliability. It might supplement the ZFS FMA integration and,
> well, all the numerous debates on spares etc. over here.
>
> To quote /.
>
> The Google engineers just published a paper on Failure Trends in a Large Disk
> Drive Population. Based on a study of 100,000 disk drives over 5 years they
> find some interesting stuff. To quote from the abstract: 'Our analysis
> identifies several parameters from the drive's self monitoring facility
> (SMART) that correlate highly with failures. Despite this high correlation, we
> conclude that models based on SMART parameters alone are unlikely to be useful
> for predicting individual drive failures. Surprisingly, we found that
> temperature and activity levels were much less correlated with drive failures
> than previously reported.'
>
> Link to the paper is http://labs.google.com/papers/disk_failures.pdf

There was another similar paper (written at CMU) given at the same
conference:

http://www.cs.cmu.edu/~bianca/fast07.pdf

Cheers,

Chris




[zfs-discuss] Google paper on disk reliability

2007-02-17 Thread Akhilesh Mritunjai
Hi Folks

I believe word has already gone around: Google engineers have published a
paper on disk reliability. It might supplement the ZFS FMA integration and,
well, all the numerous debates on spares etc. over here.

To quote /.

The Google engineers just published a paper on Failure Trends in a Large Disk 
Drive Population. Based on a study of 100,000 disk drives over 5 years they 
find some interesting stuff. To quote from the abstract: 'Our analysis 
identifies several parameters from the drive's self monitoring facility (SMART) 
that correlate highly with failures. Despite this high correlation, we conclude 
that models based on SMART parameters alone are unlikely to be useful for 
predicting individual drive failures. Surprisingly, we found that temperature 
and activity levels were much less correlated with drive failures than 
previously reported.'

Link to the paper is http://labs.google.com/papers/disk_failures.pdf
 
 