Re[2]: [zfs-discuss] Google paper on disk reliability
Hello Jesus,

Wednesday, February 21, 2007, 5:54:35 AM, you wrote:

JC> Joerg Schilling wrote:
JC>> What they missed to say is that you need to access the whole disk
JC>> frequently enough in order to give SMART the ability to work.

JC> I thought modern disks could be instructed to do offline scanning,
JC> using any idle time available.

It was also mentioned in the paper.

-- 
Best regards,
Robert    mailto:[EMAIL PROTECTED]
          http://milek.blogspot.com

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
Re: [zfs-discuss] Google paper on disk reliability
Richard Elling [EMAIL PROTECTED] wrote:
> Link to the paper is http://labs.google.com/papers/disk_failures.pdf
>
> As for the spares debate, that is easy: use spares :-)

What they missed to say is that you need to access the whole disk
frequently enough in order to give SMART the ability to work.

Jörg

-- 
EMail: [EMAIL PROTECTED] (home)  Jörg Schilling  D-13353 Berlin
       [EMAIL PROTECTED] (uni)
       [EMAIL PROTECTED] (work)
Blog:  http://schily.blogspot.com/
URL:   http://cdrecord.berlios.de/old/private/  ftp://ftp.berlios.de/pub/schily
Re: [zfs-discuss] Google paper on disk reliability
Joerg Schilling wrote:
> What they missed to say is that you need to access the whole disk
> frequently enough in order to give SMART the ability to work.

I thought modern disks could be instructed to do offline scanning,
using any idle time available.

    [EMAIL PROTECTED] video]# smartctl -a /dev/hda
    ...
    General SMART Values:
    Offline data collection status:  (0x82) Offline data collection activity
                                            was completed without error.
                                            Auto Offline Data Collection: Enabled.
    ...

-- 
Jesus Cea Avion    [EMAIL PROTECTED]    http://www.argo.es/~jcea/
jabber / xmpp:[EMAIL PROTECTED]
Things are not so easy
My name is Dump, Core Dump
Love is placing your happiness in the happiness of another - Leibniz
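For readers who want to check the same field programmatically, here is a minimal Python sketch that decodes the "Offline data collection status" byte from `smartctl -a` output like the excerpt above. It is not part of the original message. The function name `parse_offline_status` and the code table are illustrative assumptions; the table reflects the usual ATA SMART convention (top bit = automatic offline collection enabled, low seven bits = last activity), so verify it against your smartmontools version before relying on it.

```python
import re

# Assumed meaning of the low 7 bits of the status byte (common ATA SMART
# convention; other values are treated as reserved or vendor specific).
STATUS_CODES = {
    0x00: "never started",
    0x02: "completed without error",
    0x03: "in progress",
    0x04: "suspended by an interrupting host command",
    0x05: "aborted by an interrupting host command",
    0x06: "aborted by the device with a fatal error",
}

# Matches e.g. "Offline data collection status:  (0x82)"
STATUS_RE = re.compile(
    r"Offline data collection status:\s*\(0x([0-9a-fA-F]{2})\)"
)


def parse_offline_status(smartctl_output):
    """Return the decoded status byte, or None if the field is absent."""
    match = STATUS_RE.search(smartctl_output)
    if match is None:
        return None
    byte = int(match.group(1), 16)
    return {
        "raw": byte,
        # Bit 7 set means the drive scans automatically when idle.
        "auto_offline_enabled": bool(byte & 0x80),
        "activity": STATUS_CODES.get(byte & 0x7F, "reserved or vendor specific"),
    }


# Sample text copied from the smartctl excerpt in the message above.
sample = """General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
"""

print(parse_offline_status(sample))
```

If the automatic flag turns out to be off, `smartctl -o on /dev/hda` asks the drive to enable automatic offline testing, and `smartctl -t long /dev/hda` starts an extended self-test by hand; whether a given drive honours either is device dependent.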
Re: [zfs-discuss] Google paper on disk reliability
Akhilesh Mritunjai wrote:
> I believe that the word would have gone around already, Google
> engineers have published a paper on disk reliability. It might
> supplement the ZFS FMA integration and well - all the numerous
> debates on spares etc etc over here.

Good paper. They validate the old saying, complex systems fail in
complex ways. We've also done some internal (Sun) studies which cast
doubt on the ability of SMART to predict failures.

> To quote /.
>
> The Google engineers just published a paper on Failure Trends in a
> Large Disk Drive Population. Based on a study of 100,000 disk drives
> over 5 years they find some interesting stuff. To quote from the
> abstract: 'Our analysis identifies several parameters from the
> drive's self monitoring facility (SMART) that correlate highly with
> failures. Despite this high correlation, we conclude that models
> based on SMART parameters alone are unlikely to be useful for
> predicting individual drive failures. Surprisingly, we found that
> temperature and activity levels were much less correlated with drive
> failures than previously reported.'
>
> Link to the paper is http://labs.google.com/papers/disk_failures.pdf

As for the spares debate, that is easy: use spares :-)
 -- richard
Re: [zfs-discuss] Google paper on disk reliability
Richard Elling wrote:
> Akhilesh Mritunjai wrote:
>> I believe that the word would have gone around already, Google
>> engineers have published a paper on disk reliability. It might
>> supplement the ZFS FMA integration and well - all the numerous
>> debates on spares etc etc over here.
>
> Good paper. They validate the old saying, complex systems fail in
> complex ways. We've also done some internal (Sun) studies which cast
> doubt on the ability of SMART to predict failures.

which is why we were never really fans of turning it on.
Re: [zfs-discuss] Google paper on disk reliability
On 18/2/07 4:56, Akhilesh Mritunjai [EMAIL PROTECTED] wrote:
> Hi Folks
>
> I believe that the word would have gone around already, Google
> engineers have published a paper on disk reliability. It might
> supplement the ZFS FMA integration and well - all the numerous
> debates on spares etc etc over here.
>
> To quote /.
>
> The Google engineers just published a paper on Failure Trends in a
> Large Disk Drive Population. Based on a study of 100,000 disk drives
> over 5 years they find some interesting stuff. To quote from the
> abstract: 'Our analysis identifies several parameters from the
> drive's self monitoring facility (SMART) that correlate highly with
> failures. Despite this high correlation, we conclude that models
> based on SMART parameters alone are unlikely to be useful for
> predicting individual drive failures. Surprisingly, we found that
> temperature and activity levels were much less correlated with drive
> failures than previously reported.'
>
> Link to the paper is http://labs.google.com/papers/disk_failures.pdf

There was another similar paper (written at CMU) given at the same
conference:

http://www.cs.cmu.edu/~bianca/fast07.pdf

Cheers,
Chris
[zfs-discuss] Google paper on disk reliability
Hi Folks,

I believe that the word would have gone around already: Google
engineers have published a paper on disk reliability. It might
supplement the ZFS FMA integration and, well, all the numerous debates
on spares etc. over here.

To quote /.:

  The Google engineers just published a paper on Failure Trends in a
  Large Disk Drive Population. Based on a study of 100,000 disk drives
  over 5 years they find some interesting stuff. To quote from the
  abstract: 'Our analysis identifies several parameters from the
  drive's self monitoring facility (SMART) that correlate highly with
  failures. Despite this high correlation, we conclude that models
  based on SMART parameters alone are unlikely to be useful for
  predicting individual drive failures. Surprisingly, we found that
  temperature and activity levels were much less correlated with drive
  failures than previously reported.'

Link to the paper is http://labs.google.com/papers/disk_failures.pdf

This message posted from opensolaris.org
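The abstract's claim that a signal can correlate strongly with failure and still be a poor per-drive predictor can be seen with a little arithmetic. The Python sketch below is not from the thread or the paper. Only the population size of 100,000 drives comes from the post; the failure count and both flag counts are hypothetical numbers chosen purely to illustrate the effect, and `flag_quality` is an illustrative helper name.

```python
def flag_quality(total, failed, flagged_failed, flagged_healthy):
    """Summarise how well a binary SMART flag separates failed drives."""
    healthy = total - failed
    flagged = flagged_failed + flagged_healthy
    unflagged = total - flagged
    missed = failed - flagged_failed  # failures that gave no warning

    risk_flagged = flagged_failed / flagged   # P(fail | flag)
    risk_unflagged = missed / unflagged       # P(fail | no flag)
    return {
        "recall": flagged_failed / failed,    # share of failures warned about
        "precision": risk_flagged,            # share of warnings that were real
        "false_positive_rate": flagged_healthy / healthy,
        "relative_risk": risk_flagged / risk_unflagged,
    }


# HYPOTHETICAL inputs: 3,000 failures among 100,000 drives, of which 1,320
# raised a SMART flag beforehand, while 1,940 healthy drives were also flagged.
stats = flag_quality(
    total=100_000, failed=3_000, flagged_failed=1_320, flagged_healthy=1_940
)

for name, value in stats.items():
    print(f"{name}: {value:.3f}")
```

With these made-up inputs a flagged drive is many times more likely to fail than an unflagged one, which is a strong correlation, yet more than half of the failures arrive with no flag at all and most flagged drives keep working. That is the shape of result the abstract describes; the real figures are in the paper.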