On Feb 9, 2009, at 7:06 PM, Jeff Bonwick wrote:
>>There is no substitute for cord-yank tests - many and often. The
>>weird part is, the ZFS design team simulated millions of them.  
>>So the full explanation remains to be uncovered?
>
>We simulated power failure; we did not simulate disks that simply
>blow off write ordering.  Any disk that you'd ever deploy in an  
>enterprise or storage appliance context gets this right.
>
>The good news is that ZFS is getting popular enough on consumer-grade
>hardware.  The bad news is that said hardware has a different set of 
>failure modes, so it takes a bit of work to become resilient to them.
>This is pretty high on my short list.

Speaking of "modes of failure": historically fsck has been used for
slightly different (although related purposes):
   0. as a tool capable of restoring consistency in a FS that didn't
   guarantee an always consistent on-disk state
   
   1. as a forensics tool that would let you retrieve as much
information
   as possible from a physically ill device
   
Thanks goodness, ZFS doesn't need fsck for #0. That still leaves
#1. So far all we have in that department is zdb/mdb. These two 
can do wonders when used by professionals, yet still fall
into "don't try that at home" category for everybody else.

Does such a tool sound reasonable? Does it have a chance of
ever showing up on your list?

Thanks,
Roman.
-- 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to