Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Brandon Mercer
On Mon, Oct 5, 2009 at 10:27 AM, David Dyer-Bennet d...@dd-b.net wrote:

 On Sat, October 3, 2009 17:18, Ray Clark wrote:

 Thank you all for your help, not to snub anyone, but Darren, Richard, and
 Cindy especially come to mind.  Thanks for sparring with me until we
 understood each other.

 I'd like to echo this (and extend the thanks to include Ray).  I'm now
 starting to feel that I understand this issue, and I didn't for quite a
 while.  And that I understand the risks better, and have a clearer idea of
 what the possible fixes are.  And I didn't before.  That I do now is due
 to Ray's persistence, and to the rest of your patience.  Thank you!

Excellent, can this thread die now? :P


Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Al Hopper
Question (for Richard E):  Is there a write-up on the ZFS broken fletcher fix?
Is the default checksum for new pool creation changed in U8?
Is the default checksum for new pool creation changed in OpenSolaris or
SXCE  (which versions)?
Is there a case open to allow the user to select the checksum to be
used when a ZIL is being created?

Interesting thread - and commiserations to the ZFS team on the broken
fletcher implementation - we (developers) all have bad days!!

Regards,

-- 
Al Hopper  Logical Approach Inc,Plano,TX a...@logical-approach.com
   Voice: 972.379.2133 Timezone: US CDT
OpenSolaris Governing Board (OGB) Member - Apr 2005 to Mar 2007
http://www.opensolaris.org/os/community/ogb/ogb_2005-2007/


Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Miles Nordin
 re == Richard Elling richard.ell...@gmail.com writes:

re As I said before, if the checksum matches, then the data is
re checked for sequence number = previous + 1, the blk_birth ==
re 0, and the size is correct. Since this data lives inside the
re block, it is unlikely that a collision would also result in a
re valid block.

That's just a description of how the zil works, not an additional
layer of protection for user data in the ZIL beyond the checksum.  The
point of all this is to avoid needing to write a synchronous commit
sector to mark the block valid.  Instead, the block becomes valid once
it's entirely written.  Yes, the checksum has an additional, critical,
use in the ZIL compared to its use in the bulk pool, but checking
these header fields for sanity does nothing to mitigate broken
fletcher2's weakness in detecting corruption of the user data stored
inside the zil records.  It's completely orthogonal.

If anything, the additional use of broken fletcher2 in the ZIL is a
reason it's even more important to fix the checksum in the ZIL:
checksum mismatches occur in the ZIL even during normal operation,
even when the storage is not misbehaving, because sometimes blocks are
incompletely written.  This is the normal case, not the exception,
because the ZIL is only read after unclean shutdown.

and AIUI you are saying fletcher2 is still the default for bulk pool
data, too?  even on newly created pools with the latest code?  The fix
was just to add the word ``deprecated'' to some documentation
somewhere, without actually performing the deprecation?  I feel like
FreeBSD/NetBSD would probably have left this bug open until it's
fixed.  :/  Ubuntu or Gentoo would probably keep closing and reopening
it though while people haggled in the comments section.




Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Victor Latushkin

On 05.10.09 23:07, Miles Nordin wrote:

re == Richard Elling richard.ell...@gmail.com writes:


re As I said before, if the checksum matches, then the data is
re checked for sequence number = previous + 1, the blk_birth ==
re 0, and the size is correct. Since this data lives inside the
re block, it is unlikely that a collision would also result in a
re valid block.

That's just a description of how the zil works, not an additional
layer of protection for user data in the ZIL beyond the checksum.  The
point of all this is to avoid needing to write a synchronous commit
sector to mark the block valid.  Instead, the block becomes valid once
it's entirely written.  Yes, the checksum has an additional, critical,
use in the ZIL compared to its use in the bulk pool, but checking
these header fields for sanity does nothing to mitigate broken
fletcher2's weakness in detecting corruption of the user data stored
inside the zil records.  It's completely orthogonal.

If anything, the additional use of broken fletcher2 in the ZIL is a
reason it's even more important to fix the checksum in the ZIL:
checksum mismatches occur in the ZIL even during normal operation,
even when the storage is not misbehaving, because sometimes blocks are
incompletely written.  This is the normal case, not the exception,
because the ZIL is only read after unclean shutdown.

and AIUI you are saying fletcher2 is still the default for bulk pool
data, too?  even on newly created pools with the latest code?


Here's essentially the fix:

http://src.opensolaris.org/source/diff/onnv/onnv-gate/usr/src/uts/common/fs/zfs/sys/zio.h?r2=%252Fonnv%252Fonnv-gate%252Fusr%252Fsrc%252Futs%252Fcommon%252Ffs%252Fzfs%252Fsys%252Fzio.h%409454%3A02e1ddcc9be7&r1=%252Fonnv%252Fonnv-gate%252Fusr%252Fsrc%252Futs%252Fcommon%252Ffs%252Fzfs%252Fsys%252Fzio.h%409443%3A2a96d8478e95

It changes setting of checksum=on to mean fletcher4, so it is used by default 
for all user data and metadata. Though you can still set it to fletcher2 
explicitly.
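
On builds that predate this change, checksum=on still means the weak fletcher2, but a stronger algorithm can be selected explicitly. A minimal sketch, assuming a hypothetical pool named tank (fletcher4 and sha256 are both long-standing values of the checksum property):

    zfs set checksum=fletcher4 tank   # new writes stop using fletcher2
    zfs get -r checksum tank          # confirm the setting on every dataset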


victor


Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Miles Nordin
 vl == Victor Latushkin victor.latush...@sun.com writes:

vl It changes setting of checksum=on to mean fletcher4

oh, good.  so it is only the ZIL that's unfixed?  At least that fix
could come from a simple upgrade, if it ever gets fixed.




Re: [zfs-discuss] Best way to convert checksums

2009-10-05 Thread Toby Thain


On 5-Oct-09, at 3:32 PM, Miles Nordin wrote:


bm == Brandon Mercer yourcomputer...@gmail.com writes:



I'm now starting to feel that I understand this issue,
and I didn't for quite a while.  And that I understand the
risks better, and have a clearer idea of what the possible
fixes are.  And I didn't before.


haha, yes, I think I can explain it to people when advocating ZFS, but
the story goes something like ``ZFS is serious business and pretty
useful, but it has some pretty hilarious problems that you wouldn't
expect


Let's talk about the hilarious problems that a naive RAID stack
has and that most users don't expect. For a start, no crash-safe
behaviour, and no way to self-heal from unexpected mirror desync.
Then we could compare always-consistent COW with conventionally  
fragile metadata needing regular consistency checks...




from some of the blog hype you read.  Let me give you a couple
examples of things that still aren't fixed


...and can't be fixed, in RAID, or conventional filesystems.

--Toby


and how the discussion
went...''

...


Re: [zfs-discuss] Best way to convert checksums

2009-10-04 Thread Miles Nordin
 re == Richard Elling richard.ell...@gmail.com writes:

re The probability of the garbage having both a valid fletcher2
re checksum at the proper offset and having the proper sequence
re number and having the right log chain link and having the
re right block size is considerably lower than the weakness of
re fletcher2.

I'm having trouble parsing this.  I think you're confusing a few 
different failure modes:

 * ZIL entry is written, but corrupted by the storage, so that, for
   example, an entry should be read from the mirrored ZIL instead.

   + broken fletcher2 detects the storage corruption
 CASE A: Good!

   + broken fletcher2 misses the error, so that corrupted data is
 replayed from ZIL into the proper pool, possibly adding a
 stronger checksum to the corrupt data while writing it.
 CASE B: Bad!

   + broken fletcher2 misinterprets storage corruption as signalling
 the end of the ZIL, and any data in the ZIL after the corrupt
 entry is truncated without even attempting to read the mirror.
 (does this happen?)
 CASE C: Bad!

 * ZIL entry is intentional garbage, either a partially-written entry
   or an old entry, and should be treated as the end of the ZIL

   + broken fletcher2 identifies the partially written entry by a
 checksum mismatch, or the sequence number identifies it as old
 CASE D: Good!

   + broken fletcher2 misidentifies a partially-written entry as
 complete because of a hash collision
 CASE E: Bad!

   + (hypothetical, only applies to non-existent fixed system) working
 fletcher2 or broken-good-enough fletcher4 misidentifies a
 partially-written entry as complete because of a hash collision
 CASE F: Bad!

If I read your sentence carefully and try to match it with this chart,
it seems like you're saying P(CASE F) < P(CASE E), which seems like
an argument for fixing the checksum.  While you don't say so, I
presume from your other posts you're trying to make a case for doing
nothing, so I'm confused.

I was mostly thinking about CASE B though.  It seems like the special
way the ZIL works has nothing to do with CASE B: if you send data
through the ZIL to a sha256 pool, it can be written to ZIL under
broken-fletcher2, corrupted by the storage, and then read in and
played back corrupt but covered with a sha256 checksum to the pool
proper.  AFAICT your relative-probability sentence has nothing to do
with CASE B.

re Unfortunately, the ZIL is also latency sensitive, so the
re performance case gets stronger 

The performance case advocating what?  not fixing the broken checksum?

re while the additional error checking already boosts the
re dependability case.

what additional error checking?

Isn't the whole specialness of the ZIL that the checksum is needed in
normal operation, absent storage subsystem corruption, as I originally
said?  It seems like the checksum's strength is more important here,
not less.




Re: [zfs-discuss] Best way to convert checksums

2009-10-04 Thread Richard Elling


On Oct 4, 2009, at 11:51 AM, Miles Nordin wrote:


re == Richard Elling richard.ell...@gmail.com writes:


   re The probability of the garbage having both a valid fletcher2
   re checksum at the proper offset and having the proper sequence
   re number and having the right log chain link and having the
   re right block size is considerably lower than the weakness of
   re fletcher2.

I'm having trouble parsing this.  I think you're confusing a few
different failure modes:

* ZIL entry is written, but corrupted by the storage, so that, for
  example, an entry should be read from the mirrored ZIL instead.


This is attempted, if you have a mirrored slog.


  + broken fletcher2 detects the storage corruption
CASE A: Good!

  + broken fletcher2 misses the error, so that corrupted data is
replayed from ZIL into the proper pool, possibly adding a
stronger checksum to the corrupt data while writing it.
CASE B: Bad!

  + broken fletcher2 misinterprets storage corruption as signalling
the end of the ZIL, and any data in the ZIL after the corrupt
entry is truncated without even attempting to read the mirror.
(does this happen?)
CASE C: Bad!

* ZIL entry is intentional garbage, either a partially-written entry
  or an old entry, and should be treated as the end of the ZIL

  + broken fletcher2 identifies the partially written entry by a
checksum mismatch, or the sequence number identifies it as old
CASE D: Good!


If the checksum mismatches, you can't go any further because
the pointer to the next ZIL log entry cannot be trusted. So the
roll forward stops.  This is how such logs work -- there is no
end-of-log record.


  + broken fletcher2 misidentifies a partially-written entry as
complete because of a hash collision
CASE E: Bad!

  + (hypothetical, only applies to non-existent fixed system) working
fletcher2 or broken-good-enough fletcher4 misidentifies a
partially-written entry as complete because of a hash collision
CASE F: Bad!


As I said before, if the checksum matches, then the data is
checked for sequence number = previous + 1, the blk_birth == 0,
and the size is correct. Since this data lives inside the block, it
is unlikely that a collision would also result in a valid block.
In other words, ZFS doesn't just trust the checksum for slog entries.
 -- richard


If I read your sentence carefully and try to match it with this chart,
it seems like you're saying P(CASE F) < P(CASE E), which seems like
an argument for fixing the checksum.  While you don't say so, I
presume from your other posts you're trying to make a case for doing
nothing, so I'm confused.

I was mostly thinking about CASE B though.  It seems like the special
way the ZIL works has nothing to do with CASE B: if you send data
through the ZIL to a sha256 pool, it can be written to ZIL under
broken-fletcher2, corrupted by the storage, and then read in and
played back corrupt but covered with a sha256 checksum to the pool
proper.  AFAICT your relative-probability sentence has nothing to do
with CASE B.

   re Unfortunately, the ZIL is also latency sensitive, so the
   re performance case gets stronger

The performance case advocating what?  not fixing the broken checksum?

   re while the additional error checking already boosts the
   re dependability case.

what additional error checking?

Isn't the whole specialness of the ZIL that the checksum is needed in
normal operation, absent storage subsystem corruption, as I originally
said?  It seems like the checksum's strength is more important here,
not less.


Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Ray Clark
Richard, with respect to:

This has been answered several times in this thread already.
set checksum=sha256 filesystem
copy your files -- all newly written data will have the sha256
checksums.

I understand that.  I understood it before the thread started.  I did not ask 
this.  It is a fact that there is no feature to convert checksums as a part of 
resilver or some such.  

I started by asking what utility to use, but quickly zeroed in on zfs send/receive as 
being the native and presumably best method, but had questions as to how to get 
the checksum property set correctly when the new file system was automatically created, etc.

Note that my focus in recent portions of the thread has changed to the 
underlying zpool.

Simply changing the checksum=sha256 and copying my data is analogous to hanging 
my data from a hierarchy of 0.256 welded steel chain, with the top of the 
hierarchy hanging it all from an 0.001 steel thread.  Well, that is not quite 
fair because there are probabilities involved.  Someone is going to pick a link 
randomly and go after it with a fingernail clipper.  If they pick a thick one, 
I have very little to worry about to say the least.  If they pick one of the 
few dozen? hundred? thousand? (I don't know how many) that contain the 
structure and services of the underlying zpool, then the nailclipper will not 
be stopped by the 0.001 thread.   I do have 8,000,000,000 links in the chain, 
and only a very small fraction are 0.001 thick, and that is strongly in my 
favor, but I would expect the heads to also spend a disproportionate amount of 
time over the intent log.  It is hard to know how it comes out.  I just don't 
want any 0.001 steel threads protecting my data from the
gremlins.  I moved to ZFS to avoid gambles.  If I wanted gambles I would use 
linux raid and lvm2.  They work well enough if there are no errors.  

I should have enumerated the knowns and unknowns in my list last night, then I 
would not have annoyed you with my apparent deafness.  (Hopefully I am not 
still being deaf).  I will clarify below, as I should have last night:


Given that I only have 1.6TB of data in a 4TB pool, what can I do to
change those blocks to sha256 or Fletcher4:

(1) Without destroying and recreating the zpool under U4

I know how to fix the user data (just change checksum property on the pool 
using zfs specifying the pool vs. a zfs file system, then copy the data).

I don't know (am ignorant of) blocks comprising the underlying zpool, and how 
to fix them without recreating the pool.  It makes sense to me that at least 
some would be rewritten in the course of using the system, but (1) I have had 
no confirmation or denial that this is the case, (2) I don't know if this is 
all of them or some of them, (3) I don't know if the checksum= parameter would 
affect these (relling's Oct 2 at 3:26 post implies that it does not by lack of 
reference to checksum property).  So I don't know yet how much exposure will 
remain.  I would think that if the user specified a stronger checksum for their 
data that the system would abandon its use of weaker ones in the underlying 
structure, but Richard's list seems to imply otherwise.

(2) With destroying and recreating the zpool under U4 (Which I don't
really have the resources to pull off)

Due to some of the non-technical factors in the situation, I cannot actually 
execute an experimental valid zpool command, but zpool create -o garbage 
gives me a usage that does not include any -o or -O.  So it appears that under 
U4 I cannot do this.  I wish there were someone who could confirm that I can or 
cannot do this before I arrange for and propose that we dive into this massive 
undertaking.  Also, from Richard's Oct 2 3:26 note, I infer that this will not 
change the checksum used by the underlying zpool anyway, so this might be 
fruitless.  But I am inferring... Richard gave a quick list, his attitude was 
not that of providing all level of precise detail so I really don't know.  Many 
of the answers I have received have turned out to recommend features that are 
not available in U4 but in later versions, even unreleased versions.  I have no 
way of sorting this out without the information being qualified with a version.

(3) With upgrading to U7 (Perhaps in a few months)
Not clear what this will support on zpool, or if it would be effective (similar 
to U4 above)

(4) With upgrading to U8
Not sure when it will come out, what it will support, or if it will be 
effective (similar to U7, U4 above).

So I can enable robust protection on my user data, but perhaps not the 
infrastructure needed to get at that user data, and perhaps not the intent log. 
 

The answer may be that I cannot be helped.  That is not the desired answer, but 
if that is the case, so be it.  Let's lay out the facts and the best way to 
move on from here, for me and everybody else.  Why leave us thrashing in the 
dark?  Am I a Mac user?  

I personally will still believe ZFS is the way to go in the short term because 
it is 

Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Bob Friesenhahn

On Fri, 2 Oct 2009, Ray Clark wrote:

With the current fletcher2 we have only a 50-50 chance of catching 
these multi-bit errors.  Probability of multiple bits being changed 
is not


What is the current fletcher2?  A while back I seem to recall reading 
a discussion in the zfs-code forum about how the original zfs 
fletcher2 was found to be unexpectedly weak and broken so they updated 
the fletcher2 algorithm and assigned it a new enumeration so that 
fresh blocks use the corrected algorithm.  I could be just imagining 
all of this but that is what I remember today.


Since you are using Solaris 10 U4 maybe you are using the dinosaur 
version of fletcher2?


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer, http://www.GraphicsMagick.org/


Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Richard Elling

On Oct 3, 2009, at 7:46 AM, Ray Clark wrote:


Richard, with respect to:

This has been answered several times in this thread already.
set checksum=sha256 filesystem
copy your files -- all newly written data will have the sha256
checksums.

I understand that.  I understood it before the thread started.  I  
did not ask this.  It is a fact that there is no feature to convert  
checksums as a part of resilver or some such.


There is no such feature.  There is a long-awaited RFE for block
pointer rewriting (checksums are stored in the block pointer)
which would add the underlying capabilities for this.

I started by asking what utility to use, but quickly zeroed in on zfs send/
receive as being the native and presumably best method, but had
questions as to how to get the checksum property set correctly when it was
automatically created, etc.


Say for example I have a pool called zwimming with some stuff in it
and checksum=on. To create a copy of the data using send/recv
in the same pool but with checksum=sha256 do:
zfs snapshot zwimming@now
zfs send zwimming@now | zfs receive zwimming/new

You will now have a new file system called zwimming/new with
the same data as zwimming, but with checksum=sha256.
If you then want to get back to the original directory structure
you can set the mountpoint properties, as desired. There are dozens
of other ways to accomplish the copy.

Note that my focus in recent portions of the thread has changed to  
the underlying zpool.


Simply changing the checksum=sha256 and copying my data is analogous
to hanging my data from a hierarchy of 0.256 welded steel chain,  
with the top of the hierarchy hanging it all from an 0.001 steel  
thread.  Well, that is not quite fair because there are  
probabilities involved.  Someone is going to pick a link randomly  
and go after it with a fingernail clipper.  If they pick a thick  
one, I have very little to worry about to say the least.  If they  
pick one of the few dozen? hundred? thousand? (I don't know how  
many) that contain the structure and services of the underlying  
zpool, then the nailclipper will not be stopped by the 0.001  
thread.   I do have 8,000,000,000 links in the chain, and only a
very small fraction are 0.001 thick, and that is strongly in my  
favor, but I would expect the heads to also spend a disproportionate  
amount of time over the intent log.  It is hard to know how it comes  
out.  I just don't want any 0.001 steel threads protecting my data
from the
 gremlins.  I moved to ZFS to avoid gambles.  If I wanted gambles I  
would use linux raid and lvm2.  They work well enough if there are  
no errors.


I think you are missing the concept of pools. Pools contain datasets.
One form of dataset is a file system. Pools do not contain data per se,
datasets contain data.  Reviewing the checksums used with this
hierarchy in mind:

Pool
    Label [SHA-256]
    Uberblock [SHA-256]
    Metadata [fletcher4]
    Gang block [SHA-256]
    ZIL log [fletcher2]

Dataset (file system or volume)
    Metadata [fletcher4]
    Data [fletcher2 (default, today), fletcher4, or SHA-256]
    Send stream [fletcher4]

With this in mind, I don't understand your steel analogy.
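
As an aside, only the per-dataset data checksum in that list is visible to and settable by the administrator; a small sketch, using the zfs01 pool name from this thread:

    zfs get -r checksum zfs01       # show the current setting for every dataset
    zfs set checksum=sha256 zfs01   # newly written data (and inheriting children) use sha256

The label, uberblock, gang block, and ZIL checksums are fixed by the implementation and are not affected by this property.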

wrt the ZIL, it is rarely used for normal file system access.  ZIL blocks
are allocated from the pool as needed and freed no more than 30
seconds later, unless there is a sudden halt. If the system is halted
then the ZIL is used to roll forward transactions. The heads do not
spend a disproportionate amount of time over the intent log.
 -- richard

I should have enumerated the knowns and unknowns in my list last  
night, then I would not have annoyed you with my apparent deafness.   
(Hopefully I am not still being deaf).  I will clarify below, as I  
should have last night:



Given that I only have 1.6TB of data in a 4TB pool, what can I do to
change those blocks to sha256 or Fletcher4:

(1) Without destroying and recreating the zpool under U4

I know how to fix the user data (just change checksum property on  
the pool using zfs specifying the pool vs. a zfs file system, then  
copy the data).


I don't know (am ignorant of) blocks comprising the underlying  
zpool, and how to fix them without recreating the pool.  It makes  
sense to me that at least some would be rewritten in the course of  
using the system, but (1) I have had no confirmation or denial that  
this is the case, (2) I don't know if this is all of them or some of  
them, (3) I don't know if the checksum= parameter would effect these  
(relling's Oct 2 at 3:26 post implies that it does not by lack of  
reference to checksum property).  So I don't know yet how much  
exposure will remain.  I would think that if the user specified a  
stronger checksum for their data that the system would abandon its  
use of weaker ones in the underlying structure, 

Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Miles Nordin
 re == Richard Elling richard.ell...@gmail.com writes:

re If I was to refer to Fletcher's algorithm, I would use
re Fletcher.  When I am referring to the ZFS checksum setting of
re fletcher2 I will continue to use fletcher2

haha okay, so to clarify, when reading a Richard Elling post:

 fletcher2 = ZFS's broken attempt to implement a 32-bit Fletcher checksum

 Fletcher  = hypothetical correct implementation of a Fletcher checksum

In that case, for clarity I think I'll have to use the word ``broken''
a lot more often.

  How does the fix included in S10u8 and snv_114 work?

re The best I can tell, the comments are changed to indicate
re fletcher2 is deprecated.

You are saying the ``fix'' was a change in documentation, nothing
else?  The default is still fletcher2, and there is no correct
implementation of the Fletcher checksum, only the
good-enough-but-broken fletcher4, which is not the default?

Also, there is no way to use a non-broken checksum on the ZIL?

doesn't sound fixed to me.  At least there is some transparency,
though, and a partial workaround.




Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Bob Friesenhahn

On Sat, 3 Oct 2009, Miles Nordin wrote:

   re The best I can tell, the comments are changed to indicate
   re fletcher2 is deprecated.

You are saying the ``fix'' was a change in documentation, nothing
else?  The default is still fletcher2, and there is no correct
implementation of the Fletcher checksum only the
good-enough-but-broken fletcher4, which is not the default?


It seems that my memory is kind of crappy (like fletcher2).

There were discussions of the fletcher2 issue on the zfs-code list 
starting in March and ending in May:


http://mail.opensolaris.org/pipermail/zfs-code/2009-March/thread.html
http://mail.opensolaris.org/pipermail/zfs-code/2009-April/thread.html
http://mail.opensolaris.org/pipermail/zfs-code/2009-May/thread.html

Unless someone has a legal requirement to prove data integrity, it 
does not seem like the fletcher2 woes are much for most people to be 
worried about.  After all, before zfs, this level of validation did 
not exist at all.  Fletcher2 will still catch most instances of data 
corruption.


One thing I did learn from this discussion is that when accessing 
uncached memory, the performance of fletcher2 and fletcher4 is roughly 
equivalent so there is usually no penalty for enabling fletcher4.  It 
does seem like there could be some CPU impact for synchronous writes 
from fletcher4 since it is more likely that the data is in cache for a 
synchronous write.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer, http://www.GraphicsMagick.org/


Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Richard Elling


On Oct 3, 2009, at 12:22 PM, Miles Nordin wrote:


re == Richard Elling richard.ell...@gmail.com writes:


   re If I was to refer to Fletcher's algorithm, I would use
   re Fletcher.  When I am referring to the ZFS checksum setting of
   re fletcher2 I will continue to use fletcher2

haha okay, so to clarify, when reading a Richard Elling post:

fletcher2 = ZFS's broken attempt to implement a 32-bit Fletcher  
checksum


Fletcher  = hypothetical correct implementation of a Fletcher checksum

In that case, for clarity I think I'll have to use the word ``broken''
a lot more often.


How does the fix included in S10u8 and snv_114 work?


   re The best I can tell, the comments are changed to indicate
   re fletcher2 is deprecated.

You are saying the ``fix'' was a change in documentation, nothing
else?  The default is still fletcher2, and there is no correct
implementation of the Fletcher checksum only the
good-enough-but-broken fletcher4, which is not the default?

Also, there is no way to use a non-broken checksum on the ZIL?


The ZIL is a slightly different beast. If there is a checksum mismatch
while processing the log, it signals the effective end of the log.  This
is why log entries are self-checksummed.  In other words, if you reach
garbage, then you've reached the end of the log. The probability of
the garbage having both a valid fletcher2 checksum at the proper
offset and having the proper sequence number and having the right
log chain link and having the right block size is considerably lower
than the weakness of fletcher2.

Unfortunately, the ZIL is also latency sensitive, so the performance
case gets stronger while the additional error checking already
boosts the dependability case.
 -- richard



doesn't sound fixed to me.  At least there is some transparency,
though, and a partial workaround.


Re: [zfs-discuss] Best way to convert checksums

2009-10-03 Thread Ray Clark
With respect to relling's Oct 3 2009 7:46 AM Post:

 I think you are missing the concept of pools. Pools contain datasets.
 One form of dataset is a file system. Pools do not contain data per se,
 datasets contain data. Reviewing the checksums used with this
 heirarchy in mind:

 Pool
 Label [SHA-256]
 Uberblock [SHA-256]
 Metadata [fletcher4]
 Gang block [SHA-256]
 ZIL log [fletcher2]

 Dataset (file system or volume)
 Metadata [fletcher4]
 Data [fletcher2 (default, today), fletcher4, or SHA-256]
 Send stream [fletcher4]

 With this in mind, I don't understand your steel analogy.

I am assuming based on the context of our presentation that the above list of 
pool stuff is exhaustive, that this is everything not in a dataset.

My steel analogy is based on the assumption that the pool-level stuff that 
you list above is needed to gain access to the dataset.  If the dataset can be 
accessed with all of the pool stuff trashed, then my steel thread does not 
exist.  But it also means that the pool stuff is extraneous, so I doubt that 
this is the case.

Given that all of the pool stuff is either sha256 or fletcher4 except for the 
ZIL, I have new understanding that suggests (though I don't understand the 
details of the system) that I am not depending on Fletcher2 protected data, and 
my steel thread is actually pretty thick, not 0.001.

Based on your comments regarding the ZIL, I am inferring that stuff is written 
there and never used except for a restart after a messy shutdown.  I might be 
exposed to whatever weakness the Fletcher2 has as implemented, but only in 
these rare circumstances.  Normal transactions and data would not be impacted 
by corruption in the ZIL blocks since they would never be used.  So a large 
layer of probability protects me:  I would have to have a crash coinciding with 
a corruption in the ZIL that hits on a fletcher2 weakness.

Based on all of this I believe I am relatively happy simply copying my data, 
not recreating my zpool.  

As Darren Moffat taught me, I can zfs set checksum=sha256 zfs01 where zfs01 
is the zpool, then zfs send zfs01/home@snapshot | zfs receive zfs01/home.new 
and the new file system will all be sha256 as long as I don't specify the -R 
option on the zfs send, and all of this is supported in U4.  I believe it has 
to be supported due to the presence of files with properties in the (odd?) zfs 
file system that exists at the zfs01 zpool level before creation of zfs file 
systems.
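
Spelled out end to end, that sequence might look something like this (snapshot and dataset names are illustrative, and it assumes the pool has room for a second copy of the data while both file systems exist):

    zfs set checksum=sha256 zfs01
    zfs snapshot zfs01/home@move
    zfs send zfs01/home@move | zfs receive zfs01/home.new
    zfs get checksum zfs01/home.new   # expect sha256, inherited from zfs01
    # after verifying the copy (see the rsync/MD5 discussion further down):
    zfs rename zfs01/home zfs01/home.old
    zfs rename zfs01/home.new zfs01/home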

So assuming the above process works, this thread is done as far as I am 
concerned right now.  

Thank you all for your help, not to snub anyone, but Darren, Richard, and Cindy 
especially come to mind.  Thanks for sparring with me until we understood each 
other.

--Ray


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Data security.  I migrated my organization from Linux to Solaris driven away 
from Linux by the shortfalls of fsck on TB size file systems, and towards 
Solaris by the features of ZFS.

At the time I tried to dig up information concerning tradeoffs associated with 
Fletcher2 vs. 4 vs. SHA256 and found nothing.  Studying the algorithms, I 
decided that fletcher2 would tend to be weak for periodic data, which 
characterizes my data.  I ran throughput tests and got 67MB/Sec for Fletcher2 
and 4 and 48MB/Sec for SHA256.  I projected (perhaps without basis) SHA256's 
cryptographic strength to also mean strength as a hash, and chose it since 
48MB/Sec is more than I need.
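
Roughly how such a throughput comparison might be run (illustrative only; a meaningful test needs a data set much larger than RAM and should account for write caching):

    zfs create zfs01/cktest
    for ck in fletcher2 fletcher4 sha256; do
        zfs set checksum=$ck zfs01/cktest
        ptime dd if=/dev/zero of=/zfs01/cktest/$ck.dat bs=1024k count=8192
    done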

21 months later (9/15/09) I lost everything to a corrupt metadata (Not sure 
where this was printed) ZFS-8000-CS.  No clue why to date, I will never know.  
The person who restored from tape was not informed to set checksum=sha256, so 
it all went in with the default, Fletcher2.

Before taking rather disruptive actions to correct this, I decided to question 
my original decision and found schlie's post stating that a bug in fletcher2 
makes it essentially a one bit parity on the entire block:
http://opensolaris.org/jive/thread.jspa?threadID=69655&tstart=30  While 
this is twice as good as any other file system in the world that has NO such 
checksum, this does not provide the security I migrated for.  Especially given 
that I did not know what caused the original data loss, it is all I have to 
lean on.

Convinced that I need to convert all of the checksums to sha256 to have the 
data security ZFS purports to deliver and in the absence of a checksum 
conversion capability, I need to copy the data.  It appears that all of the 
implementations of the various means of copying data, from tar and cpio to cp 
to rsync to pax have ghosts in their closets, each living in glass houses, and 
each throwing stones at the other with respect to various issues with file 
size, filename lengths, pathname lengths, ACLs, extended attributes, sparse 
files, etc. etc. etc.  

It seems like zfs send/receive *should* be safe from all such issues as part of 
the zfs family, but the questions raised here are ambiguous once one starts to 
think about it.  If the file system is faithfully duplicated, it should also 
duplicate all properties, including the checksum used on each block.  It 
appears (to my advantage) that this is not what is done.  This enables the 
filesystem spontaneously created by zfs receive to inherit from the pool, which 
evidently can be set to sha256 though it is a pool not a file system in the 
pool.  The present question is protection on the base pool.  This can be set 
when the pool is created, though not with U4 which I am running.  It is not 
clear (yet) if this is simply not documented with the current release or if the 
version that supports this has not been released yet.  If I were to upgrade 
(Which I cannot do in a timely fashion), it would only be to U7.  I cannot run 
a weekly build type of OS on my production server.  Any way 
 it goes I am hosed.  In short there is surely some structure, some blocks with 
stuff written in them when a pool is created but before anything else is done, 
else it would be a blank disk, not a zfs pool.  Are these protected by 
Fletcher2 as the default?  I have learned that the uberblock is protected by 
SHA256, other parts by Fletcher4.  Is this everything?  In U4 was it fletcher4, 
or was this a recent change stemming from Schlie's report?

In short, what is the situation with regard to the data security I switched to 
Solaris/ZFS for, and what can I do to achieve it?  What *do* the tools do?  Are 
there tools for what needs to be done to convert things, to copy things, to 
verify things, and to do so completely and correctly?  

So here is where I am:  I should zfs send/receive, but I cannot have confidence 
that there are not fletcher2 protected blocks (1 bit parity) at the most 
fundamental levels of the zpool.  To verify data, I cannot depend on existing 
tools since diff is not large file aware.  My best idea at this point is to 
calculate and compare MD5 sums of every file and spot check other properties as 
best I can.
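
One way the MD5 comparison might be scripted with the digest(1) utility that ships with Solaris (paths illustrative; this checks file contents only, not ACLs or other properties):

    cd /zfs01/home     && find . -type f -exec digest -v -a md5 {} \; | sort > /var/tmp/old.md5
    cd /zfs01/home.new && find . -type f -exec digest -v -a md5 {} \; | sort > /var/tmp/new.md5
    diff /var/tmp/old.md5 /var/tmp/new.md5    # no output means every file matched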

Given this rather full perspective, help or comments very appreciated.  I still 
think zfs is the way to go, but the road is a little bumpy at the moment.


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Apologies that the preceding post appears out of context.  I expected it to 
indent as I pushed the reply button on myxiplx' Oct 1, 2009 1:47 post.  It 
was in response to his question.  I will try to remember to provide links 
internal to my messages.


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Tomas Ögren
On 02 October, 2009 - Ray Clark sent me these 4,4K bytes:

 Data security.  I migrated my organization from Linux to Solaris
 driven away from Linux by the the shortfalls of fsck on TB size file
 systems, and towards Solaris by the features of ZFS.
[...]
 Before taking rather disruptive actions to correct this, I decided to
 question my original decision and found schlie's post stating that a
 bug in fletcher2 makes it essentially a one bit parity on the entire
 block:
 http://opensolaris.org/jive/thread.jspa?threadID=69655&tstart=30
 While this is twice as good as any other file system in the world that
 has NO such checksum, this does not provide the security I migrated
 for.  Especially given that I did not know what caused the original
 data loss, it is all I have to lean on.
...

That post refers to bug 6740597
http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6740597
which also refers to
http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=2178540

So it seems like it's fixed in snv114 and s10u8, which won't help your
s10u4 unless you update..

/Tomas
-- 
Tomas Ögren, st...@acc.umu.se, http://www.acc.umu.se/~stric/
|- Student at Computing Science, University of Umeå
`- Sysadmin at {cs,acc}.umu.se


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Replying to Cindy's Oct 1, 2009 3:34 PM post:

Thank you.   The second part was my attempt to guess my way out of this.  If 
the fundamental structure of the pool (That which was created before I set the 
checksum=sha256 property) is using fletcher2, perhaps as I use the pool all of 
this structure will be updated, and therefore automatically migrate to the new 
checksum.  It would be very difficult for me to recreate the pool, but I have 
space to duplicate the user files (and so get the new checksum).  Perhaps 
this will also result in the underlying structure of the pool being converted 
in the course of normal use.  

Comments for or against?


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Replying to relling's October 1, 2009 3:34 post:

Richard, regarding when a pool is created, there is only metadata which uses 
fletcher4.  Was this true in U4, or is this a new change of default with U4 
using fletcher2?  Similarly, did the uberblock use sha256 in U4?  I am running 
U4.

--Ray


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ross
Interesting answer, thanks :)

I'd like to dig a little deeper if you don't mind, just to further my own 
understanding (which is usually rudimentary compared to a lot of the guys on 
here).  My belief is that ZFS stores two copies of the metadata for any block, 
so corrupt metadata really shouldn't happen often.

Could I ask what the structure of your pool is, what level of redundancy do you 
have there?  The very fact that you had a 'corrupt metadata' error implies to 
me that the checksums have done their job in finding an error, and I'm 
wondering if the true cause could be further down the line.

I'm still taking all this in though - we'll be using sha256 on our secondary 
system, just in case :)


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
My pool was the default, with checksum=sha256.  The default has two copies of all 
metadata (as I understand it), and one copy of user data.  It was a raidz2 with 
eight 750GB drives, yielding just over 4TB of usable space.  

I am not happy with the situation, but I recognize that I am 2x better off (1 
bit parity) than I would be with any other file system.


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Marion Hakanson
webcl...@rochester.rr.com said:
  To verify data, I cannot depend on existing tools since diff is not large
 file aware.  My best idea at this point is to calculate and compare MD5 sums
 of every file and spot check other properties as best I can. 

Ray,

I recommend that you use rsync's -c to compare copies.  It reads all the
source files, computes a checksum for them, then does the same for the
destination and compares checksums.  As far as I know, the only thing
that rsync can't do in your situation is the ZFS/NFSv4 ACL's.  I've used
it to migrate many TB's of data.
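
For example, a dry run along these lines reports any file whose contents differ between the source and the copy (paths illustrative; the trailing slashes matter to rsync):

    rsync -n -a -c -v /zfs01/home/ /zfs01/home.new/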

Regards,

Marion






Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Cindy Swearingen

Ray,

The checksums are set on the file systems, not the pool.

If a new checksum is set and *you* rewrite the data, then the rewritten
data will contain the new checksum. If your pool has the space for you
to duplicate the user data and the new checksum is set, then the duplicated
data will have the new checksum.


ZFS doesn't rewrite data as part of normal operations. I confirmed with
a simple test (like Darren's) that even if you have a single-disk pool 
and the disk is replaced and all the data is resilvered and a new 
checksum is set, you'll see data with the previous checksum and the new
checksum.
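
A rough reconstruction of that kind of test using small file-backed vdevs (illustrative only):

    mkfile 128m /var/tmp/d1 /var/tmp/d2
    zpool create testpool /var/tmp/d1
    cp /etc/release /testpool/before.txt             # written with the default checksum
    zfs set checksum=sha256 testpool
    cp /etc/release /testpool/after.txt              # written with sha256
    zpool replace testpool /var/tmp/d1 /var/tmp/d2   # resilver copies blocks as-is
    # zdb's block-pointer output will still show the old checksum on before.txt
    zpool destroy testpool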

Cindy

On 10/02/09 08:44, Ray Clark wrote:

Replying to Cindys Oct 1, 2009 3:34 PM post:

Thank you.   The second part was my attempt to guess my way out of this.  If the fundamental structure of the pool (That which was created before I set the checksum=sha256 property) is using fletcher2, perhaps as I use the pool all of this structure will be updated, and therefore automatically migrate to the new checksum.  It would be very difficult for me to recreate the pool, but I have space to duplicate the user files (and so get the new checksum).  Perhaps this will also result in the underlying structure of the pool being converted in the course of normal use.  


Comments for or against?



Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Richard Elling

On Oct 2, 2009, at 7:46 AM, Ray Clark wrote:


Replying to relling's October 1, 2009 3:34 post:

Richard, regarding when a pool is created, there is only metadata  
which uses fletcher4.  Was this true in U4, or is this a new change  
of default with U4 using fletcher2?  Similarly, did the uberblock
use sha256 in U4?  I am running U4.


ZFS uses different checksums for different things. Briefly,

use           checksum
------------  -------------------------------------------------
uberblock     SHA-256, self-checksummed
labels        SHA-256
metadata      fletcher4
data          fletcher2 (default), set with checksum parameter
ZIL log       fletcher2, self-checksummed
gang block    SHA-256, self-checksummed

The parent holds the checksum for an entity that is not self-checksummed.

The big question, that is currently unanswered, is do we see single
bit faults in disk-based storage systems? The answer to this question
must be known before the effectiveness of a checksum can be evaluated.
The overwhelming empirical evidence suggests that fletcher2 catches
many storage system corruptions.
 -- richard



Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Miles Nordin
 re == Richard Elling richard.ell...@gmail.com writes:
 r == Ross  myxi...@googlemail.com writes:

re The answer to this question must be known before the
re effectiveness of a checksum can be evaluated.

...well...we can use math to know that a checksum is effective.  What
you are really suggesting we evaluate ``empirically'' is the degree of
INeffectiveness of the broken checksum.

 r ZFS stores two copies of the metadata for any block, so
 r corrupt metadata really shouldn't happen often.

the other copy probably won't be read if the first copy read has a
valid checksum.  I think it'll more likely just lazy-panic instead.
If that's the case, the two copies won't help cover up the broken
checksum bug.  but Richard's table says metadata has fletcher4 which
the OP said is as good as the correct algorithm would have been, even
in its broken implementation, so long as it's only used up to
128kByte.  It's only data and ZIL that has the relevantly-broken
checksum, according to his math.

re The overwhelming empirical evidence suggests that fletcher2
re catches many storage system corruptions.

What do you mean by the word ``many''?  It's a weasel-word.  It
basically means, AFAICT, ``the broken checksum still trips
sometimes.''  But have you any empirical evidence about the fraction
of real world errors which are still caught by the broken checksum
vs. those that are not?  I don't see how you could.

How about cases where checksums are not used to correct bit-flip
gremlins but relied upon to determine whether a data structure is
fully present (committed) yet, like in the ZIL, or to determine which
half of a mirror is stale---these are cases where checksums could be
wrong even if the storage subsystem is functioning in an ideal way.

Checksum weakness on ZFS where checksums are presumed good by other
parts of the design could potentially be worse overall than a
checksumless design.  That's not my impression, but it's the right
place to put the bar.  Ray's ``well at least it's better than no
checksums'' is wrong because it presumes ZFS could function as well as
another filesystem if ZFS were using a hypothetical null checksum.  It
couldn't.

Anyway I'm glad the problem is both fixed and also avoidable on the
broken systems.  I just think the doublespeak after the fact is, once
again, not helping anyone.




Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Richard Elling

Hi Miles, good to hear from you again.

On Oct 2, 2009, at 1:20 PM, Miles Nordin wrote:


re == Richard Elling richard.ell...@gmail.com writes:
r == Ross  myxi...@googlemail.com writes:


   re The answer to this question must be known before the
   re effectiveness of a checksum can be evaluated.

...well...we can use math to know that a checksum is effective.  What
you are really suggesting we evaluate ``empirically'' is the degree of
INeffectiveness of the broken checksum.


By your logic, SECDED ECC for memory is broken because it only
corrects 1 bit per symbol and only detects brokenness of 2 bits per
symbol. However, the empirical evidence suggests that ECC provides
a useful function for many people. Do we know how many triple bit
errors occur in memories? I can compute the probability, but have
never seen a field failure analysis. So, if ECC is good enough for
DRAM, is fletcher2 good enough for storage?

NB, for DRAM the symbol size is usually 64 bits. For the ZFS case, the
symbol size is 4,096 to 1,048,576 bits. AFAIK, no collisions have been
found in SHA-256 digests for symbols of size 1,048,576, but it has not
been proven that they do not exist.


r ZFS stores two copies of the metadata for any block, so
r corrupt metadata really shouldn't happen often.

the other copy probably won't be read if the first copy read has a
valid checksum.  I think it'll more likely just lazy-panic instead.
If that's the case, the two copies won't help cover up the broken
checksum bug.  but Richard's table says metadata has fletcher4 which
the OP said is as good as the correct algorithm would have been, even
in its broken implementation, so long as it's only used up to
128kByte.  It's only data and ZIL that has the relevantly-broken
checksum, according to his math.

   re The overwhelming empirical evidence suggests that fletcher2
   re catches many storage system corruptions.

What do you mean by the word ``many''?  It's a weasel-word.


I'll blame the lawyers. They are causing me to remove certain words
from my vocabulary :-(


 It
basically means, AFAICT, ``the broken checksum still trips
sometimes.''  But have you any empirical evidence about the fraction
of real world errors which are still caught by the broken checksum
vs. those that are not?  I don't see how you could.


Question for the zfs-discuss participants, have you seen a data corruption
that was not detected when using fletcher2?

Personally, I've seen many corruptions of data stored on file systems
lacking checksums.


How about cases where checksums are not used to correct bit-flip
gremlins but relied upon to determine whether a data structure is
fully present (committed) yet, like in the ZIL, or to determine which
half of a mirror is stale---these are cases where checksums could be
wrong even if the storage subsystem is functioning in an ideal way.

Checksum weakness on ZFS where checksums are presumed good by other
parts of the design could potentially be worse overall than a
checksumless design.  That's not my impression, but it's the right
place to put the bar.  Ray's ``well at least it's better than no
checksums'' is wrong because it presumes ZFS could function as well as
another filesystem if ZFS were using a hypothetical null checksum.  It
couldn't.


I'm in Ray's camp. I've got far too many scars from data corruption and I'd
rather not add more.
 -- richard



Anyway I'm glad the problem is both fixed and also avoidable on the
broken systems.  I just think the doublespeak after the fact is, once
again, not helping anyone.


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Replying to hakanson's Oct 2, 2009 2:01 post:

Thanks.  I suppose it is true that I am not even trying to compare the 
peripheral stuff, and simple presence of a file and the data matching covers 
some of them.  

Using it for moving data, one encounters a longer list:  sparse files, ACL 
handling, extended attributes, length of filenames, length of pathnames, large 
files.  And probably other interesting things that may not be handled 
correctly.

Most information about misbehavior of the various archive / backup / data 
movement utilities is very old.  One wonders how they behave today.  This would 
be a useful compilation, but I can't do it.


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Cindy's Oct 2, 2009 2:59 post:  Thanks for staying with me.

Re: The checksums are set on the file systems not the pool.:

But previous responses seem to indicate that I can set them for files stored in 
the filesystem that appears to be the pool, at the pool level, before I create 
any new ones.  One post seems to indicate that there is a checksum property for 
this file system, and independently for the pool.  (This topic needs a 
picture).  

Re: If a new checksum is set and *you* rewrite the data ... then the 
duplicated data will have the new checksum.

Understand.  Now I am on to being concerned for the blocks that comprise the 
zpool that *contain* the file system.

Re: ZFS doesn't rewrite data as part of normal operations.  I confirmed with a 
simple test (like Darren's) that even if you have a single-disk pool and the 
disk is replaced and all the data is resilvered and a new checksum is set, 
you'll see data with the previous checksum and the new checksum.

Yes, ... a resilver duplicates exactly.  Darren's example showed that without 
the -R, no properties were sent and the zfs receive had no choice but to use 
the pool default for the zfs filesystem that it created.  This also implies 
that there was a property associated with the pool.  So my previous comment 
about zfs send/receive not duplicating exactly was not fair.  The man page / 
admin guide should be clear as to what is sent without -R.  I would have 
guessed everything, just not descendent file systems.  It is a shame that zdb 
is totally undocumented.  I thought I had discovered a gold mine when I first 
read Darren's note!
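
For what it is worth, zdb does take a handful of single-letter options even though they are undocumented; from memory (details vary by release, so treat this as a rough guide):

    zdb zfs01      # dump the pool configuration, uberblocks, and dataset/object summaries
    zdb -C zfs01   # just the cached pool configuration
    zdb -b zfs01   # traverse the pool and account for every allocated block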

--Ray


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Re: relling's Oct 2, 2009 3:26 Post:

(1) Is this list everything?
(2) Is this the same for U4?
(3) If I change the zpool checksum property on creation as you indicated in 
your Oct 1, 12:51 post (evidently very recent versions only), does this change 
the checksums used for this list?  Why would not the strongest checksum be used 
for the most fundamental data rather than fool around, allowing the user to 
compromise only when the tradeoff pays back on the 99% bulk of the data?

Re: The big question, that is currently unanswered, is do we see single bit 
faults in disk-based storage systems?

I don't think this is the question.  I believe the implication of schlie's post 
is not that single bit faults will get through, but that the current fletcher2 
is equivalent to a single bit checksum.  You could have 1,000 bits in error, or 
4095, and still have a 50-50 chance of detecting it.  A single bit error would 
be certain to be detected (I think) even with the current code.
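
One way to see how bad it is -- this is my reading of the OpenSolaris 
fletcher_2_native source, simplified by ignoring the two interleaved streams, 
so treat the details as illustrative rather than authoritative.  The checksum 
is essentially two running 64-bit sums over the 64-bit words w of the block:

   a = (a + w) mod 2^64
   b = (b + a) mod 2^64

Flipping the most significant bit of one word changes that word by +2^63 or 
-2^63, and both are congruent to 2^63 mod 2^64, so the final a always shifts by 
exactly 2^63.  Flip the top bit of two words and a shifts by 2*2^63 = 2^64, 
i.e. not at all; b shifts by (n-i)*2^63 + (n-j)*2^63 for word positions i and j 
in a block of n words, which is also 0 mod 2^64 whenever i and j have the same 
parity.  So whole families of multi-bit corruptions confined to the high-order 
bits cancel out completely and are never detected, which is the sense in which 
the implementation is far weaker than a proper Fletcher checksum.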
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Re: Miles Nordin Oct 2, 2009 4:20:

Re: Anyway, I'm glad the problem is both fixed...

I want to know HOW it can be fixed?  If they fixed it, this will invalidate 
every pool that has not been changed from the default (Probably almost all of 
them!).  This can't be!  So what WAS done?  In the interest of honesty in 
advertising and enabling people to evaluate their own risks, I think we should 
know how it was fixed.  Something either ingenious or potentially misleading 
must have been done.  I am not suggesting that it was not the best way to 
handle a difficult situation, but I don't see how it can be transparent.  If 
the string fletcher2 does the same thing, it is not fixed.  If it does 
something different, it is misleading.  

... and avoidable on the broken systems.

Please tell me how!  Without destroying and recreating my zpool, I can only fix 
the zfs file system blocks, not the underlying zpool blocks.  WITH destroying 
and recreating my zpool, I can only control the checksum on the underlying 
zpool using a version of Solaris that is not yet available.  And even then (pending 
relling's response) it may or may not *still* affect the blocks I am concerned 
about.  So how is this avoidable?  It is partially avoidable (so far) IF I have 
the luxury of doing significant rebuilding.  No?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Richard Elling

On Oct 2, 2009, at 3:05 PM, Ray Clark wrote:


Re: relling's Oct 2, 2009 3:26 Post:

(1) Is this list everything?


AFAIK


(2) Is this the same for U4?


Yes.  This hasn't changed in a very long time.

(3) If I change the zpool checksum property on creation as you  
indicated in your Oct 1, 12:51 post (evidently very recent versions  
only), does this change the checksums used for this list?  Why would  
not the strongest checksum be used for the most fundamental data  
rather than fool around, allowing the user to compromise only when  
the tradeoff pays back on the 99% bulk of the data?


Performance.  Many people value performance over dependability.

Re: The big question, that is currently unanswered, is do we see  
single bit faults in disk-based storage systems?


I don't think this is the question.  I believe the implication of  
schlie's post is not that single bit faults will get through, but  
that the current fletcher2 is equivalent to a single bit checksum.   
You could have 1,000 bits in error, or 4095, and still have a 50-50  
chance of detecting it.  A single bit error would be certain to be  
detected (I think) even with the current code.


I don't believe schlie posted the number of fletcher2 collisions for the
symbol size used by ZFS. I do not believe it will be anywhere near
50% collisions.
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Re: relling's Oct 2 5:06 Post:

Re: analogy to ECC memory... 

I appreciate the support, but the ECC memory analogy does not hold water.  ECC 
memory is designed to correct for multiple independent events, such as 
electrical noise, bits flipped due to alpha particles from the DRAM package, or 
cosmic rays.  The probability of these independent events coinciding in time 
and space is very small indeed.  It works well.  

ZFS does purport to cover errors such as these in the crummy double-layer 
boards without sufficient decoupling, microcontrollers and memories without 
parity or ECC, etc. found in the cost-reduced-to-the-razor's-edge hardware most 
of us run on, but it also covers system-level errors such as entire blocks 
being replaced, or large fractions of them being corrupted by high-level bugs.  
With the current fletcher2 we have only a 50-50 chance of catching these 
multi-bit errors.  The probability of multiple bits being changed is not small, 
because the probabilities of the error mechanism affecting the 4096~1048576 
bits in the block are not independent.  Indeed, in many of the show-cased 
mechanisms, it is a sure bet - the entire disk sector is written with the wrong 
data, for sure!  Although there is a good chance that many of the bits in the 
sector happen to match, there is an excellent chance that many are different.  
And the mechanisms that caused these differences were not independent.

Re: AFAIK, no collisions have been found in SHA-256 digests for symbols of 
size 1,048,576, but it has not been proven that they do not exist

For sure they exist: by simple counting, for every SHA256 digest there are on 
average 2^(1,048,576 - 256) distinct 1,048,576-bit blocks that will produce it.  
One hopes that the same properties that make SHA256 a good cryptographic hash 
also make it a good hash, period.  This, I admit, is a leap of ignorance (at 
least I know what cliff I am leaping off of).

Regarding the question of what people have seen, I have seen lots of 
unexplained things happen, and by definition one never knows why.  I am not 
interested in seeing any more.  I see the potential for disaster, and my time, 
and the time of my group, is better spent doing other things.  That is why I 
moved to ZFS.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Miles Nordin
 re == Richard Elling richard.ell...@gmail.com writes:

re By your logic, SECDED ECC for memory is broken because it only
re corrects

ECC is not a checksum.

Go ahead, get out your dictionary, enter severe-pedantry-mode.  But it
is relevantly different.  In data transmission scenarios, for example,
FECs like ECC are often used along with a strong non-correcting
checksum over a larger block.

The OP further described scenarios plausible for storage, like ``long
string of zeroes with 1 bit flipped'', that produce collisions with
the misimplemented fletcher2 (but, obviously, not with any strong
checksum like correct-fletcher2).

re is fletcher2 good enough for storage?

yes, it probably is good enough, but ZFS implements some other broken
algorithm and calls it fletcher2.  so, please stop saying fletcher2.

re I'll blame the lawyers. They are causing me to remove certain
re words from my vocabulary :-(

yeah, well, allow me to add a word back to the vocabulary: BROKEN.

If you are not legally allowed to use words like broken and working,
then find another identity from which to talk, please.

re Question for the zfs-discuss participants, have you seen a
re data corruption that was not detected when using fletcher2?

This is ridiculous.  It's not fletcher2, it's brokenfletcher2.  It's
avoidably extremely weak.  It's reasonable to want to use a real
checksum, and this PR game you are playing is frustrating and
confidence-harming for people who want that.  

This does not have to become a big deal, unless you try to spin it
with a 7200rpm PR machine like IBM did with their broken Deathstar
drives before they became HGST.

Please, what we need to do is admit that the checksum is relevantly
broken in a way that compromises the integrity guarantees with which
ZFS was sold to many customers, fix the checksum, and learn how to
conveniently migrate our data.

Based on the table you posted, I guess file data can be set to
fletcher4 or sha256 using filesystem properties to work around the
bug on Solaris versions with the broken implementation.

 1. What's needed to avoid fletcher2 on the ZIL on broken Solaris
versions?

 2. I understand the workaround, but not the fix.  

How does the fix included S10u8 and snv_114 work?  Is there a ZFS
version bump?  Does the fix work by implementing fletcher2
correctly?  or does it just disable fletcher2 and force everything
to use brokenfletcher4 which is good enough?  If the former, how
are the broken and correct versions of fletcher2
distinguished---do they show up with different names in the pool
properties?

Once you have the fixed software, how do you make sure fixed
checksums are actually covering data blocks originally written by
old broken software?  I assume you have to use rsync or zfs
send/recv to rewrite all the data with the new checksum?  If yes,
what do you have to do before rewriting---upgrade solaris and then
'zfs upgrade' each filesystem one by one?  Will zfs send/recv work
across the filesystem versions, or does the copying have to be
done with rsync?

 3. speaking of which, what about the checksum in zfs send streams?
is it also fletcher2, and if so was it also fixed in
s10u8/snv_114, and how does this affect compatibility for people
who have ignored my advice and stored streams instead of zpools?
Will a newer 'zfs recv' always work with an older 'zfs send' but
not the other way around?

there is basically no information about implementing the fix in the
bug, and we can't write to the bug from outside Sun.  Whatever
sysadmins need to do to get their data under the strength of checksum
they thought it was under, it might be nice to describe it in the bug
for whoever gets referred to the bug and has an affected version.


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Ray Clark
Let me try to refocus:

Given that I have a U4 system with a zpool created with Fletcher2:

What blocks in the system are protected by Fletcher2, or even Fletcher4 
(although that does not worry me so much)?

Given that I only have 1.6TB of data in a 4TB pool, what can I do to change 
those blocks to sha256 or Fletcher4:

(1) Without destroying and recreating the zpool under U4

(2) With destroying and recreating the zpool under U4 (Which I don't really 
have the resources to pull off)

(3) With upgrading to U7 (Perhaps in a few months)

(4) With upgrading to U8

Thanks.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Richard Elling

On Oct 2, 2009, at 3:44 PM, Ray Clark wrote:


Let me try to refocus:

Given that I have a U4 system with a zpool created with Fletcher2:

What blocks in the system are protected by Fletcher2, or even  
Fletcher4 although that does not worry me so much.


Given that I only have 1.6TB of data in a 4TB pool, what can I do to  
change those blocks to sha256 or Fletcher4:


(1) Without destroying and recreating the zpool under U4

(2) With destroying and recreating the zpool under U4 (Which I don't  
really have the resources to pull off)


(3) With upgrading to U7 (Perhaps in a few months)

(4) With upgrading to U8


This has been answered several times in this thread already:

   zfs set checksum=sha256 filesystem

then copy your files -- all newly written data will have the sha256
checksums.
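
For example, something along these lines (the dataset names are just 
illustrative, and the copy tool is up to you):

   zfs set checksum=sha256 zfs01
   zfs create zfs01/home.sha256              (the new dataset inherits sha256)
   cp -rp /zfs01/home/* /zfs01/home.sha256/  (or rsync; watch out for dot-files
                                              at the top level with the * glob)
   zfs get checksum zfs01/home.sha256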


 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-02 Thread Richard Elling

On Oct 2, 2009, at 3:36 PM, Miles Nordin wrote:


re == Richard Elling richard.ell...@gmail.com writes:


   re By your logic, SECDED ECC for memory is broken because it only
   re corrects

ECC is not a checksum.


SHA-256 is not a checksum, either, but that isn't the point. The concern is
that corruption can be detected.  ECC has very, very limited detection
capabilities, yet it is good enough for many people. We know that
MOS memories have certain failure modes that cause bit flips and by
using ECC and interleaving, the dependability is improved. The big
question is, what does the corrupted data look like in storage? Random
bit flips? Big chunks of zeros? 55aa patterns? Since the concern with
the broken fletcher2 is restricted to the most significant bits, we are
most concerned with failures where the most significant bits are set to
ones. But as I said, we have no real idea what the corrupted data
should look like, and if it is zero-filled, then fletcher2 will catch it.



Go ahead, get out your dictionary, enter severe-pedantry-mode.  but it
is relevantly different.  In for example data transmission scenarios,
FEC's like ECC are often used along with a strong noncorrecting
checksum over a larger block.

The OP further described scenarios plausible for storage, like ``long
string of zeroes with 1 bit flipped'', that produce collisions with
the misimplemented fletcher2 (but, obviously, not with any strong
checksum like correct-fletcher2).

   re is fletcher2 good enough for storage?

yes, it probably is good enough, but ZFS implements some other broken
algorithm and calls it fletcher2.  so, please stop saying fletcher2.


If I were to refer to Fletcher's algorithm, I would use Fletcher.  When I
am referring to the ZFS checksum setting of fletcher2 I will continue
to use fletcher2.


   re I'll blame the lawyers. They are causing me to remove certain
   re words from my vocabulary :-(

yeah, well, allow me to add a word back to the vocabulary: BROKEN.

If you are not legally allowed to use words like broken and working,
then find another identity from which to talk, please.

   re Question for the zfs-discuss participants, have you seen a
   re data corruption that was not detected when using fletcher2?

This is ridiculous.  It's not fletcher2, it's brokenfletcher2.  It's
avoidably extremely weak.  It's reasonable to want to use a real
checksum, and this PR game you are playing is frustrating and
confidence-harming for people who want that.


There is no PR campaign. It is what it is. What is done is done.


This does not have to become a big deal, unless you try to spin it
with a 7200rpm PR machine like IBM did with their broken Deathstar
drives before they became HGST.

Please, what we need to do is admit that the checksum is relevantly
broken in a way that compromises the integrity guarantees with which
ZFS was sold to many customers, fix the checksum, and learn how to
conveniently migrate our data.


Unfortunately, there is a backwards compatibility issue that
requires the current fletcher2 to live for a very long time. The
only question for debate is whether it should be the default.
To date, I see no field data that suggests it is not detecting
corruption.


Based on the table you posted, I guess file data can be set to
fletcher4 or sha256 using filesystem properties to work around the
bug on Solaris versions with the broken implementation.

1. What's needed to avoid fletcher2 on the ZIL on broken Solaris
   versions?


Please file RFEs at bugs.opensolaris.org


2. I understand the workaround, but not the fix.

   How does the fix included S10u8 and snv_114 work?  Is there a ZFS
   version bump?  Does the fix work by implementing fletcher2
   correctly?  or does it just disable fletcher2 and force everything
   to use brokenfletcher4 which is good enough?  If the former, how
   are the broken and correct versions of fletcher2
   distinguished---do they show up with different names in the pool
   properties?


As best I can tell, the comments are changed to indicate fletcher2 is
deprecated. However, it must live on (forever) because of backwards
compatibility. I presume one day the default will change to fletcher4
or something else. This is implied by zfs(1m):

 checksum=on | off | fletcher2 | fletcher4 | sha256

 Controls the checksum used to verify data integrity. The
 default  value  is  on,  which  automatically selects an
 appropriate algorithm (currently,  fletcher2,  but  this
 may  change  in future releases). The value off disables
 integrity checking on user data. Disabling checksums  is
 NOT a recommended practice.
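
To see what a given pool's datasets are using today, something like the 
following works (the pool name here is just an example):

  zfs get -r checksum zfs01

The SOURCE column shows whether the value is the default, was set locally, or 
was inherited.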


   Once you have the fixed software, how do you make sure fixed
   checksums are actually covering data blocks originally written by
   old broken software?  I assume you have to use rsync or zfs
   send/recv to rewrite all the data with the new checksum?  If yes,
   what do you have to do before rewriting---upgrade 

Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Darren J Moffat

Ray Clark wrote:

Dynamite!

I don't feel comfortable leaving things implicit.  That is how misunderstandings happen.  


It isn't implicit; it is explicitly inherited.  That is how ZFS is designed 
to (and does) work.



Would you please acknowledge that zfs send | zfs receive uses the checksum 
setting on the receiving pool instead of preserving the checksum algorithm used 
by the sending block?


For now it depends whether or not you pass -R to 'zfs send'. 
Without the -R argument the send stream does not have any properties in 
it, so it will (by design) use those that would be used if the dataset 
was created by 'zfs create'.


In the future there will be a distinction between the local and the 
received values see the recently (yesterday) approved case PSARC/2009/510:


http://arc.opensolaris.org/caselog/PSARC/2009/510/20090924_tom.erickson

Lets look at how it works just now:

portellen:pts/2# zpool create dummy c7t3d0
portellen:pts/2# zfs create dummy/home
portellen:pts/2# cp /etc/profile /dummy/home
portellen:pts/2# zfs get checksum dummy/home
NAME        PROPERTY  VALUE   SOURCE
dummy/home  checksum  on      default
portellen:pts/2# zfs snapshot dummy/home@1
portellen:pts/2# zfs set checksum=sha256 dummy
portellen:pts/2# zfs send dummy/home@1 | zfs recv -F dummy/home.sha256
portellen:pts/2# zfs get checksum dummy/home.sha256
NAME               PROPERTY  VALUE   SOURCE
dummy/home.sha256  checksum  sha256  inherited from dummy

Now let's verify using zdb; we should have two plain file blocks 
(/etc/profile fits in a single ZFS block), one from the original 
dummy/home and one from the newly received home.sha256.


portellen:pts/2# zdb -vvv -S user:all dummy
0	2048	1	ZFS plain file	fletcher4	uncompressed	8040e8f120:a2c635bc0556:73b5ba539e9699:3b4d66984ac9d6b4
0	2048	1	ZFS plain file	SHA256	uncompressed	57f1e8168c58e8cf:3b20be148f57852e:f72ee8e3358f:1bfae4ae0599577c




--
Darren J Moffat
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Frank Middleton

On 10/01/09 05:08 AM, Darren J Moffat wrote:


In the future there will be a distinction between the local and the
received values see the recently (yesterday) approved case PSARC/2009/510:

http://arc.opensolaris.org/caselog/PSARC/2009/510/20090924_tom.erickson


Currently non-recursive incremental streams send properties and full
streams don't. Will the p flag reverse its meaning for incremental
streams? For my purposes the current behavior is the exact opposite
of what I need and it isn't obvious that the case addresses this
peculiar inconsistency without going through a lot of hoops. I suppose
the new properties can be sent initially so that subsequent incremental
streams won't override the possibly changed local properties, but that
seems so complicated :-). If I understand the case correctly, we can
now set a flag that says ignore properties sent by any future incremental
non-recursive stream. This instead of having a flag for incremental
streams that says don't send properties. What happens if sometimes
we do and sometimes we don't? Sounds like a static property when a
dynamic flag is really what is wanted and this is a complicated way of
working around a design inconsistency. But maybe I missed something :-)

So what would the semantics of the new p flag be for non-recursive
incremental streams?

Thanks -- Frank
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Ray Clark
Darren, thank you very much!  Not only have you answered my question, you have 
made me aware of a tool to verify, and probably do a lot more (zdb).

Can you comment on my concern regarding what checksum is used in the base zpool 
before anything is created in it?  (No doubt my terminology is wrong, but you 
get the idea I am sure).  

The single critical feature of ZFS is debatably that every block on ZFS is 
checksummed to enable detection of corruption, but it appears that the user 
does not have the ability to choose the checksum for the highest levels of the 
pool itself.  Given the issue with fletcher2, this is of concern!  Since this 
activity was kicked off by a Corrupt Metadata ZFS-8000-CS, I am trying to 
move away from fletcher2.  Don't know if that was the cause, but my goal is to 
restore the safety that we went to ZFS for.

Is my understanding correct?
Are there ways to control the checksum algorithm on the empty zpool?

Thanks, again.

--Ray
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Richard Elling

On Oct 1, 2009, at 7:10 AM, Ray Clark wrote:

Darren, thank you very much!  Not only have you answered my  
question, you have made me aware of a tool to verify, and probably  
do alot more (zdb).


Can you comment on my concern regarding what checksum is used in the  
base zpool before anything is created in it?  (No doubt my  
terminology is wrong, but you get the idea I am sure).


The single critical feature of ZFS is debatably that every block on  
ZFS is checksummed to enable detection of corruption, but it appears  
that the user does not have the ability to choose the checksum for  
the highest levels of the pool itself.  Given the issue with  
fletcher2, this is of concern!  Since this activity was kicked off  
by a Corrupt Metadata ZFS-8000-CS, I am trying to move away from  
fletcher2.  Don't know if that was the cause, but my goal is to  
restore the safety that we went to ZFS for.


Is my understanding correct?
Are there ways to control the checksum algorithm on the empty zpool?


You can set both zpool (-o option) and zfs (-O option) options when you
create the zpool. See zpool(1m)
 -- richard


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Ross
Ray, if you don't mind me asking, what was the original problem you had on your 
system that makes you think the checksum type is the problem?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Ray Clark
U4 zpool does not appear to support the -o option...   The current zpool 
manpage online lists the valid properties for zpool -o, and 
checksum is not one of them.  Are you mistaken or am I missing something?

Another thought is that *perhaps* all of the blocks that comprise an empty 
zpool are re-written sooner or later, and once the checksum is changed with 
zfs set checksum=sha256 zfs01 (The pool name) they will be re-written with 
the new checksum very soon anyway.  Is this true?  This would require an 
understanding of the on-disk structure and when what is rewritten.

--Ray
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Ross
Ray, if you use -o it sets properties for the pool.  If you use -O (capital), 
it sets the filesystem properties for the default filesystem created with the 
pool.

zpool create -O can use any valid zfs filesystem property.

But I agree, it's not very clearly documented.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Cindy Swearingen
You are correct. The zpool create -O option isn't available in a Solaris 10 
release but will be soon. This will allow you to set the file system 
checksum property when the pool is created:

# zpool create -O checksum=sha256 pool c1t1d0
# zfs get checksum pool
NAME  PROPERTY  VALUE  SOURCE
pool  checksum  sha256 local

Otherwise, you would have to set it like this:

# zpool create pool c1t1d0
# zfs set checksum=sha256 pool
# zfs get checksum pool
NAME  PROPERTY  VALUE  SOURCE
pool  checksum  sha256 local

I'm not sure I understand the second part of your comments but will add:

If *you* rewrite your data then the new data will contain the new
checksum. I believe an upcoming project will provide the ability to
revise file system properties on the fly.
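
For what it's worth, zdb can show which checksum algorithm actually covers each 
block after the data is rewritten -- an invocation along the lines of

  # zdb -vvv -S user:all pool

lists the user data blocks along with their checksum algorithm.  zdb is 
undocumented, though, so treat it as a debugging aid rather than a supported 
interface.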


On 10/01/09 12:21, Ray Clark wrote:

U4 zpool does not appear to support the -o option...   Reading a current zpool 
manpage online lists the valid properties for the current zpool -o, and 
checksum is not one of them.  Are you mistaken or am I missing something?

Another thought is that *perhaps* all of the blocks that comprise an empty zpool are 
re-written sooner or later, and once the checksum is changed with zfs set 
checksum=sha256 zfs01 (The pool name) they will be re-written with the new checksum 
very soon anyway.  Is this true?  This would require an understanding of the on-disk 
structure and when what is rewritten.

--Ray

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-10-01 Thread Richard Elling
Also, when a pool is created, there is only metadata, which uses fletcher4[*].
So it is not a crime if you set the checksum after the pool is created and 
before data is written :-)

* note: the uberblock uses SHA-256
 -- richard


On Oct 1, 2009, at 12:34 PM, Cindy Swearingen wrote:

You are correct. The zpool create -O option isn't available in a  
Solaris 10 release but will be soon. This will allow you to set the  
file system

checksum property when the pool is created:

# zpool create -O checksum=sha256 pool c1t1d0
# zfs get checksum pool
NAME  PROPERTY  VALUE  SOURCE
pool  checksum  sha256 local

Otherwise, you would have to set it like this:

# zpool create pool c1t1d0
# zfs set checksum=sha256 pool
# zfs get checksum pool
NAME  PROPERTY  VALUE  SOURCE
pool  checksum  sha256 local

I'm not sure I understand the second part of your comments but will  
add:


If *you* rewrite your data then the new data will contain the new
checksum. I believe an upcoming project will provide the ability to
revise file system properties on the fly.


On 10/01/09 12:21, Ray Clark wrote:
U4 zpool does not appear to support the -o option...   Reading a  
current zpool manpage online lists the valid properties for the  
current zpool -o, and checksum is not one of them.  Are you  
mistaken or am I missing something?
Another thought is that *perhaps* all of the blocks that comprise  
an empty zpool are re-written sooner or later, and once the  
checksum is changed with zfs set checksum=sha256 zfs01 (The pool  
name) they will be re-written with the new checksum very soon  
anyway.  Is this true?  This would require an understanding of the  
on-disk structure and when what is rewritten.

--Ray

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-30 Thread Darren J Moffat

Ray Clark wrote:

When using zfs send/receive to do the conversion, the receive creates a new 
file system:

   zfs snapshot zfs01/home@before
   zfs send zfs01/home@before | zfs receive afx01/home.sha256

Where do I get the chance to zfs set checksum=sha256 on the new file system 
before all of the files are written ???


Set it on the afx01 dataset before you do the receive and it will be 
inherited.
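
Roughly (using the names from your commands; I have not run this exact 
sequence, so treat it as a sketch):

   zfs set checksum=sha256 afx01
   zfs send zfs01/home@before | zfs receive afx01/home.sha256
   zfs get checksum afx01/home.sha256    (should report sha256, inherited
                                          from afx01)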


--
Darren J Moffat
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-30 Thread Ray Clark
I made a typo... I only have one pool.  I should have typed:

   zfs snapshot zfs01/home@before
   zfs send zfs01/home@before | zfs receive zfs01/home.sha256

Does that change the answer?

And independently of whether it does or not, zfs01 is a pool, and the property is on 
the home zfs file system.

I cannot change it on the file system before doing the receive because the file 
system does not exist - it is created by the receive.

This raises a related question: is the file system on the receiving end created 
entirely using the checksum property from the source file system, or are the 
blocks and their present mix of checksums faithfully recreated in the 
received file system?

Finally, is there any way to verify behavior after it is done?

Thanks for helping on this.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-30 Thread Darren J Moffat

Ray Clark wrote:

I made a typo... I only have one pool.  I should have typed:

   zfs snapshot zfs01/home@before
   zfs send zfs01/home@before | zfs receive zfs01/home.sha256

Does that change the answer?


No, it doesn't change my answer.


And independently if it does or not, zfs01 is a pool, and the property is on 
the home zfs file system.


It doesn't matter if zfs01 is the top-level dataset or not.

Before you do the receive do this:

zfs set checksum=sha256 zfs01

--
Darren J Moffat
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-30 Thread Ray Clark
Dynamite!

I don't feel comfortable leaving things implicit.  That is how 
misunderstandings happen.  

Would you please acknowledge that zfs send | zfs receive uses the checksum 
setting on the receiving pool instead of preserving the checksum algorithm used 
by the sending block?

Thanks a million!
--Ray
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-30 Thread Ray Clark
Sinking feeling...

zfs01 was originally created with fletcher2.  Doesn't this mean that the sort 
of root-level stuff in the zfs pool exists with fletcher2 and so is not well 
protected?

If so, is there a way to fix this short of a backup and restore?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-29 Thread Ray Clark
When using zfs send/receive to do the conversion, the receive creates a new 
file system:

   zfs snapshot zfs01/home@before
   zfs send zfs01/home@before | zfs receive afx01/home.sha256

Where do I get the chance to zfs set checksum=sha256 on the new file system 
before all of the files are written ???

The new filesystem is created automatically by the receive command!

Although it does not say so in the man page or zfs admin guide, it certainly 
seems reasonable that I don't get a chance - the idea is that send/receive 
recreates the file system exactly.  

This would still have an ambiguity as to whether the new blocks are 
created/copied with the checksum algorithm they had in the source filesystem 
(Which would not result in the conversion I am trying to accomplish), or are 
they created and checksumed with the algorithm specified by the checksum 
PROPERTY set in the source file system at the time of the send/receive (which 
WOULD do the conversion I am trying to accomplish)?

Is there a way to use send/receive to duplicate a filesystem with a different 
checksum, or do I use cpio or tar?  (I pick on cpio and tar because they are 
specifically called out in the zfs admin manual as saving and restoring zfs 
file attributes and ACLs).

Thanks.

--Ray
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-26 Thread Orvar Korvar
I had this same question. I was recommended to use rsync or zfs send. I used 
both just to be safe. With zfs send, you create a snapshot and then send the 
snapshot. After deleting the snapshot on the target, you have identical copies. 
rsync seems to be used for this task also. And also zfs send.
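
If you go the rsync route, note that a plain archive copy like

   rsync -a /zfs01/home/ /zfs01/home.sha256/

carries the file data and ordinary permissions but may not carry ZFS/NFSv4 ACLs 
or extended attributes, so spot-check those afterwards.  The paths here are 
just placeholders.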
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] Best way to convert checksums

2009-09-25 Thread Ray Clark
What is the best way to convert the checksums of an existing ZFS file system 
from one checksum to another?  To me "best" means safest and most complete.

My zpool is 39% used, so there is plenty of space available.

Thanks.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Best way to convert checksums

2009-09-25 Thread Ray Clark
I didn't want my question to lead toward a particular answer, but perhaps I 
should have given more information.  My idea is to copy the file system with one of the following:
   cp -rp
   zfs send | zfs receive
   tar
   cpio
But I don't know what would be the best.

Then I would do a diff -r on them before deleting the old.

I don't know the obscure (for me) secondary things like attributes, links, 
extended modes, etc.

Thanks again.
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss