Re: Truncating SHA2 hashes vs shortening a MAC for ZFS Crypto

Zooko Wilcox-O'Hearn Mon, 02 Nov 2009 13:29:33 -0800

Dear Darren J Moffat:

I don't understand why you need a MAC when you already have the hashof the ciphertext. Does it have something to do with the fact thatthe checksum is non-cryptographic by default (http://docs.sun.com/app/docs/doc/819-5461/ftyue?a=view ), and is that still true? Youroriginal design document [1] said you needed a way to force thechecksum to be SHA-256 if encryption was turned on. But back thenyou were planning to support non-authenticating modes like CBC. Iguess once you dropped non-authenticating modes then you could relaxthat requirement to force the checksum to be secure.

Too bad, though! Not only are you now tight on space in part becauseyou have two integrity values where one ought to do, but also asecure hash of the ciphertext is actually stronger than a MAC! Asecure hash of the ciphertext tells whether the ciphertext is right(assuming the hash function is secure and implemented correctly).Given that the ciphertext is right, then the plaintext is right(given that the encryption is implemented correctly and you use theright decryption key). A MAC on the plaintext tells you only thatthe plaintext was chosen by someone who knew the key. See what Imean? A MAC can't be used to give someone the ability to read somedata while withholding from them the ability to alter that data. Asecure hash can.

One of the founding ideas of the whole design of ZFS was end-to-endintegrity checking. It does that successfully now, for the case ofaccidents, using large checksums. If the checksum is secure then italso does it for the case of malice. In contrast a MAC doesn't do"end-to-end" integrity checking. For example, if you've previouslyallowed someone to read a filesystem (i.e., you've given them accessto the key), but you never gave them permission to write to it, butthey are able to exploit the isses that you mention at the beginningof [1] such as "Untrusted path to SAN", then the MAC can't stop themfrom altering the file, nor can the non-secure checksum, but a securehash can (provided that they can't overwrite all the way up theMerkle Tree of the whole pool and any copies of the Merkle Tree roothash).

Likewise, a secure hash can be relied on as a dedupe tag *even* ifsomeone with malicious intent may have slipped data into the pool.An insecure hash or a MAC tag can't -- a malicious actor could submitdata which would cause a collision in an insecure hash or a MAC tag,causing tag-based dedupe to mistakenly unify two different blocks.

So, since you're tight on space, it would be really nice if you couldtell your users to use a secure hash for the checksum and thenallocate more space to the secure hash value and less space to thenow-unnecessary MAC tag. :-)

Anyway, if this is the checksum which is used for dedupe thenremember the birthday so-called paradox -- some people may beuncomfortable with the prospect of not being able to safely dedupetheir 2^64-block storage pool if the hash is only 128 bits, forexample. :-) Maybe you could include the MAC tag in the dedupecomparison.

Also, the IVs for GCM don't need to be random, they need only to beunique. Can you use a block number and birth number or other suchguaranteed-unique data instead of storing an IV? (Apropos recentdiscussion on the cryptography list [2].)


Regards,

Zooko

[1] http://hub.opensolaris.org/bin/download/Project+zfs%2Dcrypto/files/zfs%2Dcrypto%2Ddesign.pdf

[2] http://www.mail-archive.com/[email protected]/msg11020.html
---
Your cloud storage provider does not need access to your data.
Tahoe-LAFS -- http://allmydata.org

---------------------------------------------------------------------
The Cryptography Mailing List
Unsubscribe by sending "unsubscribe cryptography" to [email protected]

Re: Truncating SHA2 hashes vs shortening a MAC for ZFS Crypto

Reply via email to