Peter A wrote:
On Thursday, January 06, 2011 05:48:18 am you wrote:
>> Can you elaborate what you're talking about here? How does the length of
>> a directory name affect alignment of file block contents? I don't see
>> how variability of length matters, other than to make things a lot more
>> complicated.
>
> I'm saying in a filesystem it doesn't matter - if you bundle everything into a
> backup stream, it does. Think of tar: 512-byte alignment. I tar up a
> directory with 8TB total size. No big deal. Now I create a new, empty file in
> this dir with a name that just happens to be the first in the dir. This adds
> 512 bytes close to the beginning of the tar file the second time I run tar. Now
> the remainder of the archive is all offset by 512 bytes and, if you do dedupe
> on fs-block-sized chunks larger than those 512 bytes, not a single byte will
> be de-duped.
OK, I get what you mean now. And I don't think this is something that
should be solved in the file system.
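To make the effect concrete, here's a minimal sketch of it (Python; the 4096-byte dedupe block size and the 1 MiB stream are illustrative assumptions, not anything tar or ZFS mandates). A single 512-byte record inserted near the front shifts every later fixed-size block boundary, so none of the old block hashes match:

```python
import hashlib
import os

BLOCK = 4096       # assumed fs-block-sized dedupe chunk
TAR_ALIGN = 512    # tar's record alignment

def fixed_chunks(data, size=BLOCK):
    """Hash each fixed-offset block of the stream."""
    return {hashlib.sha256(data[i:i + size]).hexdigest()
            for i in range(0, len(data), size)}

# Hypothetical "tar stream": 1 MiB of unchanged file content.
payload = os.urandom(1024 * 1024)

# Second backup run: one new 512-byte tar record near the start.
shifted = payload[:TAR_ALIGN] + bytes(TAR_ALIGN) + payload[TAR_ALIGN:]

before, after = fixed_chunks(payload), fixed_chunks(shifted)
print(f"blocks shared: {len(before & after)} of {len(before)}")
```

Because the 512-byte shift misaligns every 4096-byte block after the insertion point, essentially zero blocks dedupe between the two runs, exactly as described above.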
> I know it's a stupid example but it matches the backup-target and database dump
> usage pattern really well. Files backing iSCSI show similar dedupe behavior.
> Essentially every time you bundle multiple files together into one you run into
> things like that.
I suspect a large part of the underlying cause of this is that most
things still operate on 512 byte sectors. Once that is replaced with 4KB
sectors, that will probably go away. And if this is the case, then
perhaps making the block size "variable" to 512, 1024, 2048 and 4096
bytes (probably in reverse order) will achieve most of that benefit.
Whether that is a worthwhile thing to do for poorly designed backup
solutions is debatable, but I'm not convinced about the general use case.
It'd be very expensive and complicated for seemingly very limited benefit.
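For contrast, the dedicated backup deduplicators in question typically use content-defined chunking rather than fixed or merely variable-power-of-two block sizes: chunk boundaries are picked where a rolling hash of the data hits a magic value, so boundaries move with the content and resynchronise after an insert. A toy sketch (the rolling sum, window, and mask here are arbitrary illustrative choices, not the Rabin fingerprinting real products use):

```python
import hashlib
import os

WINDOW = 48             # rolling-hash window (arbitrary choice)
MASK = (1 << 11) - 1    # ~2 KiB average chunk size (arbitrary)

def cdc_chunks(data):
    """Cut a chunk wherever a rolling sum of the last WINDOW bytes
    hits the mask value; boundaries depend only on nearby content."""
    chunks, start, acc = set(), 0, 0
    for i, b in enumerate(data):
        acc += b
        if i >= WINDOW:
            acc -= data[i - WINDOW]
        if (acc & MASK) == MASK or i == len(data) - 1:
            chunks.add(hashlib.sha256(data[start:i + 1]).hexdigest())
            start = i + 1
    return chunks

payload = os.urandom(1024 * 1024)
shifted = payload[:512] + bytes(512) + payload[512:]

before, after = cdc_chunks(payload), cdc_chunks(shifted)
print(f"chunks shared: {len(before & after)} of {len(before)}")
```

Here only the one or two chunks straddling the insertion change; once the rolling window clears the inserted bytes, the boundaries line up with the old stream again and nearly everything dedupes. That resynchronisation, rather than fixed vs. variable block sizes per se, is plausibly where large dedupe-ratio differences come from.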
>> Have you some real research/scientifically gathered data
>> (non-hearsay/non-anecdotal) on the underlying reasons for the
>> discrepancy in the deduping effectiveness you describe? A 3-17x difference
>> doesn't plausibly come purely from fixed vs. variable length block sizes.
>
> Personal experience isn't hearsay :) Netapp publishes a whitepaper against the
> 7000 making this a big point, but that isn't publicly available. Try searching
> for "zfs dedupe +netbackup" or "zfs dedupe +datadomain" and similar - you will
> find hundreds of people all complaining about the same thing.
Typical. And no doubt they complain that ZFS isn't doing what they want,
rather than netbackup not co-operating. The solution to one misdesign
isn't an expensive bodge. The solution to this particular problem is to
make netbackup work on a per-file rather than per-stream basis.
Gordan
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html