Re: Offline Deduplication for Btrfs

2011-01-05 Thread Ray Van Dolson
On Wed, Jan 05, 2011 at 07:41:13PM +0100, Diego Calleja wrote: On Miércoles, 5 de Enero de 2011 18:42:42 Gordan Bobic escribió: So by doing the hash indexing offline, the total amount of disk I/O required effectively doubles, and the amount of CPU spent on doing the hashing is in no way

Re: Btrfs TODO

2009-04-09 Thread Ray Van Dolson
On Thu, Apr 09, 2009 at 12:24:18PM -0400, Josef Bacik wrote: Hello, Trying to put together a list of TODO items for btrfs so we can update the wiki page fully. So far these things are on the list * Proper ENOSPC handling * O_DIRECT support (without checksumming) * AIO support *

Re: Data De-duplication

2008-12-10 Thread Ray Van Dolson
I lost the original post so I'm jumping in at the wrong thread-point :) Someone mentioned that the primary usage of de-dup is in the backup realm. True perhaps currently, but de-dup IMO is *the* killer app in the world of virtualization and is a huge reason why we're picking NetApp at work to

Re: Data De-duplication

2008-12-10 Thread Ray Van Dolson
On Wed, Dec 10, 2008 at 01:57:54PM -0800, Tracy Reed wrote: On Wed, Dec 10, 2008 at 09:42:16PM +, Oliver Mattos spake thusly: I'm considering writing that script to test on my ext3 disk just to see how much duplicate wasted data I really have. Check out the fdupes command. In Fedora 8