Re: Data Deduplication with the help of an online filesystem check

Thomas Glanzmann Mon, 04 May 2009 08:16:34 -0700

Hello Ric,

> (1) Block level or file level dedup?


What is the difference between the two?

> (2) Inband dedup (during a write) or background dedup?

I think inband dedup is way too intensive on resources (memory) and
would also kill every performance benchmark. So I think offline
(background) dedup is the right way to go.
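
A rough sketch of what such a background pass could look like, assuming
fixed-size 4 KiB blocks and SHA-256 as the hash. The name
`find_candidates` and the block size are illustrative assumptions only,
not anything btrfs actually does:

```python
import hashlib
from collections import defaultdict

BLOCK_SIZE = 4096  # assumed block size, for illustration only


def find_candidates(data, block_size=BLOCK_SIZE):
    """Group block indices by hash and return only the groups that
    contain two or more blocks, i.e. the dedup candidates."""
    by_hash = defaultdict(list)
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        digest = hashlib.sha256(block).hexdigest()
        by_hash[digest].append(offset // block_size)
    return {h: idxs for h, idxs in by_hash.items() if len(idxs) > 1}
```

Since this runs outside the write path, the hash table only has to
exist while the scan runs, which is the memory argument for doing it
in the background.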

> (3) How reliably can you protect the pool of blocks? How reliably can
> you protect the database that maps hashes to blocks?

You have to lock down the I/O requests for the blocks in question and
compare them byte by byte anyway, just to make sure.
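
Roughly, that verification step could be sketched as follows. The
in-memory `BlockStore` class is hypothetical and only stands in for the
filesystem, and a single `threading.Lock` stands in for holding off I/O
on the blocks being compared:

```python
import threading


class BlockStore:
    """Hypothetical in-memory stand-in for a pool of blocks."""

    def __init__(self, blocks):
        self.blocks = list(blocks)               # physical blocks
        self.mapping = list(range(len(blocks)))  # logical -> physical index
        self.lock = threading.Lock()             # stands in for blocking I/O

    def merge_if_identical(self, keep, dup):
        """Remap logical block `dup` onto the physical block of `keep`,
        but only if both hold exactly the same bytes."""
        with self.lock:  # no write may slip in between compare and remap
            a = self.blocks[self.mapping[keep]]
            b = self.blocks[self.mapping[dup]]
            if a != b:   # hash collision, or a block changed since the scan
                return False
            self.mapping[dup] = self.mapping[keep]
            return True
```

With the byte-by-byte comparison in place, a matching hash is only a
hint that two blocks might be equal, never the reason they get merged.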

> (4) Can you give users who are somewhat jaded confidence in your
> solution (this is where stats come in very handy!)

For virtual machines you can reduce the used data by 1/3. Of course it
can blow up in your face when you don't watch your physical resources
closely.

        Thomas
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
