[zfs-discuss] Improving L1ARC cache efficiency with dedup

Brad Diggs Wed, 07 Dec 2011 10:48:24 -0800

Hello,

I have a hypothetical question regarding ZFS reduplication. Does the L1ARC cache benefit from reduplication

in the sense that the L1ARC will only need to cache one copy of the reduplicated data versus many copies?

Here is an example:

Imagine that I have a server with 2TB of RAM and a PB of disk storage. On this server I create a single 1TB

data file that is full of unique data. Then I make 9 copies of that file giving each file a unique name and

location within the same ZFS zpool. If I start up 10 application instances where each application reads all of

its own unique copy of the data, will the L1ARC contain only the deduplicated data or will it cache separate

copies the data from each file? In simpler terms, will the L1ARC require 10TB of RAM or just 1TB of RAM to

cache all 10 1TB files worth of data?

My hope is that since the data only physically occupies 1TB of storage via deduplication that the L1ARC

will also only require 1TB of RAM for the data.

Note that I know the deduplication table will use the L1ARC as well. However, the focus of my question

is on how the L1ARC would benefit from a data caching standpoint.

Thanks in advance!

Brad

Brad Diggs | Principal Sales Consultant

Tech Blog: http://TheZoneManager.com

LinkedIn: http://www.linkedin.com/in/braddiggs

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

[zfs-discuss] Improving L1ARC cache efficiency with dedup

Reply via email to