A recent increase in email about ZFS and SNDR (the replication
component of Availability Suite), has given me reasons to post one of
my replies.
Well, now I'm confused! A colleague just pointed me towards your blog
entry about SNDR and ZFS which, until now, I thought was not a
supported configuration. So, could you confirm that for me one way
or the other?
ZFS is supported with SNDR, because SNDR is filesystem-agnostic. That
said, ZFS is a very different beast than other Solaris filesystems.
The two golden rules of ZFS replication are:
1). All volumes in a ZFS storage pool (see the output of zpool status)
must be placed in a single SNDR I/O consistency group. ZFS is the
first Solaris filesystem that validates consistency at all levels, so
all vdevs in a single storage pool must be replicated in a write-order
consistent manner, and an SNDR I/O consistency group is the means to
accomplish this.
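As a sketch, enabling replication for a two-vdev pool might look like
the following. The host names, device paths, bitmap volumes, and the
group name (zpool-tank) are all placeholders of my own invention, and
the exact sndradm syntax should be verified against the AVS release
you are running:

```shell
# Enable an SNDR set for each vdev in the pool, putting every set in
# the same I/O consistency group (the trailing "g zpool-tank") so the
# secondary always sees writes in a write-order consistent manner.
# All hosts, devices, and bitmaps below are illustrative placeholders.
sndradm -e primary /dev/rdsk/c1t0d0s0 /dev/rdsk/c1t0d0s1 \
           secondary /dev/rdsk/c1t0d0s0 /dev/rdsk/c1t0d0s1 \
           ip async g zpool-tank
sndradm -e primary /dev/rdsk/c1t1d0s0 /dev/rdsk/c1t1d0s1 \
           secondary /dev/rdsk/c1t1d0s0 /dev/rdsk/c1t1d0s1 \
           ip async g zpool-tank
```

If even one vdev of the pool is left out of the group, the secondary
copy of the pool can be internally inconsistent and unimportable.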
2). While SNDR replication is active, do not attempt to zpool import
the SNDR secondary volumes, and while the ZFS storage pool is imported
on the SNDR secondary node, do not resume replication. This is truly a
double-edged sword: the instance of ZFS running on the SNDR secondary
node will see replicated writes arriving from ZFS on the SNDR primary
node, treat the resulting unexpected checksums as some form of data
corruption, and panic Solaris. This is the same reason two or more
Solaris hosts can't access the same ZFS storage pool in a SAN.
There is a slight safety net here, in that zpool import will report
that the ZFS storage pool is active on another node. Unfortunately,
stopping replication does not change this state, so you will still
need to use the -f (force) option, unless the zpool is in the exported
state on the SNDR primary node, as the exported state will be
replicated to the SNDR secondary node.
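Putting rule 2 and the -f note together, a failover to the secondary
after the primary has died might be sketched as follows; the group and
pool names are placeholders, and the options should be checked against
your AVS release:

```shell
# On the SNDR secondary node, after the primary has failed:
sndradm -g zpool-tank -l    # put the I/O consistency group into
                            # logging mode, halting replication
zpool import -f tank        # -f is required: the pool still appears
                            # active on the (failed) primary node
```

The order matters: importing while replication is still active is
exactly the panic scenario described in rule 2 above.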
Of course I know that AVS only cares about blocks, so, in principle,
the FS is irrelevant. However, last time I was researching this, I
found a doc that explained that the lack of support was due to the
unpredictable nature of ZFS background processes (resilvering, etc.)
and the resulting inability to guarantee a truly quiesced FS.
ZFS the filesystem is always on-disk consistent, and ZFS does maintain
filesystem consistency through coordination between the ZPL (ZFS POSIX
Layer) and the ZIL (ZFS Intent Log). Unfortunately for SNDR, ZFS
caches a lot of an application's filesystem data in the ZIL, so that
data is in memory, not yet written to disk, and SNDR does not know it
exists. ZIL flushes to disk can be seconds behind the actual
application writes completing, and if SNDR is running asynchronously,
the replicated writes on the SNDR secondary can be additional seconds
behind the actual application writes.
Unlike UFS, with its lockfs -f and lockfs -w commands, there is no
'supported' way to get ZFS to empty the ZIL to disk on demand. So even
though one will get both ZFS and application filesystem consistency
within the SNDR secondary volume, there can be many seconds' worth of
lost data, since SNDR can't replicate what it does not see.
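Given that the exported state does replicate, the one clean way to
hand a pool over without losing in-flight data is a planned
switchover: export on the primary first, so everything is flushed to
disk and replicated before the secondary imports. A sketch, again with
placeholder group and pool names and options to be verified against
your AVS release:

```shell
# On the SNDR primary node (planned switchover):
zpool export tank           # flushes all pool state to disk and marks
                            # the pool exported; that state replicates
sndradm -g zpool-tank -w    # wait for the group's queued writes to drain
sndradm -g zpool-tank -l    # then drop the group into logging mode

# On the SNDR secondary node:
zpool import tank           # no -f needed: the pool arrived in the
                            # exported state
```

For an unplanned failover, there is no substitute for this sequence:
whatever was still in the ZIL on the primary is simply gone from the
secondary's point of view.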
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss