Re: [zfs-discuss] How does ZFS snapshot COW file data?

Eric Hamilton Thu, 05 Jul 2007 15:29:36 -0700

Thanks, Darren.

I've taken the liberty of reordering my follow-up for clarity. Just tobe clear, I'm not critiquing ZFS, just trying to learn it by pushing atsome of the corner cases of filesystem-system interactions. I'm tryingto figure out the implications of various ZFS features when used invarious ways.

Does that mean that paging out dirty mmap pages go to new places and
require metadata updates as well?

Yes.

Indeed, given that ZFS always writes new places (except for uberblock asyou noted), then that does make the snapshots easier and accounts forthere being no explicit code to "set COW on disk blocks", or such.

From a ufs/vxfs background, the thought of allocating additional diskstorage to page out to a memory mapped file or of changing metadata topoint to new blocks and hence needing to write out metadata as part ofpaging out modified file data seems foreign to me. If the metadatawrites can be bundled into the same transaction, perhaps there's no moreserialization latency on a pageout... ? Is this also another case whereone might get ENOSPC when one doesn't on other filesystems (paging outto an existing MMF in a full ZFS pool)?

From the comments in the source tour about the ZIL, I did note thestatement that file contents do not go through the ZIL unless needed forO_DSYNC or fsynch() semantics, so I wasn't sure how else they might bedifferent.

How do snapshots interact with open files or files with pages in the
OpenSolaris page cache?


I don't believe they do.  Are you thinking of something in particular?

I am generally interested in understanding file consistency and cachecoherency. I'd like to know what exactly is being snapshotted and whatis consistent within a snapshot.

I subsequently saw an earlier thread on "ZFS consistency guarantee"(http://www.opensolaris.org/jive/thread.jspa;?messageID=124809) whereyou and others pointed out that application state is not consistent at asnapshot unless the application has been quiesced or otherwise broughtto a consistent state. Even then, I'm curious about the interactionwith the OpenSolaris page cache...

As is generally known and is explained well by Roch Bourbonnais inhttp://blogs.sun.com/roch/entry/nfs_and_zfs_a_fine, NFS places an extrarequirement for committing writes to stable storage upon file close.For local filesystems, a close() will complete without all modified filedata being written to disk yet. Does all such file data get into asnapshot, or only data that has happened to be pushed out to disk by thetime of the snapshot? (e.g. local open(), write(), close(), snapshot).

It does look to me from the comment and call to zil_suspend from withindmu_objset_snapshot_one that any changes that have made it to thefilesystem will get flushed out and included in the snapshot. Thisshould apply to any metadata operations that have completed such aslink, unlink, etc. But if the VM system is caching file contents aftera close (or at least nobody has pushed it out yet), is there any way toguarantee that such data makes it into the snapshot?

In my earlier question I was also thinking about about other cases suchas MMF where application has written to page with or without msync() oropen file after write() but no fsync(). Depending upon how data ofclosed files are handled, those may be moot.


Eric

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] How does ZFS snapshot COW file data?

Reply via email to