On Tue, Jul 6, 2010 at 10:29 AM, Arne Jansen <sensi...@gmx.net> wrote:
> Daniel Carosone wrote:
>> Something similar would be useful, and much more readily achievable,
>> from ZFS from such an application, and many others.  Rather than a way
>> to compare reliably between two files for identity, I'ld liek a way to
>> compare identity of a single file between two points in time.  If my
>> application can tell quickly that the file content is unaltered since
>> last time I saw the file, I can avoid rehashing the content and use a
>> stored value. If I can achieve this result for a whole directory
>> tree, even better.
>
> This would be great for any kind of archiving software. Aren't zfs checksums
> already ready to solve this? If a file changes, it's dnodes' checksum changes,
> the checksum of the directory it is in and so forth all the way up to the
> uberblock.
> There may be ways a checksum changes without a real change in the files 
> content,
> but the other way round should hold. If the checksum didn't change, the file
> didn't change.
> So the only missing link is a way to determine zfs's checksum for a
> file/directory/dataset. Am I missing something here? Of course atime update
> should be turned off, otherwise the checksum will get changed by the archiving
> agent.

What is the likelihood that the same data is re-written to the file?
If that is unlikely, it looks as though znode_t's z_seq may be useful.
 While it isn't a checksum, it seems to be incremented on every file
change.

-- 
Mike Gerdts
http://mgerdts.blogspot.com/
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to