Peter,

That is right - some of them do not. My point was that Veritas fs already has many things implemented, like parallel fsck, copy-on-write checkpoints,etc. If it was used as a backend for the Lustre, that would be the perfect match. ZFS has some of its features, but not all.

But, let's say, adding things like that into the Lustre itself will make it even more complex, and now it is very complex already . Certainly, things like checkpoints can be added at MDT level - consider an inode on MDT pointing to another MDT inode, instead of the OST objects - that would be a clone. If the file is modified, then, the MDT inode becomes pointing to an OST object which keeps changed file blocks only. This will be sort of the checkpoint allowing to revert the file back. Well, this is is known to help restoring the data in case of the human error, or an application bug, it won't help to protect from HW induced errors. But, the parallel fsck issue is sort of standing alone - if we want fsck to be faster, we better make it parallel at every OST level - that's why I think this has to be done on the backend side.

Dmitry


Peter Braam wrote:
Dmitry, The point of the note is the opposite of what you write, namely that backend systems in fact do not solve this, unless they are guaranteed to be bug free.

Peter

On Fri, Jul 2, 2010 at 2:52 PM, Dmitry Zogin <[email protected] <mailto:[email protected]>> wrote:

    Hello Peter,

    These are really good questions posted there, but I don't think
    they are Lustre specific. These issues are sort of common to any
    file systems. Some of the mature file systems, like Veritas
    already solved this by

    1. Integrating the Volume management and File system. The file
    system can be spread across many volumes.
    2. Dividing the file system into a group of file sets(like data,
    metadata, checkpoints) , and allowing the policies to keep
    different filesets on different volumes.
    3. Creating the checkpoints (they are sort of like volume
    snapshots, but they are created inside the file system itself).
    The checkpoints are simply the copy-on-write filesets created
    instantly inside the fs itself. Using copy-on-write techniques
    allows to save the physical space and make the process of the file
    sets creation instantaneous. They do allow to revert back to a
    certain point instantaneously, as the modified blocks are kept
    aside, and the only thing that has to be done is to point back to
    the old blocks of information.
    4. Parallel fsck - if the filesystem consists of the allocation
    units - a sort of the sub- file systems, or cylinder groups,  then
    the fsck can be started in parallel on those units.

    Well, the ZFS does solve many of these issues, but in a different
    way, too.
    So, my point is that this probably has to be solved on the backend
    side of the Lustre, rather than inside the Lustre.

    Best regards,

    Dmitry

    Peter Braam wrote:
    I wrote a blog post that pertains to Lustre scalability and data
    integrity.  You can find it here:

    http://braamstorage.blogspot.com

    Regards,

    Peter
    ------------------------------------------------------------------------

    _______________________________________________
    Lustre-devel mailing list
    [email protected] <mailto:[email protected]>
    http://lists.lustre.org/mailman/listinfo/lustre-devel


------------------------------------------------------------------------

_______________________________________________
Lustre-devel mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-devel

_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to