Re: [zfs-discuss] Alternatives to NFS for sharing ZFS

Jim Klimov Mon, 24 Oct 2011 07:05:22 -0700

2011-10-24 12:49, Humberto N. Castejon Martinez wrote:

Hi,
I would like to share my ZFS filesystem over the network and also make it fault tolerant. I am after performance and fault tolerance, but I do not want to lose the advantages of deduplication, cloning and snapshotting offered by ZFS. I have read something about Lustre being integrated with ZFS, so that could be an option, right? Could I also use, for example, MooseFS? Thanks!

Well, I do hope someone will prove me wrong, but here's how I see the situation now:

A single ZFS pool (containing all the datasets, their snapshots and clones) can only be "imported" on one server at a time. So whatever the configuration (Lustre on top of ZFS datasets, or ZFS in Lustre volumes, if either of these is possible at all now), that single ZFS node would be your bottleneck and SPOF.

For fault tolerance you might have two identical (equivalent) storage nodes serving the same data and replicating changes to each other (with double the storage requirements), or you can build an HA system with shared storage equally accessible by two servers (all storage, including the cache SSDs which you should have if performance matters), with ZFS being served from no more than one of these servers at any time. Such an HA system might be built with dual-pathed direct connections (i.e. dual-port SAS enclosures, backplanes and, further on, disks, including SSDs) connected to HBAs in two servers, or by SAN switch meshing.

In case of replication, it can be tricky to determine the authoritative side when conflicts arise. If only one side is guaranteed to use a certain dataset in RW mode during any given time interval, you could replicate it to the other side by sending snapshots. If access is file-based and a certain file is only changed on one side, you might use an rsync loop to replicate these changes continually.
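
To make the two replication options above a bit more concrete, here is a minimal Python sketch that only builds the command lines and runs nothing. The dataset, snapshot, host and directory names are hypothetical placeholders, and it assumes the receiving side never writes to the replicated dataset (which is what makes "zfs receive -F" acceptable). Treat it as an illustration under those assumptions, not a tested replication tool.

```python
import shlex
from typing import List, Optional


def replication_commands(dataset: str, new_snap: str, remote_host: str,
                         prev_snap: Optional[str] = None) -> List[str]:
    """Build shell command lines that snapshot `dataset` and send it to `remote_host`.

    With `prev_snap` set, only the changes since that snapshot are sent
    (incremental stream); otherwise the full stream is sent.
    """
    new = f"{dataset}@{new_snap}"
    if prev_snap is None:
        send = ["zfs", "send", new]
    else:
        send = ["zfs", "send", "-i", f"{dataset}@{prev_snap}", new]
    # -F rolls the target back to its most recent snapshot before receiving.
    # Assumption: the receiving side is never written to locally.
    receive = ["ssh", remote_host, "zfs", "receive", "-F", dataset]
    return [
        shlex.join(["zfs", "snapshot", new]),
        shlex.join(send) + " | " + shlex.join(receive),
    ]


def rsync_command(src_dir: str, remote_host: str, dst_dir: str) -> str:
    """Build one pass of a file-level replication loop (archive mode, no deletes)."""
    # Trailing slashes make rsync copy the directory contents, not the directory itself.
    src = src_dir.rstrip("/") + "/"
    dst = f"{remote_host}:{dst_dir.rstrip('/')}/"
    return shlex.join(["rsync", "-a", src, dst])


if __name__ == "__main__":
    # Hypothetical names: pool "tank", dataset "data", peer "backup-host".
    for line in replication_commands("tank/data", "s2", "backup-host", prev_snap="s1"):
        print(line)
    print(rsync_command("/tank/data", "backup-host", "/tank/data"))
```

The generated lines would be run from cron or a loop on whichever side currently holds RW mastership for that dataset; deciding which side that is remains the hard part.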

Either way, having several ZFS nodes with identical data is your best shot at parallel performance and fault tolerance at once (an NFS client can be configured to fail over between identical servers), as long as you can figure out the RW mastership. I am not sure if you can designate a single NFS server as the write master and several others as read-only slaves (like you can with LDAP servers, for example), but even then your NFS clients would have to allow some time for write replications to propagate to the slave nodes. During that time, reads (of recent changes) should also be handled by the write master.
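
The write-master idea above can be sketched as a small routing rule. This is purely illustrative Python: the class, its method names and the fixed lag window are my own assumptions, and nothing here is an existing NFS client feature. Writes always go to the master, and a path that was written within the assumed replication lag is also read from the master; everything else is spread over the read-only nodes.

```python
import time
from typing import Callable, Dict, List


class ReadWriteRouter:
    """Pick a server per request: one write master, several read-only replicas."""

    def __init__(self, master: str, replicas: List[str], lag_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.master = master
        self.replicas = list(replicas)
        self.lag_seconds = lag_seconds      # assumed upper bound on replication delay
        self.clock = clock                  # injectable so the logic can be tested
        self._last_write: Dict[str, float] = {}
        self._next = 0                      # round-robin cursor over the replicas

    def route_write(self, path: str) -> str:
        # Remember when the path was last written, then send the write to the master.
        self._last_write[path] = self.clock()
        return self.master

    def route_read(self, path: str) -> str:
        written = self._last_write.get(path)
        # A recent change may not have reached the replicas yet: read from the master.
        if written is not None and self.clock() - written < self.lag_seconds:
            return self.master
        if not self.replicas:
            return self.master
        server = self.replicas[self._next % len(self.replicas)]
        self._next += 1
        return server


if __name__ == "__main__":
    # Hypothetical host names.
    router = ReadWriteRouter("zfs-master", ["zfs-ro1", "zfs-ro2"], lag_seconds=30)
    print(router.route_write("/export/data/file"))  # goes to the master
    print(router.route_read("/export/data/file"))   # still the master, inside the lag window
    print(router.route_read("/export/data/other"))  # one of the read-only nodes
```

Note that this only tracks writes made through the same router instance; with many independent clients the "recently written" knowledge would itself have to be shared, which is part of why the problem is tricky.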

All in all, to me this now seems like a tricky quest (which I pondered for a while and abandoned for now). I would be happy to read that it is indeed possible, and how it is doable ;)


//Jim


