What follows are the ramblings of a bored person on a train with an iPhone and an hour to kill. Probably you should filter this.

In the past I have built shared filesystems using DRBD to replicate a block device and then running OCFS2 on top as the filesystem. This worked well for building a redundant Xen setup with two servers, but probably wouldn't scale well.
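For the curious, here's roughly the shape of that two-node setup. This is a hedged sketch, not a tested config: the resource name, hostnames, and device paths are placeholders, and a real deployment also needs an o2cb cluster config for OCFS2 and proper fencing.

```
# /etc/drbd.conf fragment (hypothetical hosts xen1/xen2, resource "scratch"):
resource scratch {
  net {
    allow-two-primaries;        # both nodes write; OCFS2 arbitrates
  }
  startup {
    become-primary-on both;
  }
  on xen1 { device /dev/drbd0; disk /dev/sdb1; address 10.0.0.1:7789; meta-disk internal; }
  on xen2 { device /dev/drbd0; disk /dev/sdb1; address 10.0.0.2:7789; meta-disk internal; }
}

# Then, roughly: drbdadm create-md scratch; drbdadm up scratch;
# mkfs.ocfs2 /dev/drbd0 on one node; configure o2cb; mount on both.
```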

If you mounted your scratch at /local/nodename-scratch and had every node export this via NFS, then all nodes could mount each other's local scratch while preserving their individuality, as well as making the contents widely available to non-compute hosts if useful to do so.
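Concretely, something like the following (hostnames and subnet are made up; tune the export and mount options to taste):

```
# /etc/exports on node01 (each node exports its own scratch):
/local/node01-scratch  10.0.0.0/24(rw,async,no_subtree_check)

# /etc/fstab on node01, mounting the other nodes' scratch read-write:
node02:/local/node02-scratch  /local/node02-scratch  nfs  defaults,noatime  0 0
node03:/local/node03-scratch  /local/node03-scratch  nfs  defaults,noatime  0 0
```

Local I/O stays fast because each node's own scratch is a plain local disk; only cross-node reads touch the network.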

PVFS is a good option for this if you want the space all combined into a single volume. You could possibly see some improved performance depending on the shape of your I/O and compute load. Simply because of the MTBF of drives, this solution has robustness inversely proportional to node count unless you layer in some approach to disk redundancy. (DRBD between node pairs with some sort of failover would be a fun way to spend all your free time configuring.)
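To put a number on that robustness point: if each node's disk independently survives a year with probability p, a volume striped across n nodes with no redundancy survives only if every disk does, i.e. with probability p^n. A quick back-of-envelope (p = 0.97 is just an illustrative guess):

```python
# No-redundancy striped volume: data survives only if ALL n disks do.
def volume_survival(p: float, n: int) -> float:
    """Probability an n-node striped volume loses no data,
    given each disk survives independently with probability p."""
    return p ** n

for n in (1, 4, 16, 64):
    print(f"{n:3d} nodes: {volume_survival(0.97, n):.3f}")
```

At 64 nodes the survival probability has dropped to roughly 14%, which is why some layered redundancy becomes mandatory as the cluster grows.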

Lustre is an option but is purported to be a management pain to keep running. Similar to pvfs otherwise.

GlusterFS also looks very interesting for this type of work, but I've not scratched the propaganda deeply enough to understand how it works under the hood.

I'd love to hear about other options and what you eventually get working.

jbh

Sent from my iPhone

On Jul 1, 2009, at 2:15 PM, Edward Ned Harvey <[email protected]> wrote:

I have a bunch of compute servers. They all have local disks mounted as /scratch to use for computation scratch space. This ensures maximum performance on all systems, and no competition for a shared resource during crunch time. At present, all of their /scratch directories are local, separate and distinct. I think it would be awesome if /scratch looked the same on all systems. Does anyone know of a way to “unify” this storage, without compromising performance? Of course, if some files reside on server A, and they are requested from server B, then the files must go across the network, but I don’t want the files to go across the network unless they are requested. And yet, if you do something like “ls /scratch” you would ideally get the same results regardless of which machine you’re on.



Due to the nature of heavy runtime IO (read, seek, write, repeat…) it’s not well suited to NFS or any network filesystem… Due to the nature of many systems all doing the same thing at the same time, it’s not well suited to a SAN using shared disks…



I looked at gfs (the cluster filesystem) – but – it seems gfs assumes a shared disk (like a san) in which case there is competition for a shared resource.



I looked at gfs (the google filesystem) – but – it seems they constantly push all the data across the network, which is good for redundancy and mostly-just-read operations, and not good for heavy computation IO.



Not sure what else I should look at.  Any ideas?



TIA.

_______________________________________________
bblisa mailing list
[email protected]
http://www.bblisa.org/mailman/listinfo/bblisa
