christopher barry wrote:
On Tue, 2008-04-08 at 09:37 -0500, Wendy Cheng wrote:
[EMAIL PROTECTED] wrote:
my setup:
6 rh4.5 nodes, gfs1 v6.1, behind redundant LVS directors. I know it's
not new stuff, but corporate standards dictated the rev of rhat.
[...]
I'm noticing huge differences in compile times - or any home file access
really - when doing stuff in the same home directory on the gfs on
different nodes. For instance, the same compile on one node is ~12
minutes - on another it's 18 minutes or more (not running concurrently).
I'm also seeing weird random pauses in writes, like saving a file in vi,
which would normally take less than a second, may take up to 10 seconds.

Anyway, thought I would re-connect to you all and let you know how this
worked out. We ended up scrapping gfs. Not because it's not a great fs,
but because I was using it in a way that was playing to its weak
points. I had a lot of time and energy invested in it, and it was hard
to let it go. Turns out that connecting to the NetApp filer via nfs is
faster for this workload. I couldn't believe it either, as my bonnie and
dd-type tests showed gfs to be faster. But for the use case of large
sets of very small files, and lots of stats going on, gfs simply cannot
compete with NetApp's nfs implementation. GFS is an excellent fs, and it
has its place in the landscape - but for a development build system,
the NetApp is simply phenomenal.
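For anyone who wants to sanity-check this on their own setup, here is a rough sketch of the kind of small-file, stat-heavy microbenchmark that separated the two filesystems for us. It is not the actual harness we used - the path and counts are made up, so point it at your own mount and adjust:

import os
import time

# Hypothetical mount point - aim this at the filesystem under test
# (a GFS mount on one node, or the NetApp NFS mount).
ROOT = "/mnt/test/smallfile-bench"
NUM_FILES = 10000
PAYLOAD = b"x" * 512  # lots of tiny files, like headers in a build tree

def bench():
    if not os.path.isdir(ROOT):
        os.makedirs(ROOT)
    paths = [os.path.join(ROOT, "f%05d" % i) for i in range(NUM_FILES)]

    # phase 1: create and write many small files
    t0 = time.time()
    for p in paths:
        f = open(p, "wb")
        f.write(PAYLOAD)
        f.close()
    t1 = time.time()

    # phase 2: stat them all - a compile spends much of its time here
    for p in paths:
        os.stat(p)
    t2 = time.time()

    print("create+write: %.2fs  stat: %.2fs" % (t1 - t0, t2 - t1))

if __name__ == "__main__":
    bench()

Run it twice on each mount (cold and warm) - the stat phase is where the difference showed up for us.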

Assuming you run both configurations (nfs-wafl vs. gfs-san) on the very same netapp box (?) ...

Both configurations have their pros and cons. The wafl-nfs runs in native mode, which certainly has its advantages - you've made a good choice - but the latter (gfs-on-netapp san) can work well in other situations. The biggest problem with your original configuration is the load balancer. Round-robin scheduling (and its variants) will not work well if you have a write-intensive workload that needs to fight for locks between multiple GFS nodes. IIRC, there are gfs customers running build-compile development environments. They normally assign groups of users to different GFS nodes, say user ids starting with a-e on node 1, f-j on node 2, etc.
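To make that concrete, here is a toy sketch of the static assignment (node names and the id-range rule are made up, purely illustrative):

# Pin each user to one GFS node by login name, instead of round-robin.
# Node names are hypothetical; the point is that a given user's home
# directory is always touched from the same node, so the locks for
# those files stay cached on that node instead of bouncing around.

NODES = ["gfs-node1", "gfs-node2", "gfs-node3"]

def node_for_user(username):
    first = username[0].lower()
    if "a" <= first <= "e":
        return NODES[0]
    if "f" <= first <= "j":
        return NODES[1]
    return NODES[2]

for user in ["alice", "george", "zoe"]:
    print("%s -> %s" % (user, node_for_user(user)))

The LVS director (or simply login scripts / automount maps) would apply a rule like this instead of round-robin, so the write locks for a given home directory are never fought over by multiple nodes.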

One piece of encouraging news from this email is that gfs-netapp-san runs well on bonnie. GFS1 has struggled with bonnie (large numbers of small files within one single node) for a very long time. One of the reasons is that its block allocation tends to get spread across the disk whenever there are resource group contentions. It is very difficult for the Linux IO scheduler to merge these blocks within one single server. When the workload becomes IO-bound, the locks are subsequently stalled and everything starts to snowball after that. NetApp SAN has one more layer of block allocation indirection within its firmware and its write speed is "phenomenal" (I'm borrowing your words ;) ), mostly to do with the NVRAM where it can aggressively cache write data - this helps GFS relieve its small-file issue quite well.
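If you want to see the scattered-allocation effect in isolation, here is a toy sketch (path and sizes are invented) that contrasts sequential with scattered 4 KiB writes, fsyncing each one so the IO actually reaches the disk. On a rotating disk the scattered pass pays a seek for nearly every block - roughly the penalty GFS pays when its allocations are spread out and the scheduler has nothing contiguous to merge:

import os
import random
import time

PATH = "/mnt/test/alloc-demo.dat"  # hypothetical file on the fs under test
BLOCK = 4096
COUNT = 2048

def timed_writes(offsets):
    f = open(PATH, "r+b")
    t0 = time.time()
    for off in offsets:
        f.seek(off)
        f.write(b"x" * BLOCK)
        f.flush()
        os.fsync(f.fileno())  # force each block out instead of batching
    f.close()
    return time.time() - t0

# preallocate so both passes rewrite the same set of blocks
f = open(PATH, "wb")
f.write(b"\0" * (BLOCK * COUNT))
f.close()

seq = [i * BLOCK for i in range(COUNT)]
rnd = list(seq)
random.shuffle(rnd)

print("sequential: %.2fs" % timed_writes(seq))
print("scattered:  %.2fs" % timed_writes(rnd))

The scattered pattern is roughly what GFS1's spread-out allocation hands to the IO stack - and it is exactly the pattern the filer's NVRAM absorbs so well.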

-- Wendy
