At 11:02 PM 2/20/2009, Jordan Mendler wrote:
I am prototyping GlusterFS with ~50-60TB of raw disk space across
non-raided disks in ~30 compute nodes. I initially separated the
nodes into pairs and replicated each single drive onto its counterpart
in the other server of the pair. Next I striped across the 33
resulting AFR groups, first with a block size of 1MB and later with
the default block size. With these configurations I am only seeing
throughput of about 15-25 MB/s, despite a full Gig-E network.
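For concreteness, the client-side volfile is structured roughly like
the sketch below; the hostnames (node1, node2) and brick names are
placeholders, and only one of the 33 AFR pairs is written out:

  # one protocol/client volume per remote drive
  volume node1-drive1
    type protocol/client
    option transport-type tcp
    option remote-host node1          # placeholder hostname
    option remote-subvolume brick1    # export defined in the server volfile
  end-volume

  volume node2-drive1
    type protocol/client
    option transport-type tcp
    option remote-host node2
    option remote-subvolume brick1
  end-volume

  # replicate each drive across the pair of servers
  volume afr1
    type cluster/replicate
    subvolumes node1-drive1 node2-drive1
  end-volume

  # ... afr2 through afr33 are defined the same way ...

  # stripe across all of the resulting AFR groups
  volume stripe0
    type cluster/stripe
    option block-size 1MB             # also tested with the default (128KB)
    subvolumes afr1 afr2 afr3         # ... through afr33
  end-volume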
What is generally the recommended configuration in a large striped
environment? I am wondering if the number of nodes in the stripe is
causing too much overhead, or if the bottleneck is likely somewhere
else. In addition, I saw a thread on the list indicating that it is
better to replicate across stripes rather than stripe across
replicates. Does anyone have any comments or opinions on this?
I think that's all guesswork; I'm not sure anyone has done a thorough
test of those choices with Gluster 2.0.
Personally, from a data management perspective, I'd rather replicate,
then stripe, so that I know each node in a replica pair holds exactly
the same data. If you stripe first and then replicate, I imagine a
given piece of data could sit on one node in one stripe set but on
two nodes in another, which becomes a problem if you ever have to
take the cluster apart or deal with the raw data later.
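To make the contrast concrete, flipping the order would look something
like this sketch (same placeholder brick names as above, two stripe
sets shown):

  # stripe across one drive per server on each half first...
  volume stripe-a
    type cluster/stripe
    subvolumes node1-drive1 node3-drive1 node5-drive1   # ... odd-numbered nodes
  end-volume

  volume stripe-b
    type cluster/stripe
    subvolumes node2-drive1 node4-drive1 node6-drive1   # ... even-numbered nodes
  end-volume

  # ...then replicate the two stripe sets against each other
  volume mirror0
    type cluster/replicate
    subvolumes stripe-a stripe-b
  end-volume

Nothing in that arrangement ties a given drive on side A to a specific
drive on side B, which is the failure mode I mean: the same data living
on one node in one stripe set but spread across two nodes in the other.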
However, if you have the time, it'd be great to see the results of
your testing with a 15-node stripe and a 10-node stripe, to see how
those numbers compare with the 30-node stripe you have now.
Then flip the replication order and run the same tests again.
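If you run those, a plain dd pass from a client mount gives a
first-order number; the mount point and file size below are
placeholders:

  # sequential write: push 4GB through the stripe, syncing at the end
  dd if=/dev/zero of=/mnt/gluster/testfile bs=1M count=4096 conv=fsync

  # drop the page cache (as root) so the read actually hits the network
  echo 3 > /proc/sys/vm/drop_caches

  # sequential read back
  dd if=/mnt/gluster/testfile of=/dev/null bs=1M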
Keith
_______________________________________________
Gluster-users mailing list
[email protected]
http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users