Re: [DRBD-user] Need assistance with a performance problem

listslut Tue, 09 Aug 2011 20:53:54 -0700

The only other 'overt' difference was that Fedora is on ext4. I don'tknow that I'll test Fedora with ext3. I'd have to do too much to getRHEL5 to run on ext4 at this point and getting my existing RHEL vm's tohave decent write performance was the goal anyhow.

Ken


On 09/08/11 05:58 PM, Zev Weiss wrote:

On Aug 9, 2011, at 7:50 AM, Jean-Francois Chevrette wrote:
Hi everyone,
we have this fairly simple setup where we have two CentOS 5.5 nodesrunning xen 3.4.2 compiled from sources (kernel 2.6.18-xen) and DRBD8.3.7 also compiled from sources. Both nodes have two data partitionswhich are synced by DRBD. Each node is running a single VM fromeither of the partitions in a standard Primary/Secondary mode. Thisway each node can fully utilize its CPU and memory resources and westill have storage failover capabilities. The VMs are using the drbddevices directly (no LVM and such). Both nodes are connected througha gigabit ethernet port and a crossover cable.
Over time as the VM resource usage raised it started behavingstrangely. After investigating, everything points to an IO problem asread and writes are very slow.
My tests have shows that while the DRBD replication is connected andrunning, IO performance is very bad. Not only is it bad inside the VMbut also on the host node. This is as if DRBD would cause theunderlying IO subsystem to become very slow. Now I should say thatthe servers are using Adaptec 5405 raid cards with BBUs and writecache enabled. As for disks, we have 4x SATA drives configured as aRAID-10.
As soon as I disconnect DRBD, the IO performance is way better bothinside and outside the VMs.
<snip>
Hi Jean-Francois,
I have also been having major performance problems using a similarsetup. One thing that makes me thing there might be two differentproblems at hand here though is that you report both reads and writesbeing slow -- for me, read performance has been OK, but DRBD slowsdown my disk writes enormously.
Have you tried running the throughput & latency testing scripts in theDRBD user guide? If so I'd be curious to see what results you get.On my system I get about 50% of the throughput via the DRBD devicethat I get on the underlying LVM volume, and I get about a 100xincrease in latency via DRBD as compared to the raw LogVol, so mysystems get almost completely unresponsive when MySQL starts doinglots of small writes (for example I've measured syslog's fsync()staking 5-10 full seconds to complete).
My current theory is that this may be some nasty interaction with the2.6.18-based Red Hat (or CentOS, in your case) kernel, since that'swhat I'm running and another poster here said he'd been getting poorperformance on a RH system but good performance on Fedora (with anewer kernel). I'm currently making an attempt at trying it on avanilla 3.0.1 kernel I compiled from a kernel.org source tarball andxen 4.1.1 (also compiled from source), but I'm not sure if I'm goingto be able to get a full two-node system set up that way in order toreally do a comprehensive test.
If you find out anything more about it or discover a solution, pleasedo post to the list!
Thanks,
Zev Weiss

_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user


_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Re: [DRBD-user] Need assistance with a performance problem

Reply via email to