On 26 Jan 2010, at 12:00 pm, Jonathan Aquilina wrote:

does anyone have any benchmarks for I/O in a virtualized cluster?

I don't have formal benchmarks, but I can tell you what I see on my VMware virtual machines in general:

Network I/O is reasonably fast - there's some additional latency, but nothing particularly severe. VMware can special-case communication between VMs on the same physical host, if required, but that reduces flexibility in moving the VMs around.

Disk I/O is fairly poor, especially once the number of virtual machines becomes large. This is hardly surprising - the VMs are contending for shared resources, and there's bound to be more contention in a virtualised setup than in physical machines.
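I don't have numbers to hand, but if you want a rough feel rather than a formal benchmark, something like the following run inside a guest and then on a comparable physical box shows the overhead for streaming I/O (the path and sizes are just placeholders, and fio will give you far better small-random-I/O figures if you have it installed):

    # sequential write, with O_DIRECT so you measure the storage path rather than the page cache
    dd if=/dev/zero of=/tmp/ddtest bs=1M count=1024 oflag=direct
    # sequential read back
    dd if=/tmp/ddtest of=/dev/null bs=1M iflag=direct
    rm /tmp/ddtest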

In our case (~170 virtual machines running on 9 physical servers, each of which has dual GigE for VM traffic and dual-port fibrechannel) I've noted a few things. Forgive me for using VMware parlance rather than Xen, but hopefully the ideas will be the same:

1) Applications whose I/O pattern is a large number of small disk operations are particularly painful (such as our ganglia server, with its thousands of tiny updates to RRD files). We've mitigated this by configuring Linux on that guest to allow a much larger proportion of dirty pages than usual, and not to flush to disk quite so often (see the sysctl sketch after this list). I risk losing more data if the VM goes pop, but as this is just ganglia graphing I don't care too much in that particular case.

2) Raw device maps (where you pass a LUN straight through to a single virtual machine, rather than carving the disk out of a datastore) reduce contention and increase performance somewhat, at the cost of using up device minor numbers on ESX quite quickly; because ESX is basically Linux, you're limited to 256 (I think - it might be 128) LUNs presented to each host, and probably to each cluster, since VMs need to be able to migrate. I basically use RDMs for database applications where the storage requirements are greater than about 500 GB. For less than that I use datastores.

3) Keep the number of virtual machines per datastore quite low, especially if the applications are I/O heavy, to reduce contention.

4) In an ideal world I'd spread the datastores over a larger number of RAID units than I currently have, but my budget can't stand that.
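For the dirty-page tuning mentioned in 1), the knobs are the standard Linux VM sysctls. The values below are purely illustrative, not what we actually run:

    # /etc/sysctl.conf on the ganglia guest (illustrative values)
    vm.dirty_ratio = 60                  # let dirty pages reach 60% of RAM before writers are forced to flush
    vm.dirty_background_ratio = 40       # start background writeback later than the default
    vm.dirty_expire_centisecs = 6000     # dirty data may sit for 60s before it must be written out
    vm.dirty_writeback_centisecs = 1500  # wake the writeback threads every 15s instead of every 5s

Load them with 'sysctl -p' (or echo values into /proc/sys/vm/). The caveat from 1) applies: the more you let accumulate in memory, the more you stand to lose when the VM goes pop.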

All this is rather dependent of course on what technology you're using to provide storage to your virtual machines. We're using fibrechannel, but of course mileage may vary considerably if you use NAS or iSCSI, and depending on how many NICs you're bonding together to get bandwidth.
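Purely as an illustration of the bonding point (the interface names and the choice of 802.3ad are assumptions, and your switch needs to support LACP for that mode), aggregating two GigE ports on a Linux client with the standard bonding driver looks something like:

    # /etc/modprobe.conf (or a file under modprobe.d) - illustrative
    alias bond0 bonding
    options bond0 mode=802.3ad miimon=100

    # bring up the bond and enslave the physical NICs
    ifconfig bond0 192.168.1.10 netmask 255.255.255.0 up
    ifenslave bond0 eth0 eth1

Bear in mind that bonding aggregates bandwidth across multiple streams; a single TCP connection still only sees one link's worth.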




--
The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE.