On 02/15/2016 01:30 AM, Benjamin Bergia wrote:
> Hi,
>
> I recently noticed that all the packages used by SDC zones are using some strange settings. All of them, even database ones, are using 1 vCPU with a cap of 400. I can picture in my head the "meaning" of the CPU cap when the cap is lower than the sum of the CPU percentages. So something like 4 vCPUs and a cap of 200 makes sense to me.
>
> Can somebody explain to me what happens with a setting of, let's say, 1 vCPU and a cap of 200?
>
> In the SDC case, what was the idea behind this kind of setting? Does it give any performance/portability/other improvement?
>
> I am rethinking my packages, and where I previously used settings like I would on vSphere, I am now wondering whether I am doing it totally wrong.

There are 3 important concepts: vcpus, cpu_shares, and cpu_cap. From the vmadm man page (my further comments are below that):

       vcpus:

           For KVM VMs this parameter defines the number of virtual CPUs
           the guest will see. Generally recommended to be a multiple of 2.

           type: integer (number of CPUs)
           vmtype: KVM
           listable: yes
           create: KVM only
           update: KVM only (requires VM reboot to take effect)
           default: 1

       cpu_shares:

           Sets a limit on the number of fair share scheduler (FSS) CPU
           shares for a VM. This value is relative to all other VMs on the
           system, so a value only has meaning in relation to other VMs. If
           you have one VM with a value of 10 and another with a value of
           50, the VM with 50 will get 5x as much time from the scheduler
           as the one with 10 when there is contention.

           type: integer (number of shares)
           vmtype: OS,KVM
           listable: yes
           create: yes
           update: yes (live update)
           default: 100

       cpu_cap:

           Sets a limit on the amount of CPU time that can be used by a VM.
           The unit used is the percentage of a single CPU that can be used
           by the VM. Eg. a value of 300 means up to 3 full CPUs.

           type: integer (percentage of single CPUs)
           vmtype: OS,KVM
           listable: yes
           create: yes
           update: yes (live update)


First, note that "vcpus" from a SmartOS perspective only applies to KVM VMs. That setting determines how many processors the VM can see and thus can use to schedule its processes. We'll come back to that.
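
You can see this split on a SmartOS box directly; a quick check (a sketch, assuming a box with a mix of OS zones and KVM VMs) would be:

    # vcpus only shows a value for KVM VMs; OS zones leave it blank
    vmadm list -o uuid,type,brand,vcpus,cpu_shares,cpu_cap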

Zones (both the kind containing the QEMU process for KVM VMs and regular LX- and joyent-branded ones) can see all of the physical processors, and the OS can schedule processes on any of them. If you imagine yourself as a multi-tenant cloud provider, you'll quickly realize that you need two things:

1. Fairness (preventing noisy neighbors) when the system is fully loaded. This is what cpu_shares does. If you give every zone the appropriate number of shares, they will all get a proportional amount of system CPU when the system is fully loaded.

2. Paying for what you get. On a system that is *not* fully loaded, a zone could in theory use lots and lots of CPUs. Customers would be incentivized to create and destroy zones until they found one that could use lots of free CPU. This is where CPU caps come in: they ensure that on a system that is *not* fully loaded, the zone can only burst up to a reasonable amount relative to what the customer is paying.

This also helps manage expectations. Setting a CPU cap reasonably close to the amount of CPU the customer gets when the system *is* fully loaded means people are less likely to *think* they are suffering from noisy neighbors (when the delta between how much CPU you get on a fully loaded vs. a fully unloaded system is small, you see more consistent performance).
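
To make the fairness arithmetic concrete: with three zones holding 300, 100, and 100 shares, the 300-share zone gets 300/(300+100+100) = 60% of the machine under full contention. A sketch of setting that up (the UUID is a placeholder, and this assumes the other zones keep the default of 100 shares):

    # weight one zone at 3x each of its neighbors; cpu_shares is a live
    # update and only matters when the system is under contention
    vmadm update 01234567-89ab-cdef-0123-456789abcdef cpu_shares=300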

I haven't looked at the details of the SDC packages, but I can confidently say that "vcpus" in the context of a joyent-branded zone is an approximation of what to expect based on the shares and the cap (as opposed to a KVM VM, where it's literally the number of processors the VM will see).

So if you have an SDC service zone with "1 vCPU and a cap of 200", it's getting shares such that when the system is fully loaded it should get approximately 1 CPU's worth of CPU time from the scheduler, but when the system is not fully loaded it should be able to get up to 2 CPUs' worth of CPU time, and no more. The difference between those two is what the Joyent cloud advertises as "bursting".
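
Since cpu_cap is a live update, you can adjust that burst ceiling without a reboot. A hypothetical example (the UUID is a placeholder):

    # raise the burst ceiling from 2 CPUs' worth to 4 CPUs' worth
    vmadm update 01234567-89ab-cdef-0123-456789abcdef cpu_cap=400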

Coming back for one last moment to KVM VMs, remember that the QEMU process is running in a zone that can have shares and caps. Additionally, when the VM does I/O, QEMU threads need to be scheduled to do some (overhead) work to make that happen. So in theory you might want your shares and caps to be slightly more than what the number of vCPUs alone would suggest (e.g., for something performance-critical that *has* to live in a VM, you could imagine setting vcpus to 8 but wanting cpu_cap to be 900, with shares that give you some extra CPU time when the system is fully loaded).
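
A sketch of that sizing applied to an existing KVM VM (the UUID is a placeholder, and the share value is an assumption that only means something relative to the other zones on your box):

    # cap at 9 CPUs' worth so QEMU's I/O threads have headroom beyond
    # the 8 guest vCPUs; both of these are live updates (changing vcpus
    # itself would require a VM reboot, so we leave it alone here)
    vmadm update 01234567-89ab-cdef-0123-456789abcdef cpu_cap=900
    vmadm update 01234567-89ab-cdef-0123-456789abcdef cpu_shares=800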

Finally let's see if I can answer your questions:

> All of them, even database ones, are using 1 vCPU with a cap of 400.

They are configured so that when the system is fully loaded they should still get about 1 CPU's worth of CPU time, but if the system isn't fully loaded they can "burst" up to 4 CPUs' worth, and no more.

> I can picture in my head the "meaning" of the CPU cap when the cap is lower than the sum of the CPU percentages. So something like 4 vCPUs and a cap of 200 makes sense to me.

I find that confusing both for KVM VMs and for regular zones. The cap would ensure that you never get more than 2 CPUs' worth of compute time, and a KVM VM that thinks it has 4 processors but can never get more than 2 processors' worth of work done seems like a bad idea.

> Can somebody explain to me what happens with a setting of, let's say, 1 vCPU and a cap of 200?

For a KVM VM the guest would see 1 processor but would still have headroom for the I/O overhead from QEMU. For a regular zone it's as before: you get 1 CPU's worth when the system is fully loaded, but can burst up to 2 when there are spare cycles (and no more than that).
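
If you want to see what the kernel is actually enforcing for a given zone, the underlying resource controls are visible with prctl (the zone name is a placeholder):

    # zone.cpu-cap is in percent of a single CPU (200 = 2 CPUs' worth);
    # zone.cpu-shares is the FSS weight
    prctl -n zone.cpu-cap -i zone myzone
    prctl -n zone.cpu-shares -i zone myzone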

> In the SDC case, what was the idea behind this kind of setting? Does it give any performance/portability/other improvement?

The punchline comes down to "bursting". If you think your workloads are bursty then you want to leave some extra space in the caps so that the zones can take advantage of otherwise wasted cycles when they need them, but you also want to ensure fairness under load.
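
One way to sanity-check whether you've left enough burst headroom: the kernel exports per-zone cap statistics through kstat (a sketch; zone ID 5 is a placeholder, find yours with "zoneadm list -v"):

    # compare "usage" against "value" (the cap); if above_sec keeps
    # climbing, the zone is spending real time pinned at its cap
    kstat -m caps -n cpucaps_zone_5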

Hopefully this was helpful.

-Nahum

