On Mon, Nov 14, 2022 at 11:12 AM <martin.dec...@ora-solutions.net> wrote:

> Hello List,
>
> how can I achieve the best performance with vcpu pinning in KVM?
>
> Is it better to have 1:1 mapping between virtual and physical thread like
> this:
>
>     <vcpupin vcpu='0' cpuset='0'/>
>     <vcpupin vcpu='1' cpuset='1'/>
>     <vcpupin vcpu='2' cpuset='2'/>
>     <vcpupin vcpu='3' cpuset='3'/>
>     <vcpupin vcpu='4' cpuset='4'/>
>     <vcpupin vcpu='5' cpuset='5'/>
>     <vcpupin vcpu='6' cpuset='6'/>
>     <vcpupin vcpu='7' cpuset='7'/>
>
> Or is it better to allow each vCPU to run on any of the limited number of
> physical threads?
>
> e.g.
>     <vcpupin vcpu='0' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='1' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='2' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='3' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='4' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='5' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='6' cpuset='0,1,2,3,4,5,6,7'/>
>     <vcpupin vcpu='7' cpuset='0,1,2,3,4,5,6,7'/>
>
>
> The hypervisor host has 2 CPUs. Each CPU has 8 cores and each core 2
> Threads. In total this are 32 threads.
>
>  lscpu
> Architecture:          x86_64
> CPU op-mode(s):        32-bit, 64-bit
> Byte Order:            Little Endian
> CPU(s):                32
> On-line CPU(s) list:   0-31
> Thread(s) per core:    2
> Core(s) per socket:    8
> Socket(s):             2
> NUMA node(s):          2
> Vendor ID:             GenuineIntel
> CPU family:            6
> Model:                 85
> Model name:            Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz
> Stepping:              7
> CPU MHz:               4213.731
> CPU max MHz:           4400.0000
> CPU min MHz:           1200.0000
> BogoMIPS:              7200.00
> Virtualization:        VT-x
> L1d cache:             32K
> L1i cache:             32K
> L2 cache:              1024K
> L3 cache:              25344K
> NUMA node0 CPU(s):     0-7,16-23
> NUMA node1 CPU(s):     8-15,24-31
>
> I need to run 2 VMs on this host. One is for production and should have 24
> vCPUs and the other one is for test and should have 8 vCPUs. Test VM
> workload should not impact Prod VM performance.
>
> Current configuration is:
>
> TEST:
>
>   <vcpu placement='static' current='8'>128</vcpu>
>   <iothreads>1</iothreads>
>   <cputune>
>     <vcpupin vcpu='0' cpuset='0'/>
>     <vcpupin vcpu='1' cpuset='1'/>
>     <vcpupin vcpu='2' cpuset='2'/>
>     <vcpupin vcpu='3' cpuset='3'/>
>     <vcpupin vcpu='4' cpuset='4'/>
>     <vcpupin vcpu='5' cpuset='5'/>
>     <vcpupin vcpu='6' cpuset='6'/>
>     <vcpupin vcpu='7' cpuset='7'/>
>   </cputune>
>
>   <cpu mode='host-passthrough' check='none'>
>     <topology sockets='16' cores='4' threads='2'/>
>     <numa>
>       <cell id='0' cpus='0-7' memory='134217728' unit='KiB'/>
>     </numa>
>   </cpu>
>
> PROD:
>
>   <vcpu placement='static' current='24'>192</vcpu>
>   <iothreads>1</iothreads>
>   <cputune>
>     <vcpupin vcpu='0' cpuset='8'/>
>     <vcpupin vcpu='1' cpuset='9'/>
>     <vcpupin vcpu='2' cpuset='10'/>
>     <vcpupin vcpu='3' cpuset='11'/>
>     <vcpupin vcpu='4' cpuset='12'/>
>     <vcpupin vcpu='5' cpuset='13'/>
>     <vcpupin vcpu='6' cpuset='14'/>
>     <vcpupin vcpu='7' cpuset='15'/>
>     <vcpupin vcpu='8' cpuset='16'/>
>     <vcpupin vcpu='9' cpuset='17'/>
>     <vcpupin vcpu='10' cpuset='18'/>
>     <vcpupin vcpu='11' cpuset='19'/>
>     <vcpupin vcpu='12' cpuset='20'/>
>     <vcpupin vcpu='13' cpuset='21'/>
>     <vcpupin vcpu='14' cpuset='22'/>
>     <vcpupin vcpu='15' cpuset='23'/>
>     <vcpupin vcpu='16' cpuset='24'/>
>     <vcpupin vcpu='17' cpuset='25'/>
>     <vcpupin vcpu='18' cpuset='26'/>
>     <vcpupin vcpu='19' cpuset='27'/>
>     <vcpupin vcpu='20' cpuset='28'/>
>     <vcpupin vcpu='21' cpuset='29'/>
>     <vcpupin vcpu='22' cpuset='30'/>
>     <vcpupin vcpu='23' cpuset='31'/>
>   </cputune>
>
>   <cpu mode='host-passthrough' check='none'>
>     <topology sockets='16' cores='6' threads='2'/>
>     <numa>
>       <cell id='0' cpus='0-23' memory='889192448' unit='KiB'/>
>     </numa>
>   </cpu>
>
> Versions are:
> ovirt-host-4.3.5
> libvirt-5.7.0-28.el7.x86_64
>
Hi Martin,
The recommended approach would be to pin the CPUs into specific physical
CPUs.
The most important part is to be under the same socket.
VDSM uses physical CPU 1, unless you changed the default VDSM settings, so
maybe it would be better to use the second socket.
You may also wish to refer to the NUMA topology (using socket 1 that would
be - NUMA node1 CPU(s):     8-15,24-31).
Overall:
0#8,24_1#9,25_2#10,26 and so on.
This is also with the thought that physical cpu 8 and 24 are two threads in
the same core. You can check it using VDSM API or just # cat /proc/cpuinfo.
Note, that physical CPUs you use with this method are in a shared pool and
can be used by other VMs.
If you wish it to be exclusive to the VM, you may use the dedicated CPU
feature under the VM resource allocation tab. Note, it will pin the CPUs
for you.

Regards,
Liran.


> Thanks in advance,
> Martin
> _______________________________________________
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/HCNWYSIBMUY7Q3YU25KQL2MUCK42GMJ6/
>
_______________________________________________
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/UDWR2WVOLY7SCEXQHZQJJK7RMT4HBE3B/

Reply via email to