[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-18 Thread Milan Zamazal
Gianluca Cecchi  writes:

> On Thu, Dec 17, 2020 at 5:30 PM Milan Zamazal  wrote:
>
>> Gianluca Cecchi  writes:
>>
>> > On Wed, Dec 16, 2020 at 8:59 PM Milan Zamazal 
>> wrote:
>> >
>> >>
>> >> If the checkbox is unchecked, the migration shouldn't be prevented.
>> >> I think the TSC frequency shouldn't be written to the VM domain XML in
>> >> such a case and then there should be no restrictions (and no guarantees)
>> >> on the frequency.
>> >>
>> >> Do you mean you can't migrate even with the checkbox unchecked?  If so,
>> >> what error message do you get in such a case?
>> >>
>> >
>> > Yes, exactly.
>> > I powered off the VM and then disabled the check and then powered on the
>> VM
>> > again, that is running on host ov301. ANd I have other two hosts: ov300
>> and
>> > ov200.
>> > From web admin gui if I select the VM and "migrate" button I cannot
>> select
>> > the destination host and inside the bix there is the words "No available
>> > host to migrate VMs to" and going to engine.log, as soon as I click the
>> > "migrate" button I see these new lines:
>>
>> I see, I can reproduce it.  It looks like a bug in Engine.  While the VM
>> is correctly started without TSC frequency set, the migration filter in
>> Engine apparently still applies.
>>
>> I'll add a note about it to the TSC migration bug.
>>
>> Regards,
>> Milan
>>
>>
> Ok, thanks.
> In the meantime do I have any sort of workaround to be able to migrate the
> VM? Eg I could set the VM as non High Performance, or any better other
> option?

Non high performance VMs should migrate fine, but changing the VM kind
requires restart.  Once a high performance VM is running, I don't know
about any good way to avoid the TSC constraint.

Regards,
Milan
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/7UBMWPGKTPBOXB26YZMFZLPOXUZ3GEYE/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-17 Thread Gianluca Cecchi
On Thu, Dec 17, 2020 at 5:30 PM Milan Zamazal  wrote:

> Gianluca Cecchi  writes:
>
> > On Wed, Dec 16, 2020 at 8:59 PM Milan Zamazal 
> wrote:
> >
> >>
> >> If the checkbox is unchecked, the migration shouldn't be prevented.
> >> I think the TSC frequency shouldn't be written to the VM domain XML in
> >> such a case and then there should be no restrictions (and no guarantees)
> >> on the frequency.
> >>
> >> Do you mean you can't migrate even with the checkbox unchecked?  If so,
> >> what error message do you get in such a case?
> >>
> >
> > Yes, exactly.
> > I powered off the VM and then disabled the check and then powered on the
> VM
> > again, that is running on host ov301. ANd I have other two hosts: ov300
> and
> > ov200.
> > From web admin gui if I select the VM and "migrate" button I cannot
> select
> > the destination host and inside the bix there is the words "No available
> > host to migrate VMs to" and going to engine.log, as soon as I click the
> > "migrate" button I see these new lines:
>
> I see, I can reproduce it.  It looks like a bug in Engine.  While the VM
> is correctly started without TSC frequency set, the migration filter in
> Engine apparently still applies.
>
> I'll add a note about it to the TSC migration bug.
>
> Regards,
> Milan
>
>
Ok, thanks.
In the meantime do I have any sort of workaround to be able to migrate the
VM? Eg I could set the VM as non High Performance, or any better other
option?

Gianluca
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/SEBFTUV6W5EPEJVEZPZ6QT26DXCHD67W/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-17 Thread Milan Zamazal
Gianluca Cecchi  writes:

> On Wed, Dec 16, 2020 at 8:59 PM Milan Zamazal  wrote:
>
>>
>> If the checkbox is unchecked, the migration shouldn't be prevented.
>> I think the TSC frequency shouldn't be written to the VM domain XML in
>> such a case and then there should be no restrictions (and no guarantees)
>> on the frequency.
>>
>> Do you mean you can't migrate even with the checkbox unchecked?  If so,
>> what error message do you get in such a case?
>>
>
> Yes, exactly.
> I powered off the VM and then disabled the check and then powered on the VM
> again, that is running on host ov301. ANd I have other two hosts: ov300 and
> ov200.
> From web admin gui if I select the VM and "migrate" button I cannot select
> the destination host and inside the bix there is the words "No available
> host to migrate VMs to" and going to engine.log, as soon as I click the
> "migrate" button I see these new lines:

I see, I can reproduce it.  It looks like a bug in Engine.  While the VM
is correctly started without TSC frequency set, the migration filter in
Engine apparently still applies.

I'll add a note about it to the TSC migration bug.

Regards,
Milan

> 2020-12-16 23:13:27,949+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-41)
> [308a29e2-2c4f-45fe-bdce-b032b36d4656] Candidate host 'ov300'
> ('07b979fb-4779-4477-89f2-6a96093c06f7') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
> 2020-12-16 23:13:27,949+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-41)
> [308a29e2-2c4f-45fe-bdce-b032b36d4656] Candidate host 'ov200'
> ('949d0087-2c24-4759-8427-f9eade1dd2cc') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
> 2020-12-16 23:13:28,032+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-38)
> [5837b695-c70d-4f45-a452-2c7c1b4ea69b] Candidate host 'ov300'
> ('07b979fb-4779-4477-89f2-6a96093c06f7') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
> 2020-12-16 23:13:28,032+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-38)
> [5837b695-c70d-4f45-a452-2c7c1b4ea69b] Candidate host 'ov200'
> ('949d0087-2c24-4759-8427-f9eade1dd2cc') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
>
> On all three nodes I have this kind of running kernel and package versions:
>
> [root@ov300 vdsm]# rpm -q qemu-kvm libvirt-daemon systemd
> qemu-kvm-4.2.0-34.module_el8.3.0+555+a55c8938.x86_64
> libvirt-daemon-6.0.0-28.module_el8.3.0+555+a55c8938.x86_64
> systemd-239-41.el8_3.x86_64
>
> and
> [root@ov300 vdsm]# uname -r
> 4.18.0-240.1.1.el8_3.x86_64
> [root@ov300 vdsm]#
>
> Gianluca
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WX4PA75LAOXIN6PKYDTZ5UZ4OMZICXEY/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-16 Thread Gianluca Cecchi
On Wed, Dec 16, 2020 at 8:59 PM Milan Zamazal  wrote:

>
> If the checkbox is unchecked, the migration shouldn't be prevented.
> I think the TSC frequency shouldn't be written to the VM domain XML in
> such a case and then there should be no restrictions (and no guarantees)
> on the frequency.
>
> Do you mean you can't migrate even with the checkbox unchecked?  If so,
> what error message do you get in such a case?
>

Yes, exactly.
I powered off the VM and then disabled the check and then powered on the VM
again, that is running on host ov301. ANd I have other two hosts: ov300 and
ov200.
>From web admin gui if I select the VM and "migrate" button I cannot select
the destination host and inside the bix there is the words "No available
host to migrate VMs to" and going to engine.log, as soon as I click the
"migrate" button I see these new lines:

2020-12-16 23:13:27,949+01 INFO
 [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-41)
[308a29e2-2c4f-45fe-bdce-b032b36d4656] Candidate host 'ov300'
('07b979fb-4779-4477-89f2-6a96093c06f7') was filtered out by
'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
id: null)
2020-12-16 23:13:27,949+01 INFO
 [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-41)
[308a29e2-2c4f-45fe-bdce-b032b36d4656] Candidate host 'ov200'
('949d0087-2c24-4759-8427-f9eade1dd2cc') was filtered out by
'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
id: null)
2020-12-16 23:13:28,032+01 INFO
 [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-38)
[5837b695-c70d-4f45-a452-2c7c1b4ea69b] Candidate host 'ov300'
('07b979fb-4779-4477-89f2-6a96093c06f7') was filtered out by
'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
id: null)
2020-12-16 23:13:28,032+01 INFO
 [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-38)
[5837b695-c70d-4f45-a452-2c7c1b4ea69b] Candidate host 'ov200'
('949d0087-2c24-4759-8427-f9eade1dd2cc') was filtered out by
'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
id: null)

On all three nodes I have this kind of running kernel and package versions:

[root@ov300 vdsm]# rpm -q qemu-kvm libvirt-daemon systemd
qemu-kvm-4.2.0-34.module_el8.3.0+555+a55c8938.x86_64
libvirt-daemon-6.0.0-28.module_el8.3.0+555+a55c8938.x86_64
systemd-239-41.el8_3.x86_64

and
[root@ov300 vdsm]# uname -r
4.18.0-240.1.1.el8_3.x86_64
[root@ov300 vdsm]#

Gianluca
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/X3UOO7RAE2GIHUYTD6YRUYP75UN53ALG/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-16 Thread Milan Zamazal
Gianluca Cecchi  writes:

> On Fri, Dec 11, 2020 at 5:39 PM Milan Zamazal  wrote:
>
>>
>>
>> TSC frequency is the frequency with which Time Stamp Counter register is
>> updated, typically a nominal CPU frequency (see
>> https://en.wikipedia.org/wiki/Time_Stamp_Counter for more details).
>>
>> You can check the value oVirt gets from libvirt by running
>>
>>   # virsh -r capabilities
>>
>> and looking at the line like
>>
>>   
>>
>> in the output.  Unless frequency scaling is available, the host
>> frequencies must be almost the same in order to be able to migrate high
>> performance VMs among them.
>>
>> Note there is a bug that may cause a migration failure for the VMs even
>> between hosts with the same frequencies
>> (https://bugzilla.redhat.com/1821199).  But this is apparently not your
>> case, since the migration is prevented already by Engine.
>>
>> Regards,
>> Milan
>>
>>
>>
> See here:
>
> [root@ov200 ~]# virsh -r capabilities | grep "name='tsc'"
>   
> [root@ov200 ~]#
>
> [root@ov300 ~]# virsh -r capabilities | grep "name='tsc'"
>   
> [root@ov300 ~]#
>
> [root@ov301 ~]# virsh -r capabilities | grep "name='tsc'"
>   
> [root@ov301 ~]#
>
> The three hosts have the same model cpu
> Model name:  Intel(R) Xeon(R) CPU   X5690  @ 3.47GHz
> and slightly different actual frequencies at a certain moment...

OK, so this is actually https://bugzilla.redhat.com/1821199.

> But what does it mean so the checkbox
>
> Migrate only to hosts with the same TSC frequency
>
> if even if I don't check it the migration is prevented?

If the checkbox is unchecked, the migration shouldn't be prevented.
I think the TSC frequency shouldn't be written to the VM domain XML in
such a case and then there should be no restrictions (and no guarantees)
on the frequency.

Do you mean you can't migrate even with the checkbox unchecked?  If so,
what error message do you get in such a case?

> BTW the command lscpu produces exactly the same output on the three hosts,
> apart "CPU MHz" and corresponding "BogoMIPS" that slightly change each time
> I run the command.

Yes, the TSC frequency is measured on each boot and may differ across
reboots on the same host.

> And the flags for all are:
>
> Flags:   fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge
> mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx
> pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl
> xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl
> vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 popcnt aes
> lahf_lm pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid
> dtherm ida arat flush_l1d
>
> Gianluca

Regards,
Milan
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/ALBUWFG7ZHRT7SOOCTJERX7ASRRWGOL6/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-14 Thread thomas
I'd put my money on a fall-through error condition where TSC is simply the last 
one with a 'good' error message pointed to. I have clusters with CPUs that are 
both 10 years and 10x apart in performance performing migrations between 
themselves quite happily (Sandy Bridge dual quads to Skylake 56 cores), as long 
as you make sure the cluster and default machine type low enough.

Ok this is 4.3 still, but...

So what if you start the VM on the 'weak' host first? Can it then move freely?
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/ZCV2FQGEWIXKMRLTF6MAKDN2SMST7ZPX/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-11 Thread Gianluca Cecchi
On Fri, Dec 11, 2020 at 5:39 PM Milan Zamazal  wrote:

>
>
> TSC frequency is the frequency with which Time Stamp Counter register is
> updated, typically a nominal CPU frequency (see
> https://en.wikipedia.org/wiki/Time_Stamp_Counter for more details).
>
> You can check the value oVirt gets from libvirt by running
>
>   # virsh -r capabilities
>
> and looking at the line like
>
>   
>
> in the output.  Unless frequency scaling is available, the host
> frequencies must be almost the same in order to be able to migrate high
> performance VMs among them.
>
> Note there is a bug that may cause a migration failure for the VMs even
> between hosts with the same frequencies
> (https://bugzilla.redhat.com/1821199).  But this is apparently not your
> case, since the migration is prevented already by Engine.
>
> Regards,
> Milan
>
>
>
See here:

[root@ov200 ~]# virsh -r capabilities | grep "name='tsc'"
  
[root@ov200 ~]#

[root@ov300 ~]# virsh -r capabilities | grep "name='tsc'"
  
[root@ov300 ~]#

[root@ov301 ~]# virsh -r capabilities | grep "name='tsc'"
  
[root@ov301 ~]#

The three hosts have the same model cpu
Model name:  Intel(R) Xeon(R) CPU   X5690  @ 3.47GHz
and slightly different actual frequencies at a certain moment...

But what does it mean so the checkbox

Migrate only to hosts with the same TSC frequency

if even if I don't check it the migration is prevented?

BTW the command lscpu produces exactly the same output on the three hosts,
apart "CPU MHz" and corresponding "BogoMIPS" that slightly change each time
I run the command.

And the flags for all are:

Flags:   fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx
pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl
xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl
vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 popcnt aes
lahf_lm pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid
dtherm ida arat flush_l1d

Gianluca
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/2ZCLAOXITG4UT55B5YMOH2G5WVUM4ZBB/


[ovirt-users] Re: High performance VM cannot migrate due to TSC frequency

2020-12-11 Thread Milan Zamazal
Gianluca Cecchi  writes:

> Hello,
> I'm in 4.4.3 and CentOS 8.3 with 3 hosts.
>
> I have a high performance VM that is running on ov300 and is configured to
> be run on any host.
>
> It seems that both if I set or not the option
>
> Migrate only to hosts with the same TSC frequency
>
> I always am unable to migrate the VM and inside engine.log I see this:
>
> 2020-12-11 15:56:03,424+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-36)
> [e4801b28-c832-4474-aa53-4ebfd7c6e2d0] Candidate host 'ov301'
> ('382bfc8f-60d5-4e06-8571-7dae1700574d') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
>
> 2020-12-11 15:56:03,424+01 INFO
>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-36)
> [e4801b28-c832-4474-aa53-4ebfd7c6e2d0] Candidate host 'ov200'
> ('949d0087-2c24-4759-8427-f9eade1dd2cc') was filtered out by
> 'VAR__FILTERTYPE__INTERNAL' filter 'Migration-Tsc-Frequency' (correlation
> id: null)
>
> Can you verify if it is only my problem?
>
> Apart from the problem itself, what is "TSC frequency" and how can I check
> if my 3 hosts are different or not indeed?

TSC frequency is the frequency with which Time Stamp Counter register is
updated, typically a nominal CPU frequency (see
https://en.wikipedia.org/wiki/Time_Stamp_Counter for more details).

You can check the value oVirt gets from libvirt by running

  # virsh -r capabilities

and looking at the line like

  

in the output.  Unless frequency scaling is available, the host
frequencies must be almost the same in order to be able to migrate high
performance VMs among them.

Note there is a bug that may cause a migration failure for the VMs even
between hosts with the same frequencies
(https://bugzilla.redhat.com/1821199).  But this is apparently not your
case, since the migration is prevented already by Engine.

Regards,
Milan

> Normal VMs are able to migrate without problems
>
> Thanks,
> Gianluca
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/2HYSCVHSVZS6KX5FF5MOXI6YTLQOJIK7/
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/SXQRJZAYTBFBHZJQVMRUL625NE3NPCZ6/