On 2020-09-16 08:04, lushenming wrote:
> Hi,
>
> Our team just discussed this issue again and consulted our GIC hardware
> design team. They think the RD can afford busy waiting. So we still
> think maybe 0 is better, at least for our hardware.
>
> In addition, if not 0, as I said before, in our measurement it takes
> only hundreds of nanoseconds, or 1~2 microseconds, to finish parsing
> the VPT in most cases. So maybe 1 microsecond, or smaller, is more
> appropriate. Anyway, 10 microseconds is too much.
>
> But it has to be said that it does depend on the hardware
> implementation.
Exactly. And given that the only publicly available implementation is a
software model, I am reluctant to change "performance" related things
based on benchmarks that can't be verified and that look to me like a
micro-optimization.
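For reference, the wait being discussed is a poll on the Dirty bit of
GICR_VPENDBASER, and the contested value is the delay between two reads
of the register. A minimal sketch of such a poll, assuming the kernel's
iopoll helpers (the function itself and how vlpi_base is obtained are
illustrative, not the actual driver code):

#include <linux/iopoll.h>
#include <linux/irqchip/arm-gic-v3.h>

/*
 * Wait for the redistributor to finish parsing the VPT, i.e. for
 * GICR_VPENDBASER.Dirty to clear. The second-to-last argument is the
 * delay between reads in microseconds: 10 means a udelay(10) between
 * reads, 0 means pure busy-waiting on the register.
 */
static int wait_for_vpt_parse(void __iomem *vlpi_base)
{
	u64 val;

	return readq_relaxed_poll_timeout_atomic(vlpi_base + GICR_VPENDBASER,
						 val,
						 !(val & GICR_VPENDBASER_Dirty),
						 10,	/* delay between reads (us) */
						 500);	/* overall timeout (us) */
}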
> Besides, I'm not sure where the start and end points are of the total
> scheduling latency of a vcpu that you mentioned, which includes many
> events. Is the parse time of the VPT not clear enough?
Measure the time it takes from kvm_vcpu_load() to the point where the
vcpu enters the guest. How much, in proportion, do these 1/2/10us
represent?
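Purely as an illustration of the measurement meant here (the hook points
and the load_ns field are invented for the example; this is not existing
KVM code), something like:

#include <linux/kernel.h>
#include <linux/ktime.h>

/* Hypothetical per-vcpu state, e.g. tucked into struct kvm_vcpu_arch. */
struct vcpu_lat {
	u64 load_ns;
};

/* Call at the end of the vcpu load path (kvm_vcpu_load() time). */
static void lat_mark_load(struct vcpu_lat *lat)
{
	lat->load_ns = ktime_get_ns();
}

/* Call as late as possible before actually entering the guest. */
static void lat_report_entry(struct vcpu_lat *lat)
{
	trace_printk("load-to-entry: %llu ns\n",
		     ktime_get_ns() - lat->load_ns);
}

Comparing that delta with the VPT parse time is what would tell us
whether shaving the poll delay is measurable at all.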
Also, a better(?) course of action would maybe be to consider whether we
should split the its_vpe_schedule() call into two distinct operations:
one that programs the VPE to be resident, and another that polls the
Dirty bit *much later* on the entry path, giving the GIC a chance to
work in parallel with the CPU on the entry path.
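A sketch of what such a split could look like (the names are made up,
and the cacheability/shareability attributes that its_vpe_schedule()
programs are elided; this is an illustration, not a working patch):

#include <linux/bits.h>
#include <linux/iopoll.h>
#include <linux/irqchip/arm-gic-v3.h>

/* Phase 1, at kvm_vcpu_load() time: make the VPE resident, don't wait. */
static void vpe_make_resident(void __iomem *vlpi_base, u64 vpt_pa)
{
	u64 val = (vpt_pa & GENMASK_ULL(51, 16)) | GICR_VPENDBASER_Valid;

	/* RaWaWb/InnerShareable attributes elided for brevity. */
	gicr_write_vpendbaser(val, vlpi_base + GICR_VPENDBASER);
}

/*
 * Phase 2, much later on the entry path: by now the GIC has had time to
 * parse the VPT in parallel with the CPU, so the poll should usually
 * succeed on the very first read.
 */
static int vpe_commit(void __iomem *vlpi_base)
{
	u64 val;

	return readq_relaxed_poll_timeout_atomic(vlpi_base + GICR_VPENDBASER,
						 val,
						 !(val & GICR_VPENDBASER_Dirty),
						 10, 500);
}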
If your HW is as quick as you say it is, it would pretty much guarantee
a clear read of GICR_VPENDBASER without waiting.
M.
--
Jazz is not dead. It just smells funny...

