On Thursday 15 January 2009 01:03:59 Marcelo Tosatti wrote:
> On Wed, Jan 14, 2009 at 05:17:22PM +0800, Sheng Yang wrote:
> > > So calculating the offset using last interrupt injection is not very
> > > reasonable in this case.
> >
> > Um... I think it's not easy to find a complete reasonable result here, I
> > haven't look into how OS us TMCCT, maybe it can help us to find a more
> > reasonable result here.
>
> Linux, FreeBSD use it for calibration only. And Windows too, apparently
> (at least a couple of versions tested).
>
> What about this?
>
> Index: kvm/arch/x86/kvm/lapic.c
> ===================================================================
> --- kvm.orig/arch/x86/kvm/lapic.c
> +++ kvm/arch/x86/kvm/lapic.c
> @@ -511,52 +511,31 @@ static void apic_send_ipi(struct kvm_lap
>
> static u32 apic_get_tmcct(struct kvm_lapic *apic)
> {
> - u64 counter_passed;
> - ktime_t passed, now;
> - u32 tmcct;
> + ktime_t remaining;
> + u32 tmcct = 0;
>
> ASSERT(apic != NULL);
>
> - now = apic->timer.dev.base->get_time();
> - tmcct = apic_get_reg(apic, APIC_TMICT);
> -
> /* if initial count is 0, current count should also be 0 */
> - if (tmcct == 0)
> + if (apic_get_reg(apic, APIC_TMICT) == 0)
> return 0;
>
> - if (unlikely(ktime_to_ns(now) <=
> - ktime_to_ns(apic->timer.last_update))) {
> - /* Wrap around */
> - passed = ktime_add(( {
> - (ktime_t) {
> - .tv64 = KTIME_MAX -
> - (apic->timer.last_update).tv64}; }
> - ), now);
> - apic_debug("time elapsed\n");
> - } else
> - passed = ktime_sub(now, apic->timer.last_update);
> -
> - counter_passed = div64_u64(ktime_to_ns(passed),
> - (APIC_BUS_CYCLE_NS *
> apic->timer.divide_count));
> -
> - if (counter_passed > tmcct) {
> - if (unlikely(!apic_lvtt_period(apic))) {
> - /* one-shot timers stick at 0 until reset */
> - tmcct = 0;
> - } else {
> - /*
> - * periodic timers reset to APIC_TMICT when they
> - * hit 0. The while loop simulates this happening N
> - * times. (counter_passed %= tmcct) would also work,
> - * but might be slower or not work on 32-bit??
> - */
> - while (counter_passed > tmcct)
> - counter_passed -= tmcct;
> - tmcct -= counter_passed;
> - }
> - } else {
> - tmcct -= counter_passed;
> - }
> + /*
> + * Since reinjection is not rate-limited, use the delay
> + * to inject the last interrupt as an estimate.
> + */
> + if (unlikely(atomic_read(&apic->timer.pending) > 0)) {
> + remaining = apic->timer.injection_delay;
> + if (ktime_to_ns(remaining) > apic->timer.period)
> + remaining = ns_to_ktime(apic->timer.period);
> + } else
> + remaining = hrtimer_expires_remaining(&apic->timer.dev);
A little doubt...
A: time_fire
B: intr_post
C: read TMCCT
The sequence can be ABC or ACB.
injection_delay = time(B) - time(A)
So it didn't count time from read TMCCT... And guest get interrupt at time(B),
not quite understand why time(B) - time(A) matters here...
I think the reasonable here means, this interval is usable later after the
accumulated interrupts are injected. From this point of view, I think current
solution is reasonable. It just assume the delayed interrupts have been
injected.
However, seriously, any value here is wrong, no elegant one. But I still
prefer to the current solution...
And here is not the really problem for now I think. The current mechanism is
mostly OK, but where is the bug... We have either have a simple fix(e.g. if
now < last_update, then return 0) or dig into it. Did it worth a try? Anyway,
it would return a buggy result if we have pending interrupts...
--
regards
Yang, Sheng
> +
> + if (remaining.tv64 > 0)
> + tmcct = div64_u64(ktime_to_ns(remaining),
> + (APIC_BUS_CYCLE_NS * apic->timer.divide_count));
> +
> + WARN_ON(tmcct > apic_get_reg(apic, APIC_TMICT));
>
> return tmcct;
> }
> @@ -653,8 +632,6 @@ static void start_apic_timer(struct kvm_
> {
> ktime_t now = apic->timer.dev.base->get_time();
>
> - apic->timer.last_update = now;
> -
> apic->timer.period = apic_get_reg(apic, APIC_TMICT) *
> APIC_BUS_CYCLE_NS * apic->timer.divide_count;
> atomic_set(&apic->timer.pending, 0);
> @@ -972,6 +949,7 @@ static int __apic_timer_fn(struct kvm_la
> set_bit(KVM_REQ_PENDING_TIMER, &apic->vcpu->requests);
> if (waitqueue_active(q))
> wake_up_interruptible(q);
> + apic->timer.int_trigger = ktime_get();
>
> if (apic_lvtt_period(apic)) {
> result = 1;
> @@ -1112,12 +1090,14 @@ void kvm_inject_apic_timer_irqs(struct k
>
> void kvm_apic_timer_intr_post(struct kvm_vcpu *vcpu, int vec)
> {
> + ktime_t delta;
> struct kvm_lapic *apic = vcpu->arch.apic;
>
> - if (apic && apic_lvt_vector(apic, APIC_LVTT) == vec)
> - apic->timer.last_update = ktime_add_ns(
> - apic->timer.last_update,
> - apic->timer.period);
> + if (apic && apic_lvt_vector(apic, APIC_LVTT) == vec &&
> + atomic_read(&apic->timer.pending) == 0) {
> + delta = ktime_sub(ktime_get(), apic->timer.int_trigger);
> + apic->timer.injection_delay = delta;
> + }
> }
>
> int kvm_get_apic_interrupt(struct kvm_vcpu *vcpu)
> Index: kvm/arch/x86/kvm/lapic.h
> ===================================================================
> --- kvm.orig/arch/x86/kvm/lapic.h
> +++ kvm/arch/x86/kvm/lapic.h
> @@ -12,7 +12,8 @@ struct kvm_lapic {
> atomic_t pending;
> s64 period; /* unit: ns */
> u32 divide_count;
> - ktime_t last_update;
> + ktime_t injection_delay;
> + ktime_t int_trigger;
> struct hrtimer dev;
> } timer;
> struct kvm_vcpu *vcpu;
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html