Marcelo Tosatti wrote:
There is significant overhead now in comparison to the early indexing
scheme with a list per root. It must be optimized.
What's the typical number of (1) unsynced pages and (2) unsynced pages
belonging to next cr3 when switching cr3?
I'm guessing (1) and (2) are almost equal, and both fairly large?
(One nice thing is that pages which are no longer used as pagetables
will not be resynced, so our forky workload performance should be good.)
A problem with your suggestion is how to clean the unsync bitmap bit in
the upper pagetables.
True.
The advantage, however, is that the bitmaps and spte
entries can be cached in L1, while currently the cache is blown on
every page resync.
What I'm testing now is:

#define KVM_PAGE_ARRAY_NR 16

struct kvm_mmu_pages {
        struct kvm_mmu_page *pages[KVM_PAGE_ARRAY_NR];
        struct kvm_mmu_page *parent_pages[KVM_PAGE_ARRAY_NR];
        unsigned int offset[KVM_PAGE_ARRAY_NR];
        struct { ... } ...[KVM_PAGE_ARRAY_NR]; // SCNR
        unsigned int nr;
};
static void mmu_sync_children(struct kvm_vcpu *vcpu, struct kvm_mmu_page *sp)
{
        int i;
        struct kvm_mmu_pages pages;

        kvm_mmu_pages_init(&pages);
        while (mmu_unsync_walk(sp, &pages)) {
So mmu_unsync_walk() collects unsynced pages, write-protects them,
flushes the TLB, and returns 0 if none were found?

Perhaps it can simply collect the pages, then do a write-protect pass,
a TLB flush, and a resync pass. The two passes are then over L1-cached
data, so they're not too expensive.
                for_each_sp(pages, i) {
                        struct kvm_mmu_page *parent = pages.parent_pages[i];

                        kvm_sync_page(vcpu, pages.pages[i]);
                        __clear_bit(pages.offset[i],
                                    parent->unsync_child_bitmap);
                }
                kvm_mmu_pages_init(&pages);
                cond_resched_lock(&vcpu->kvm->mmu_lock);
        }
}
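The collect / write-protect / flush / resync idea above can be modeled with a self-contained sketch. This is not the real KVM code: `struct page`, `collect_unsynced` and `sync_batch` are illustrative names, and the page state is reduced to three flags, just to show how batching amortizes the TLB flush over a whole array of pages.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define PAGE_ARRAY_NR 16

/* hypothetical, simplified page state */
struct page {
    bool unsync;
    bool write_protected;
    bool synced;
};

static int tlb_flushes; /* counts flushes, to show the batching win */

/* pass 0: gather up to PAGE_ARRAY_NR unsynced pages into an array */
static size_t collect_unsynced(struct page *pages, size_t n,
                               struct page **out)
{
    size_t nr = 0;
    for (size_t i = 0; i < n && nr < PAGE_ARRAY_NR; i++)
        if (pages[i].unsync)
            out[nr++] = &pages[i];
    return nr;
}

static void sync_batch(struct page *pages, size_t n)
{
    struct page *batch[PAGE_ARRAY_NR];
    size_t nr;

    while ((nr = collect_unsynced(pages, n, batch)) != 0) {
        /* pass 1: write-protect everything collected */
        for (size_t i = 0; i < nr; i++)
            batch[i]->write_protected = true;
        /* one TLB flush covers the whole batch */
        tlb_flushes++;
        /* pass 2: resync; the batch is still hot in L1 */
        for (size_t i = 0; i < nr; i++) {
            batch[i]->synced = true;
            batch[i]->unsync = false;
        }
    }
}
```

With 40 unsynced pages this does three flushes (16 + 16 + 8) instead of forty, which is the point being argued: a larger array trades bitmap-walk overhead for fewer flushes.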
But a second pass is still needed for levels 3 and 4 (the second pass
could be postponed to the next cr3 switch, but I'm not sure it's
worthwhile).
We could:
- keep all three parents in the array
- for the bitmap, keep a count of how many bits are set
- when we clear a bit, decrement the count, and if it reaches zero,
  clear the bit in the parent's parent.
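A minimal sketch of that counted-bitmap propagation, under simplifying assumptions: each page here has a single parent (the thread notes the real case has multiple parents per page), and `mmu_page`, `set_unsync_child`, `clear_unsync_child` are illustrative names, not the real KVM API. Setting the first bit propagates unsync state upward; clearing the last bit clears our slot in the parent, recursively, so no separate second pass over levels 3 and 4 is needed.

```c
#include <assert.h>

#define NR_CHILDREN 512
#define BITS_PER_WORD (8 * sizeof(unsigned long))

struct mmu_page {
    unsigned long unsync_child_bitmap[NR_CHILDREN / BITS_PER_WORD];
    unsigned int unsync_children;   /* popcount of the bitmap */
    struct mmu_page *parent;        /* single parent, for simplicity */
    unsigned int parent_idx;        /* our slot in the parent */
};

static void set_unsync_child(struct mmu_page *sp, unsigned int idx)
{
    unsigned long *word = &sp->unsync_child_bitmap[idx / BITS_PER_WORD];
    unsigned long mask = 1UL << (idx % BITS_PER_WORD);

    if (!(*word & mask)) {
        *word |= mask;
        /* first unsync child: mark our slot in the parent too */
        if (sp->unsync_children++ == 0 && sp->parent)
            set_unsync_child(sp->parent, sp->parent_idx);
    }
}

static void clear_unsync_child(struct mmu_page *sp, unsigned int idx)
{
    unsigned long *word = &sp->unsync_child_bitmap[idx / BITS_PER_WORD];
    unsigned long mask = 1UL << (idx % BITS_PER_WORD);

    if (*word & mask) {
        *word &= ~mask;
        /* last unsync child gone: clear our slot in the parent */
        if (--sp->unsync_children == 0 && sp->parent)
            clear_unsync_child(sp->parent, sp->parent_idx);
    }
}
```

The count makes clearing cheap: only the transition to zero touches the parent, so a page with many unsync children is cleared in the upper levels exactly once.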
Also, the array method poorly handles cases with a large number of
unsync pages, which are common with 2.4/kscand for example.
Well, depends how large the array is. If it's large enough, the tlb
flush cost is overwhelmed by the cost of the actual resync.
Hum, Chris suggests a list_head per level instead of the bitmap.
Doesn't work, a page can have multiple parents, so it would need to be
linked to multiple lists.
We can have
        union {
                bitmap;
                array;
        };
and use the array for sparse bitmaps (or use the bitmap for overflowed
arrays)
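One way the union could work, as a hedged sketch: track unsync children as a small index array while sparse, and convert in place to a full bitmap once the array overflows. The type and function names (`unsync_set`, `unsync_set_add`, the `ARRAY_MAX` size) are all illustrative, not from any real KVM patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

#define NR_CHILDREN 512
#define BITS_PER_WORD (8 * sizeof(unsigned long))
#define ARRAY_MAX 8

struct unsync_set {
    bool overflowed;                 /* false: array form, true: bitmap form */
    unsigned int nr;                 /* entries used in the array form */
    union {
        unsigned short array[ARRAY_MAX];
        unsigned long bitmap[NR_CHILDREN / BITS_PER_WORD];
    } u;
};

static void set_bit_(unsigned long *bm, unsigned int idx)
{
    bm[idx / BITS_PER_WORD] |= 1UL << (idx % BITS_PER_WORD);
}

static bool test_bit_(const unsigned long *bm, unsigned int idx)
{
    return (bm[idx / BITS_PER_WORD] >> (idx % BITS_PER_WORD)) & 1;
}

static void unsync_set_add(struct unsync_set *s, unsigned int idx)
{
    if (!s->overflowed) {
        for (unsigned int i = 0; i < s->nr; i++)
            if (s->u.array[i] == idx)
                return;
        if (s->nr < ARRAY_MAX) {
            s->u.array[s->nr++] = idx;
            return;
        }
        /* overflow: convert the array into a bitmap in place */
        unsigned short saved[ARRAY_MAX];
        memcpy(saved, s->u.array, sizeof(saved));
        memset(s->u.bitmap, 0, sizeof(s->u.bitmap));
        for (unsigned int i = 0; i < ARRAY_MAX; i++)
            set_bit_(s->u.bitmap, saved[i]);
        s->overflowed = true;
    }
    set_bit_(s->u.bitmap, idx);
}

static bool unsync_set_test(const struct unsync_set *s, unsigned int idx)
{
    if (!s->overflowed) {
        for (unsigned int i = 0; i < s->nr; i++)
            if (s->u.array[i] == idx)
                return true;
        return false;
    }
    return test_bit_(s->u.bitmap, idx);
}
```

The sparse case (a handful of unsync children) stays in a few cache lines and iterates directly over indices; only the heavy case (e.g. 2.4/kscand) pays for the full bitmap.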
--
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.