On Fri, Jun 29, 2012 at 07:26:38PM -0300, Marcelo Tosatti wrote:
> On Tue, Jun 12, 2012 at 03:01:27PM +0300, Gleb Natapov wrote:
> > "rep ins" emulation is going through emulator now. This is slow because
> > emulator knows how to write back only one datum at a time. This patch
> > provides fast path for the instruction in certain conditions. The
> > conditions are: DF flag is not set, destination memory is RAM and single
> > datum does not cross page boundary. If fast path code fails it falls
> > back to emulation.
> > 
> > Signed-off-by: Gleb Natapov <[email protected]>
> > ---
> >  arch/x86/include/asm/kvm_host.h |    6 ++
> >  arch/x86/kvm/svm.c              |   20 +++++--
> >  arch/x86/kvm/vmx.c              |   25 +++++--
> >  arch/x86/kvm/x86.c              |  133 
> > ++++++++++++++++++++++++++++++++++++--
> >  4 files changed, 165 insertions(+), 19 deletions(-)
> > diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c
> > index 7a41878..f3e7bb3 100644
> > --- a/arch/x86/kvm/svm.c
> > +++ b/arch/x86/kvm/svm.c
> > @@ -1887,21 +1887,31 @@ static int io_interception(struct vcpu_svm *svm)
> >  {
> >     struct kvm_vcpu *vcpu = &svm->vcpu;
> >     u32 io_info = svm->vmcb->control.exit_info_1; /* address size bug? */
> > -   int size, in, string;
> > +   int size, in, string, rep;
> >     unsigned port;
> >  
> >     ++svm->vcpu.stat.io_exits;
> >     string = (io_info & SVM_IOIO_STR_MASK) != 0;
> > +   rep = (io_info & SVM_IOIO_REP_MASK) != 0;
> >     in = (io_info & SVM_IOIO_TYPE_MASK) != 0;
> > -   if (string || in)
> > -           return emulate_instruction(vcpu, 0) == EMULATE_DONE;
> >  
> >     port = io_info >> 16;
> >     size = (io_info & SVM_IOIO_SIZE_MASK) >> SVM_IOIO_SIZE_SHIFT;
> >     svm->next_rip = svm->vmcb->control.exit_info_2;
> > -   skip_emulated_instruction(&svm->vcpu);
> >  
> > -   return kvm_fast_pio_out(vcpu, size, port);
> > +   if (!string && !in) {
> > +           skip_emulated_instruction(&svm->vcpu);
> > +           return kvm_fast_pio_out(vcpu, size, port);
> > +   } else if (string && in && rep) {
> 
> Is there a reason to restrict optimization to rep ? That is, 
> it should be easy to extend to normal in?
> 
Normal "in" does not have performance problem to the best of my knowledge.
Going through emulator for non performance critical code means less logic
to duplicate.

> > +           kvm_x86_ops->skip_emulated_instruction(vcpu);
> > +           return EMULATE_DONE;
> > +   }
> > +   if (kvm_get_rflags(vcpu) & X86_EFLAGS_DF)
> > +           return EMULATE_FAIL;
> > +   if (ad_bytes_idx > 2)
> > +           return EMULATE_FAIL;
> > +
> > +   ad_bytes = (u8[]){2, 4, 8}[ad_bytes_idx];
> > +
> > +   rdi = kvm_address_mask(ad_bytes, rdi);
> > +
> > +   count = (PAGE_SIZE - offset_in_page(rdi))/size;
> > +
> > +   if (count == 0) /* 'in' crosses page boundry */
> > +           return EMULATE_FAIL;
> > +
> > +   count = min(count, kvm_address_mask(ad_bytes, rcx));
> > + 
> > +   r = kvm_linearize_address(vcpu, get_emulation_mode(vcpu),
> > +                   rdi, VCPU_SREG_ES, count, true, false, ad_bytes,
> > +                   &linear_addr);
> 
> kvm_linearize_address expects size parameter in bytes?
Yes.

--
                        Gleb.
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to