Re: Thoughts on sharing KVM tracepoints [was:Re: [PATCH 2/2] KVM: nVMX: trace nested vm entry]

2021-01-27 Thread Paolo Bonzini

On 25/01/21 22:01, Sean Christopherson wrote:

I 100% think that VMX and SVM should share the bulk of the
code.  Improvements to VMX almost always apply in some way to SVM, and vice
versa.


I agree.


IMO, after debugging a few times, associating
error_code with the event being injected is second nature.  Prepending
intr_info_ would just add extra characters and slow down mental processing.


of course both it and intr_info are VMX specific).


Not really, SVM has the exact same fields with slightly different names.



I slightly prefer the SVM names, using eventinj and eventinjerr in the 
trace points wouldn't be bad.


Having too many tracepoints are a problem.  Having a lot of info in a 
single tracepoint is not a problem, though.


Paolo



Re: Thoughts on sharing KVM tracepoints [was:Re: [PATCH 2/2] KVM: nVMX: trace nested vm entry]

2021-01-25 Thread Sean Christopherson
On Mon, Jan 25, 2021, Maxim Levitsky wrote:
> On Thu, 2021-01-21 at 14:27 -0800, Sean Christopherson wrote:
> > I still don't see why VMX can't share this with SVM.  "npt' can easily be 
> > "tdp",
> > differentiating between VMCB and VMCS can be down with ISA, and VMX can 
> > give 0
> > for int_ctl (or throw in something else interesting/relevant).
> 
> I understand very well your point, and I don't strongly disagree with you.
> However let me voice my own thoughts on this:
>  
> I think that sharing tracepoints between SVM and VMX isn't necessarily a good 
> idea.
> It does make sense in some cases but not in all of them.
>  
> The trace points are primarily intended for developers, thus they should 
> capture as
> much as possible relevant info but not everything because traces can get huge.
>  
> Also despite the fact that a developer will look at the traces, some 
> usability is welcome
> as well (e.g for new developers), and looking at things like 
> info1/info2/intr_info/error_code
> isn't very usable

I'm not opposed to printing different names on VMX, e.g. exit_qual and
idt_vec_info, but I 100% think that VMX and SVM should share the bulk of the
code.  Improvements to VMX almost always apply in some way to SVM, and vice
versa.  It's all but guaranteed that splitting flows will eventually cause
divergence in a bad way.  Divergence in tracepoints is likely to be minor at
worst, but I don't think that's a good reason to intentionally split the code
when it's quite easy to share.

> (btw the error_code should at least be called intr_info_error_code, and

Heh, I disagree even on this.  IMO, after debugging a few times, associating
error_code with the event being injected is second nature.  Prepending
intr_info_ would just add extra characters and slow down mental processing.

> of course both it and intr_info are VMX specific).

Not really, SVM has the exact same fields with slightly different names.

> So I don't even like the fact that kvm_entry/kvm_exit are shared, and neither 
> I want
> to add even more shared trace points.
>
> I understand that there are some benefits of sharing, namely a userspace tool 
> can use
> the same event to *profile* kvm, but I am not sure that this is worth it.

Why is it not worth it?  It's a small amount of one-time kernel pain that allows
all users/developers to reuse scripts and tools across VMX and SVM.  Even manual
usage benefits, e.g. I don't have to remember that a tracepoint is 'x' on VMX
but 'y' on SVM.

> What we could have done is to have ISA (and maybe even x86) agnostic 
> kvm_exit/kvm_entry
> tracepoints that would have no data attached to them, or have very little 
> (like maybe RIP),
> and then have ISA specific tracepoints with the reset of the info.

That would probably end up as the least user friendly combination.  Usually I
enable a tracepoint to get more info, rarely am I interested in _just_ the
logging of the tracepoint itself.  The generic tracepoint would either be
useless and never enabled, or even worse would cause people to overlook the
vendor-specific variant.

> Same could be applied to kvm_nested_vmenter, although for this one I don't 
> think that we
> need an ISA agnostic tracepoint.
>  
> Having said all that, I am not hell bent on this. If you really want it to be 
> this way,
> I won't argue that much.
>  
> Thoughts?


Thoughts on sharing KVM tracepoints [was:Re: [PATCH 2/2] KVM: nVMX: trace nested vm entry]

2021-01-25 Thread Maxim Levitsky
On Thu, 2021-01-21 at 14:27 -0800, Sean Christopherson wrote:
> On Thu, Jan 21, 2021, Maxim Levitsky wrote:
> > This is very helpful to debug nested VMX issues.
> > 
> > Signed-off-by: Maxim Levitsky 
> > ---
> >  arch/x86/kvm/trace.h  | 30 ++
> >  arch/x86/kvm/vmx/nested.c |  5 +
> >  arch/x86/kvm/x86.c|  3 ++-
> >  3 files changed, 37 insertions(+), 1 deletion(-)
> > 
> > diff --git a/arch/x86/kvm/trace.h b/arch/x86/kvm/trace.h
> > index 2de30c20bc264..ec75efdac3560 100644
> > --- a/arch/x86/kvm/trace.h
> > +++ b/arch/x86/kvm/trace.h
> > @@ -554,6 +554,36 @@ TRACE_EVENT(kvm_nested_vmrun,
> > __entry->npt ? "on" : "off")
> >  );
> >  
> > +
> > +/*
> > + * Tracepoint for nested VMLAUNCH/VMRESUME
> > + */
> > +TRACE_EVENT(kvm_nested_vmenter,
> > +   TP_PROTO(__u64 rip, __u64 vmcs, __u64 nested_rip,
> > +__u32 entry_intr_info),
> > +   TP_ARGS(rip, vmcs, nested_rip, entry_intr_info),
> > +
> > +   TP_STRUCT__entry(
> > +   __field(__u64,  rip )
> > +   __field(__u64,  vmcs)
> > +   __field(__u64,  nested_rip  )
> > +   __field(__u32,  entry_intr_info )
> > +   ),
> > +
> > +   TP_fast_assign(
> > +   __entry->rip= rip;
> > +   __entry->vmcs   = vmcs;
> > +   __entry->nested_rip = nested_rip;
> > +   __entry->entry_intr_info= entry_intr_info;
> > +   ),
> > +
> > +   TP_printk("rip: 0x%016llx vmcs: 0x%016llx nrip: 0x%016llx "
> > + "entry_intr_info: 0x%08x",
> > +   __entry->rip, __entry->vmcs, __entry->nested_rip,
> > +   __entry->entry_intr_info)
> 
> I still don't see why VMX can't share this with SVM.  "npt' can easily be 
> "tdp",
> differentiating between VMCB and VMCS can be down with ISA, and VMX can give 0
> for int_ctl (or throw in something else interesting/relevant).

I understand very well your point, and I don't strongly disagree with you.
However let me voice my own thoughts on this:
 
I think that sharing tracepoints between SVM and VMX isn't necessarily a good 
idea.
It does make sense in some cases but not in all of them.
 
The trace points are primarily intended for developers, thus they should 
capture as
much as possible relevant info but not everything because traces can get huge.
 
Also despite the fact that a developer will look at the traces, some usability 
is welcome
as well (e.g for new developers), and looking at things like 
info1/info2/intr_info/error_code
isn't very usable (btw the error_code should at least be called 
intr_info_error_code, and
of course both it and intr_info are VMX specific).
 
So I don't even like the fact that kvm_entry/kvm_exit are shared, and neither I 
want
to add even more shared trace points.
 
I understand that there are some benefits of sharing, namely a userspace tool 
can use
the same event to *profile* kvm, but I am not sure that this is worth it.
 
What we could have done is to have ISA (and maybe even x86) agnostic 
kvm_exit/kvm_entry
tracepoints that would have no data attached to them, or have very little (like 
maybe RIP),
and then have ISA specific tracepoints with the reset of the info.
 
Same could be applied to kvm_nested_vmenter, although for this one I don't 
think that we
need an ISA agnostic tracepoint.
 
Having said all that, I am not hell bent on this. If you really want it to be 
this way,
I won't argue that much.
 
Thoughts?


Best regards,
Maxim Levitsky


> 
>   trace_kvm_nested_vmenter(kvm_rip_read(vcpu),
>vmx->nested.current_vmptr,
>vmcs12->guest_rip,
>0,
>vmcs12->vm_entry_intr_info_field,
>nested_cpu_has_ept(vmcs12),
>KVM_ISA_VMX);
> 
> diff --git a/arch/x86/kvm/trace.h b/arch/x86/kvm/trace.h
> index 2de30c20bc26..90f7cdb31fc1 100644
> --- a/arch/x86/kvm/trace.h
> +++ b/arch/x86/kvm/trace.h
> @@ -522,12 +522,12 @@ TRACE_EVENT(kvm_pv_eoi,
>  );
> 
>  /*
> - * Tracepoint for nested VMRUN
> + * Tracepoint for nested VM-Enter.  Note, vmcb==vmcs on VMX.
>   */
> -TRACE_EVENT(kvm_nested_vmrun,
> +TRACE_EVENT(kvm_nested_vmenter,
> TP_PROTO(__u64 rip, __u64 vmcb, __u64 nested_rip, __u32 int_ctl,
> -__u32 event_inj, bool npt),
> -   TP_ARGS(rip, vmcb, nested_rip, int_ctl, event_inj, npt),
> +__u32 event_inj, bool tdp, __u32 isa),
> +   TP_ARGS(rip, vmcb, nested_rip, int_ctl, event_inj, tdp, isa),
> 
> TP_STRUCT__entry(
> __field(__u64,  rip )
> @@ -535,7 +535,8 @@ TRACE_EVENT(kvm_nested_vmrun,
> __field(__u64,  nested_rip  )
> __field(__u32,