On May 8, 2025 5:15:44 AM PDT, Peter Zijlstra <pet...@infradead.org> wrote:
>On Wed, May 07, 2025 at 02:48:34PM -0700, Sohil Mehta wrote:
>> On 5/7/2025 2:14 AM, Peter Zijlstra wrote:
>> > On Tue, May 06, 2025 at 06:21:41PM -0700, Sohil Mehta wrote:
>> >>
>> >> diff --git a/arch/x86/kernel/nmi.c b/arch/x86/kernel/nmi.c
>> >> index a1d672dcb6f0..183e3e717326 100644
>> >> --- a/arch/x86/kernel/nmi.c
>> >> +++ b/arch/x86/kernel/nmi.c
>> > 
>> >>  static int nmi_handle(unsigned int type, struct pt_regs *regs)
>> >>  {
>> >>   struct nmi_desc *desc = nmi_to_desc(type);
>> >> + unsigned long source_bitmap = 0;
>> > 
>> >    unsigned long source = ~0UL;
>> > 
>> 
>> Thanks! This makes the logic even simpler by getting rid of
>> match_nmi_source(). A minor change described further down.
>> 
>> Also, do you prefer "source" over "source_bitmap"? I had it as such to
>> avoid confusion between source_vector and source_bitmap.
>
>Yeah, I was lazy typing. Perhaps just call it bitmap then?
>
>> >>   nmi_handler_t ehandler;
>> >>   struct nmiaction *a;
>> >>   int handled=0;
>> >> @@ -148,16 +164,40 @@ static int nmi_handle(unsigned int type, struct 
>> >> pt_regs *regs)
>> >>  
>> >>   rcu_read_lock();
>> >>  
>> >> + /*
>> >> +  * Activate NMI source-based filtering only for Local NMIs.
>> >> +  *
>> >> +  * Platform NMI types (such as SERR and IOCHK) have only one
>> >> +  * handler registered per type, so there is no need to
>> >> +  * disambiguate between multiple handlers.
>> >> +  *
>> >> +  * Also, if a platform source ends up setting bit 2 in the
>> >> +  * source bitmap, the local NMI handlers would be skipped since
>> >> +  * none of them use this reserved vector.
>> >> +  *
>> >> +  * For Unknown NMIs, avoid using the source bitmap to ensure all
>> >> +  * potential handlers have a chance to claim responsibility.
>> >> +  */
>> >> + if (cpu_feature_enabled(X86_FEATURE_NMI_SOURCE) && type == NMI_LOCAL)
>> >> +         source_bitmap = fred_event_data(regs);
>> > 
>> >    if (cpu_feature_enabled(X86_FEATURE_NMI_SOURCE) && type == NMI_LOCAL) {
>> >            source = fred_event_data(regs);
>> >            if (source & BIT(0))
>> >                    source = ~0UL;
>> >    }
>> > 
>> 
>> Looks good, except when fred_event_data() returns 0. I don't expect it
>> to happen in practice. But, maybe with new hardware and eventually
>> different hypervisors being involved, it is a possibility.
>> 
>> We can either call it a bug that an NMI happened without source
>> information. Or be extra nice and do this:
>> 
>> if (cpu_feature_enabled(X86_FEATURE_NMI_SOURCE) && type == NMI_LOCAL) {
>>      source = fred_event_data(regs);
>>      if (!source || (source & BIT(0)))
>>              source = ~0UL;
>> }
>
>Perhaps also WARN about the !source case?

A 0 should be interpreted such that NMI source is not available, e.g. due to a 
broken hypervisor or similar.

Reply via email to