[+cc linux-pci]

On Thu, Mar 16, 2017 at 10:50:06PM +0100, Thomas Gleixner wrote:
> The generic pci configuration space accessors are globally serialized via
> pci_lock. On larger systems this causes massive lock contention when the
> configuration space has to be accessed frequently. One such access pattern
> is the Intel Uncore performance counter unit.

s/pci/PCI/ above.

> Provide a kernel config option which can be selected by an architecture
> when the low level PCI configuration space accessors in the architecture
> use their own serialization or can operate completely lockless.

The arch/x86/pci/common.c comment:

  /*
   * This interrupt-safe spinlock protects all accesses to PCI
   * configuration space.
   */
  DEFINE_RAW_SPINLOCK(pci_config_lock);

is no longer quite correct.

I think the raw_pci_read() and raw_pci_write() implementations are
such that we use the old locked accessors for the first 256 bytes,
even when ECAM is available.  Not necessarily a problem, just an
observation.  I guess the uncore PMU registers are in the extended
config space.
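
To illustrate the dispatch I mean, here is a rough userspace model (not
the kernel code).  The model_* names and the path enum are made up for
this sketch.  It assumes the x86 raw_pci_read() tries the legacy ops
first for domain 0 and offsets below 256, and only falls back to the
extended (ECAM) ops otherwise:

```c
/* Userspace model of the x86 raw_pci_read() dispatch; not kernel code. */
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum path { PATH_NONE, PATH_LEGACY_LOCKED, PATH_EXT };

struct model_raw_ops {
	int (*read)(unsigned int domain, unsigned int bus, unsigned int devfn,
		    int reg, int len, uint32_t *val);
};

/* Records which accessor served the most recent read. */
static enum path last_path = PATH_NONE;

/* Hypothetical stand-in for the old accessor that takes pci_config_lock. */
static int model_legacy_read(unsigned int domain, unsigned int bus,
			     unsigned int devfn, int reg, int len,
			     uint32_t *val)
{
	(void)domain; (void)bus; (void)devfn; (void)reg; (void)len;
	last_path = PATH_LEGACY_LOCKED;
	*val = 0;
	return 0;
}

/* Hypothetical stand-in for the ECAM accessor. */
static int model_ecam_read(unsigned int domain, unsigned int bus,
			   unsigned int devfn, int reg, int len,
			   uint32_t *val)
{
	(void)domain; (void)bus; (void)devfn; (void)reg; (void)len;
	last_path = PATH_EXT;
	*val = 0;
	return 0;
}

static const struct model_raw_ops model_ops = { .read = model_legacy_read };
static const struct model_raw_ops model_ext_ops = { .read = model_ecam_read };
static const struct model_raw_ops *raw_ops = &model_ops;
static const struct model_raw_ops *raw_ext_ops = &model_ext_ops;

/* Assumed dispatch order: legacy ops for the first 256 bytes, else ECAM. */
int model_raw_pci_read(unsigned int domain, unsigned int bus,
		       unsigned int devfn, int reg, int len, uint32_t *val)
{
	assert(val != NULL);
	if (domain == 0 && reg < 256 && raw_ops)
		return raw_ops->read(domain, bus, devfn, reg, len, val);
	if (raw_ext_ops)
		return raw_ext_ops->read(domain, bus, devfn, reg, len, val);
	return -1;
}

/* Helper: report which path a 4-byte read at offset 'reg' would take. */
enum path model_path_for(int reg)
{
	uint32_t val;

	model_raw_pci_read(0, 0, 0, reg, 4, &val);
	return last_path;
}
```

In this model a read at offset 0xfc still goes through the locked
legacy path even though an ECAM accessor is registered, while a read at
0x100 uses the ECAM path.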

> Signed-off-by: Thomas Gleixner <[email protected]>
> ---
>  drivers/pci/Kconfig  |    3 +++
>  drivers/pci/access.c |   16 ++++++++++++----
>  2 files changed, 15 insertions(+), 4 deletions(-)
> 
> --- a/drivers/pci/Kconfig
> +++ b/drivers/pci/Kconfig
> @@ -86,6 +86,9 @@ config PCI_ATS
>  config PCI_ECAM
>       bool
>  
> +config PCI_LOCKLESS_CONFIG
> +     bool

It's conceivable that this could be a per-host bridge property, but
not worth worrying about for now.

>  config PCI_IOV
>       bool "PCI IOV support"
>       depends on PCI
> --- a/drivers/pci/access.c
> +++ b/drivers/pci/access.c
> @@ -25,6 +25,14 @@ DEFINE_RAW_SPINLOCK(pci_lock);
>  #define PCI_word_BAD (pos & 1)
>  #define PCI_dword_BAD (pos & 3)
>  
> +#ifdef CONFIG_PCI_LOCKLESS_CONFIG
> +# define pci_lock_config(f)  do { (void)(f); } while (0)
> +# define pci_unlock_config(f)        do { (void)(f); } while (0)
> +#else
> +# define pci_lock_config(f)  raw_spin_lock_irqsave(&pci_lock, f)
> +# define pci_unlock_config(f)        raw_spin_unlock_irqrestore(&pci_lock, f)
> +#endif
> +
>  #define PCI_OP_READ(size, type, len) \
>  int pci_bus_read_config_##size \
>       (struct pci_bus *bus, unsigned int devfn, int pos, type *value) \
> @@ -33,10 +41,10 @@ int pci_bus_read_config_##size \
>       unsigned long flags;                                            \
>       u32 data = 0;                                                   \
>       if (PCI_##size##_BAD) return PCIBIOS_BAD_REGISTER_NUMBER;       \
> -     raw_spin_lock_irqsave(&pci_lock, flags);                        \
> +     pci_lock_config(flags);                                         \
>       res = bus->ops->read(bus, devfn, pos, len, &data);              \
>       *value = (type)data;                                            \
> -     raw_spin_unlock_irqrestore(&pci_lock, flags);           \
> +     pci_unlock_config(flags);                                       \
>       return res;                                                     \
>  }
>  
> @@ -47,9 +55,9 @@ int pci_bus_write_config_##size \
>       int res;                                                        \
>       unsigned long flags;                                            \
>       if (PCI_##size##_BAD) return PCIBIOS_BAD_REGISTER_NUMBER;       \
> -     raw_spin_lock_irqsave(&pci_lock, flags);                        \
> +     pci_lock_config(flags);                                         \
>       res = bus->ops->write(bus, devfn, pos, len, value);             \
> -     raw_spin_unlock_irqrestore(&pci_lock, flags);           \
> +     pci_unlock_config(flags);                                       \
>       return res;                                                     \
>  }
>  
> 
> 
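
For reference, a small userspace model of the macro pattern above, as I
read it.  Everything named model_* or MODEL_* is hypothetical, and the
"lock" is just a counter standing in for pci_lock:

```c
/* Userspace model of pci_lock_config()/pci_unlock_config(); not kernel code. */
#include <assert.h>

static int lock_depth;		/* 1 while the model lock is held */
static int lock_acquires;	/* how many times the model lock was taken */

/* Stand-ins for raw_spin_lock_irqsave()/raw_spin_unlock_irqrestore(). */
#define model_lock_irqsave(f) \
	do { (f) = 1UL; lock_depth++; lock_acquires++; } while (0)
#define model_unlock_irqrestore(f) \
	do { (void)(f); lock_depth--; } while (0)

#ifdef MODEL_LOCKLESS_CONFIG
/* Lockless build: only reference the flags variable, take no lock. */
# define model_lock_config(f)	do { (void)(f); } while (0)
# define model_unlock_config(f)	do { (void)(f); } while (0)
#else
# define model_lock_config(f)	model_lock_irqsave(f)
# define model_unlock_config(f)	model_unlock_irqrestore(f)
#endif

/* Returns the lock depth observed while the "config access" runs. */
int model_config_access(void)
{
	unsigned long flags = 0;
	int depth_during_access;

	model_lock_config(flags);
	depth_during_access = lock_depth;	/* the access would happen here */
	model_unlock_config(flags);

	assert(lock_depth == 0);		/* lock must be balanced */
	return depth_during_access;
}

/* Returns how many times the model lock has been taken so far. */
int model_lock_acquires(void)
{
	return lock_acquires;
}
```

Built as-is, every model_config_access() takes the lock once and
returns 1.  Built with -DMODEL_LOCKLESS_CONFIG, the macros only
evaluate (void)(flags), which keeps the variable referenced so the
compiler should not complain about it being unused, and the function
returns 0.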
