Re: [patch 05/39] genirq/msi: Remove filter from msi_free_descs_free_range()

2023-03-02 Thread Miquel Raynal
Hi Thomas,

t...@linutronix.de wrote on Wed, 01 Mar 2023 22:07:48 +0100:

> Miquel!
> 
> On Wed, Mar 01 2023 at 11:55, Miquel Raynal wrote:
> > t...@linutronix.de wrote on Fri, 11 Nov 2022 14:54:22 +0100 (CET):
> >  
> >> When a range of descriptors is freed then all of them are not associated to
> >> a linux interrupt. Remove the filter and add a warning to the free 
> >> function.
> >> +  /* Leak the descriptor when it is still referenced */
> >> +  if (WARN_ON_ONCE(msi_desc_match(desc, MSI_DESC_ASSOCIATED)))
> >> +  continue;
> >> +  msi_free_desc(desc);
> >>}
> >>  }  
> >
> > It looks like since this commit I am getting warnings upon EPROBE_DEFER
> > errors in the mvpp2 Marvell Ethernet driver. I looked a bit at the
> > internals to understand why this warning was shown, and it seems that
> > nothing "de-references" the descriptors, which would mean here:
> > resetting desc->irq to 0.  
> 
> Correct. This platform-msi ^(*&!@&^ hack really needs to die ASAP.

:-)

> Marc, where are we on that? Is this still in limbo?
> 
> > I am wondering how useful thisd WARN_ON() is, or otherwise where the  
> 
> It is useful as it caught bugs already.

Sure.

> > desc->irq entry should be zeroed (if I understand that correctly), any
> > help will be appreciated.  
> 
> Untested workaround below.

Excellent!

> I hate it with a passion, but *shrug*.

:'D

> Thanks,
> 
> tglx
> ---
>  drivers/base/platform-msi.c |1 +
>  include/linux/msi.h |2 ++
>  kernel/irq/msi.c|   23 ++-
>  3 files changed, 25 insertions(+), 1 deletion(-)

Kudos for the diff, which indeed works perfectly in my case. I guess
you'll make a proper patch soon, if that's the case you can add my:

Tested-by: Miquel Raynal 

Let me know otherwise.

Thanks a lot for the very quick fix!
Miquèl

> --- a/drivers/base/platform-msi.c
> +++ b/drivers/base/platform-msi.c
> @@ -324,6 +324,7 @@ void platform_msi_device_domain_free(str
>   struct platform_msi_priv_data *data = domain->host_data;
>  
>   msi_lock_descs(data->dev);
> + msi_domain_depopulate_descs(data->dev, virq, nr_irqs);
>   irq_domain_free_irqs_common(domain, virq, nr_irqs);
>   msi_free_msi_descs_range(data->dev, virq, virq + nr_irqs - 1);
>   msi_unlock_descs(data->dev);
> --- a/include/linux/msi.h
> +++ b/include/linux/msi.h
> @@ -631,6 +631,8 @@ int msi_domain_prepare_irqs(struct irq_d
>   int nvec, msi_alloc_info_t *args);
>  int msi_domain_populate_irqs(struct irq_domain *domain, struct device *dev,
>int virq, int nvec, msi_alloc_info_t *args);
> +void msi_domain_depopulate_descs(struct device *dev, int virq, int nvec);
> +
>  struct irq_domain *
>  __platform_msi_create_device_domain(struct device *dev,
>   unsigned int nvec,
> --- a/kernel/irq/msi.c
> +++ b/kernel/irq/msi.c
> @@ -1109,14 +1109,35 @@ int msi_domain_populate_irqs(struct irq_
>   return 0;
>  
>  fail:
> - for (--virq; virq >= virq_base; virq--)
> + for (--virq; virq >= virq_base; virq--) {
> + msi_domain_depopulate_descs(dev, virq, 1);
>   irq_domain_free_irqs_common(domain, virq, 1);
> + }
>   msi_domain_free_descs(dev, );
>  unlock:
>   msi_unlock_descs(dev);
>   return ret;
>  }
>  
> +void msi_domain_depopulate_descs(struct device *dev, int virq_base, int nvec)
> +{
> + struct msi_ctrl ctrl = {
> + .domid  = MSI_DEFAULT_DOMAIN,
> + .first  = virq_base,
> + .last   = virq_base + nvec - 1,
> + };
> + struct msi_desc *desc;
> + struct xarray *xa;
> + unsigned long idx;
> +
> + if (!msi_ctrl_valid(dev, ))
> + return;
> +
> + xa = >msi.data->__domains[ctrl.domid].store;
> + xa_for_each_range(xa, idx, desc, ctrl.first, ctrl.last)
> + desc->irq = 0;
> +}
> +
>  /*
>   * Carefully check whether the device can use reservation mode. If
>   * reservation mode is enabled then the early activation will assign a


Re: [patch 05/39] genirq/msi: Remove filter from msi_free_descs_free_range()

2023-03-01 Thread Thomas Gleixner
Miquel!

On Wed, Mar 01 2023 at 11:55, Miquel Raynal wrote:
> t...@linutronix.de wrote on Fri, 11 Nov 2022 14:54:22 +0100 (CET):
>
>> When a range of descriptors is freed then all of them are not associated to
>> a linux interrupt. Remove the filter and add a warning to the free function.
>> +/* Leak the descriptor when it is still referenced */
>> +if (WARN_ON_ONCE(msi_desc_match(desc, MSI_DESC_ASSOCIATED)))
>> +continue;
>> +msi_free_desc(desc);
>>  }
>>  }
>
> It looks like since this commit I am getting warnings upon EPROBE_DEFER
> errors in the mvpp2 Marvell Ethernet driver. I looked a bit at the
> internals to understand why this warning was shown, and it seems that
> nothing "de-references" the descriptors, which would mean here:
> resetting desc->irq to 0.

Correct. This platform-msi ^(*&!@&^ hack really needs to die ASAP.

Marc, where are we on that? Is this still in limbo?

> I am wondering how useful thisd WARN_ON() is, or otherwise where the

It is useful as it caught bugs already.

> desc->irq entry should be zeroed (if I understand that correctly), any
> help will be appreciated.

Untested workaround below. I hate it with a passion, but *shrug*.

Thanks,

tglx
---
 drivers/base/platform-msi.c |1 +
 include/linux/msi.h |2 ++
 kernel/irq/msi.c|   23 ++-
 3 files changed, 25 insertions(+), 1 deletion(-)

--- a/drivers/base/platform-msi.c
+++ b/drivers/base/platform-msi.c
@@ -324,6 +324,7 @@ void platform_msi_device_domain_free(str
struct platform_msi_priv_data *data = domain->host_data;
 
msi_lock_descs(data->dev);
+   msi_domain_depopulate_descs(data->dev, virq, nr_irqs);
irq_domain_free_irqs_common(domain, virq, nr_irqs);
msi_free_msi_descs_range(data->dev, virq, virq + nr_irqs - 1);
msi_unlock_descs(data->dev);
--- a/include/linux/msi.h
+++ b/include/linux/msi.h
@@ -631,6 +631,8 @@ int msi_domain_prepare_irqs(struct irq_d
int nvec, msi_alloc_info_t *args);
 int msi_domain_populate_irqs(struct irq_domain *domain, struct device *dev,
 int virq, int nvec, msi_alloc_info_t *args);
+void msi_domain_depopulate_descs(struct device *dev, int virq, int nvec);
+
 struct irq_domain *
 __platform_msi_create_device_domain(struct device *dev,
unsigned int nvec,
--- a/kernel/irq/msi.c
+++ b/kernel/irq/msi.c
@@ -1109,14 +1109,35 @@ int msi_domain_populate_irqs(struct irq_
return 0;
 
 fail:
-   for (--virq; virq >= virq_base; virq--)
+   for (--virq; virq >= virq_base; virq--) {
+   msi_domain_depopulate_descs(dev, virq, 1);
irq_domain_free_irqs_common(domain, virq, 1);
+   }
msi_domain_free_descs(dev, );
 unlock:
msi_unlock_descs(dev);
return ret;
 }
 
+void msi_domain_depopulate_descs(struct device *dev, int virq_base, int nvec)
+{
+   struct msi_ctrl ctrl = {
+   .domid  = MSI_DEFAULT_DOMAIN,
+   .first  = virq_base,
+   .last   = virq_base + nvec - 1,
+   };
+   struct msi_desc *desc;
+   struct xarray *xa;
+   unsigned long idx;
+
+   if (!msi_ctrl_valid(dev, ))
+   return;
+
+   xa = >msi.data->__domains[ctrl.domid].store;
+   xa_for_each_range(xa, idx, desc, ctrl.first, ctrl.last)
+   desc->irq = 0;
+}
+
 /*
  * Carefully check whether the device can use reservation mode. If
  * reservation mode is enabled then the early activation will assign a


Re: [patch 05/39] genirq/msi: Remove filter from msi_free_descs_free_range()

2023-03-01 Thread Miquel Raynal
Hi Thomas,

t...@linutronix.de wrote on Fri, 11 Nov 2022 14:54:22 +0100 (CET):

> When a range of descriptors is freed then all of them are not associated to
> a linux interrupt. Remove the filter and add a warning to the free function.
> 
> Signed-off-by: Thomas Gleixner 
> ---

[...]

> --- a/kernel/irq/msi.c
> +++ b/kernel/irq/msi.c
> @@ -120,7 +120,7 @@ static int msi_add_simple_msi_descs(stru
>  fail_mem:
>   ret = -ENOMEM;
>  fail:
> - msi_free_msi_descs_range(dev, MSI_DESC_ALL, index, last);
> + msi_free_msi_descs_range(dev, index, last);
>   return ret;
>  }
>  
> @@ -141,12 +141,11 @@ static bool msi_desc_match(struct msi_de
>  /**
>   * msi_free_msi_descs_range - Free MSI descriptors of a device
>   * @dev: Device to free the descriptors
> - * @filter:  Descriptor state filter
>   * @first_index: Index to start freeing from
>   * @last_index:  Last index to be freed
>   */
> -void msi_free_msi_descs_range(struct device *dev, enum msi_desc_filter 
> filter,
> -   unsigned int first_index, unsigned int last_index)
> +void msi_free_msi_descs_range(struct device *dev, unsigned int first_index,
> +   unsigned int last_index)
>  {
>   struct xarray *xa = >msi.data->__store;
>   struct msi_desc *desc;
> @@ -155,10 +154,12 @@ void msi_free_msi_descs_range(struct dev
>   lockdep_assert_held(>msi.data->mutex);
>  
>   xa_for_each_range(xa, idx, desc, first_index, last_index) {
> - if (msi_desc_match(desc, filter)) {
> - xa_erase(xa, idx);
> - msi_free_desc(desc);
> - }
> + xa_erase(xa, idx);
> +
> + /* Leak the descriptor when it is still referenced */
> + if (WARN_ON_ONCE(msi_desc_match(desc, MSI_DESC_ASSOCIATED)))
> + continue;
> + msi_free_desc(desc);
>   }
>  }

It looks like since this commit I am getting warnings upon EPROBE_DEFER
errors in the mvpp2 Marvell Ethernet driver. I looked a bit at the
internals to understand why this warning was shown, and it seems that
nothing "de-references" the descriptors, which would mean here:
resetting desc->irq to 0.

In my case I don't think the mvpp2_main.c driver nor the
irq_mvebu_icu.c driver behind do anything strange (as far as I
understand them). I believe any error during a ->probe() leading
to an irq_dispose_mapping() call with MSI behind will trigger that
warning.

I am wondering how useful thisd WARN_ON() is, or otherwise where the
desc->irq entry should be zeroed (if I understand that correctly), any
help will be appreciated.

Here is the splat:

[2.045857] [ cut here ]
[2.050497] WARNING: CPU: 2 PID: 9 at kernel/irq/msi.c:197 
msi_domain_free_descs+0x120/0x128
[2.058993] Modules linked in:
[2.062068] CPU: 2 PID: 9 Comm: kworker/u8:0 Not tainted 6.2.0-rc1+ #168
[2.068804] Hardware name: Delta TN48M (DT)
[2.073008] Workqueue: events_unbound deferred_probe_work_func
[2.078878] pstate: 0005 (nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[2.085875] pc : msi_domain_free_descs+0x120/0x128
[2.090693] lr : msi_domain_free_descs+0xe0/0x128
[2.095423] sp : 8a82b8d0
[2.098745] x29: 8a82b8d0 x28: 7bfdbb508ca8 x27: 000102e28940
[2.105921] x26: 0004 x25: 000100257660 x24: 89a6b8d8
[2.113096] x23: 88a1c868 x22: 000102e4b700 x21: 000101494bb0
[2.120270] x20: 8a82b958 x19: 89afe248 x18: 0010
[2.127444] x17: fcfa4008 x16: 0008 x15: 013a
[2.134618] x14: 8a82b6a0 x13: ffea x12: 8a82b870
[2.141794] x11: 0002 x10:  x9 : 0001
[2.148967] x8 :  x7 : 0238 x6 : 0001005f1230
[2.156141] x5 :  x4 : 2000 x3 : 
[2.163315] x2 : b586accf01c45400 x1 : 0001000e x0 : 002d
[2.170489] Call trace:
[2.172948]  msi_domain_free_descs+0x120/0x128
[2.177417]  msi_domain_free_msi_descs_range+0x64/0x9c
[2.182586]  platform_msi_device_domain_free+0x88/0xb8
[2.187752]  mvebu_icu_irq_domain_free+0x60/0x80
[2.192396]  irq_domain_free_irqs_hierarchy.part.21+0x94/0xa8
[2.198173]  irq_domain_free_irqs+0xec/0x150
[2.202466]  irq_dispose_mapping+0x104/0x120
[2.206758]  mvpp2_probe+0x2028/0x21f8
[2.210530]  platform_probe+0x68/0xd8
[2.214210]  really_probe+0xbc/0x2a8
[2.217807]  __driver_probe_device+0x78/0xe0
[2.222102]  driver_probe_device+0x3c/0x108
[2.226308]  __device_attach_driver+0xb8/0xf8
[2.230690]  bus_for_each_drv+0x7c/0xd0
[2.234547]  __device_attach+0xec/0x188
[2.238404]  device_initial_probe+0x14/0x20
[2.242611]  bus_probe_device+0x9c/0xa8
[2.246468]  

Re: [patch 05/39] genirq/msi: Remove filter from msi_free_descs_free_range()

2022-11-16 Thread Jason Gunthorpe
On Fri, Nov 11, 2022 at 02:54:22PM +0100, Thomas Gleixner wrote:
> When a range of descriptors is freed then all of them are not associated to
> a linux interrupt. Remove the filter and add a warning to the free function.
> 
> Signed-off-by: Thomas Gleixner 
> ---
>  drivers/base/platform-msi.c |2 +-
>  include/linux/msi.h |5 ++---
>  kernel/irq/msi.c|   19 ++-
>  3 files changed, 13 insertions(+), 13 deletions(-)

Reviewed-by: Jason Gunthorpe 

Jason