Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Sundaresan, Sujaritha


-Original Message-
From: Chris Wilson  
Sent: Wednesday, March 6, 2019 1:08 AM
To: Wajdeczko, Michal ; Sundaresan, Sujaritha 
; intel-gfx@lists.freedesktop.org
Subject: Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM 
initialization

Quoting Michal Wajdeczko (2019-03-06 09:01:09)
> On Wed, 06 Mar 2019 09:45:17 +0100, Chris Wilson
> > Now doing an if (i915_error_injected() && !err) err = -EINVAL; makes 
> > sense to catch places where we've eaten that error and so breaking 
> > the test.
> 
> This will not work today, as at least in one place - i915_gem_init - 
> we inject -EIO which later we try to replace with success. And for 
> that case we may rather want to add

I think we may want to move that particular test to one side, something along 
the lines of i915.disable_gpu=1 ?
-Chris

Just to clarify on how to proceed with the fix. I'm not sure if the suggestion 
is to go back to the previous version of 
the patch or to go for a different approach. It also looks like this issue will 
be flaggin up on the public CI, following the
changes Arek has said will be made to the dmesg style.

-Sujaritha
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Sujaritha


On 3/6/19 1:08 AM, Chris Wilson wrote:

Quoting Michal Wajdeczko (2019-03-06 09:01:09)

On Wed, 06 Mar 2019 09:45:17 +0100, Chris Wilson

Now doing an if (i915_error_injected() && !err) err = -EINVAL; makes
sense to catch places where we've eaten that error and so breaking the
test.

This will not work today, as at least in one place - i915_gem_init - we
inject -EIO which later we try to replace with success. And for that case
we may rather want to add

I think we may want to move that particular test to one side, something
along the lines of i915.disable_gpu=1 ?
-Chris


Just to clarify on how to proceed with the fix. I'm not sure if the

suggestion is to go back to the previous version of the fix or to

go for a different approach. Also Arek has said in the internal

mailing list that there are some changes being made to the

-piglit-style-dmesg, so looks like this issue will be flagging up in

the public CI as well.

-Sujaritha

___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Chris Wilson
Quoting Michal Wajdeczko (2019-03-06 09:01:09)
> On Wed, 06 Mar 2019 09:45:17 +0100, Chris Wilson  
> > Now doing an if (i915_error_injected() && !err) err = -EINVAL; makes
> > sense to catch places where we've eaten that error and so breaking the
> > test.
> 
> This will not work today, as at least in one place - i915_gem_init - we
> inject -EIO which later we try to replace with success. And for that case
> we may rather want to add

I think we may want to move that particular test to one side, something
along the lines of i915.disable_gpu=1 ?
-Chris
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Michal Wajdeczko
On Wed, 06 Mar 2019 09:45:17 +0100, Chris Wilson  
 wrote:



Quoting Michal Wajdeczko (2019-03-06 08:41:20)

On Wed, 06 Mar 2019 01:30:11 +0100, Sujaritha Sundaresan
 wrote:

> Replacing the -E2BIG error code return for WOPCM
> initialization with -ENODEV. This will prevent the pci from

s/pci/CI ?

> picking this up as a warning during fault injection testing.
>
> v2: change the final return code in i915_pci_probe() to ENODEV
> instead of the specific wopcm change. - Daniele

s/wopcm/WOPCM

>
> Cc: Daniele Ceraolo Spurio 
> Cc: Michal Wajdeczko 
> Signed-off-by: Sujaritha Sundaresan 
> ---
>  drivers/gpu/drm/i915/i915_pci.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/i915/i915_pci.c
> b/drivers/gpu/drm/i915/i915_pci.c
> index c42c5ccf38fe..f962b5c0b3c1 100644
> --- a/drivers/gpu/drm/i915/i915_pci.c
> +++ b/drivers/gpu/drm/i915/i915_pci.c
> @@ -778,7 +778,7 @@ static int i915_pci_probe(struct pci_dev *pdev,
> const struct pci_device_id *ent)
>   err = i915_driver_load(pdev, ent);
>   if (err)
> - return err;
> + return i915_error_injected() ? -ENODEV : err;
>   if (i915_inject_load_failure()) {
>   i915_pci_remove(pdev);

Hmm, since this is quite big deviation from original approach, maybe
you should sent it as new patch (with updated title and commit message)
as now this covers all injected errors (not just E2BIG from WOPCM)


And I'm not sure we want to do that. When we do failure injection, we
pick the errno and part of the test is making sure that gets reported
unadulterated.

Now doing an if (i915_error_injected() && !err) err = -EINVAL; makes
sense to catch places where we've eaten that error and so breaking the
test.


This will not work today, as at least in one place - i915_gem_init - we
inject -EIO which later we try to replace with success. And for that case
we may rather want to add

if (i915_error_injected() && !err)
i915_pci_remove(pdev);
return -ENODEV;
}

~Michal
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Chris Wilson
Quoting Michal Wajdeczko (2019-03-06 08:41:20)
> On Wed, 06 Mar 2019 01:30:11 +0100, Sujaritha Sundaresan  
>  wrote:
> 
> > Replacing the -E2BIG error code return for WOPCM
> > initialization with -ENODEV. This will prevent the pci from
> 
> s/pci/CI ?
> 
> > picking this up as a warning during fault injection testing.
> >
> > v2: change the final return code in i915_pci_probe() to ENODEV
> > instead of the specific wopcm change. - Daniele
> 
> s/wopcm/WOPCM
> 
> >
> > Cc: Daniele Ceraolo Spurio 
> > Cc: Michal Wajdeczko 
> > Signed-off-by: Sujaritha Sundaresan 
> > ---
> >  drivers/gpu/drm/i915/i915_pci.c | 2 +-
> >  1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/drivers/gpu/drm/i915/i915_pci.c  
> > b/drivers/gpu/drm/i915/i915_pci.c
> > index c42c5ccf38fe..f962b5c0b3c1 100644
> > --- a/drivers/gpu/drm/i915/i915_pci.c
> > +++ b/drivers/gpu/drm/i915/i915_pci.c
> > @@ -778,7 +778,7 @@ static int i915_pci_probe(struct pci_dev *pdev,  
> > const struct pci_device_id *ent)
> >   err = i915_driver_load(pdev, ent);
> >   if (err)
> > - return err;
> > + return i915_error_injected() ? -ENODEV : err;
> >   if (i915_inject_load_failure()) {
> >   i915_pci_remove(pdev);
> 
> Hmm, since this is quite big deviation from original approach, maybe
> you should sent it as new patch (with updated title and commit message)
> as now this covers all injected errors (not just E2BIG from WOPCM)

And I'm not sure we want to do that. When we do failure injection, we
pick the errno and part of the test is making sure that gets reported
unadulterated.

Now doing an if (i915_error_injected() && !err) err = -EINVAL; makes
sense to catch places where we've eaten that error and so breaking the
test.
-Chris
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

Re: [Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-06 Thread Michal Wajdeczko
On Wed, 06 Mar 2019 01:30:11 +0100, Sujaritha Sundaresan  
 wrote:



Replacing the -E2BIG error code return for WOPCM
initialization with -ENODEV. This will prevent the pci from


s/pci/CI ?


picking this up as a warning during fault injection testing.

v2: change the final return code in i915_pci_probe() to ENODEV
instead of the specific wopcm change. - Daniele


s/wopcm/WOPCM



Cc: Daniele Ceraolo Spurio 
Cc: Michal Wajdeczko 
Signed-off-by: Sujaritha Sundaresan 
---
 drivers/gpu/drm/i915/i915_pci.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/gpu/drm/i915/i915_pci.c  
b/drivers/gpu/drm/i915/i915_pci.c

index c42c5ccf38fe..f962b5c0b3c1 100644
--- a/drivers/gpu/drm/i915/i915_pci.c
+++ b/drivers/gpu/drm/i915/i915_pci.c
@@ -778,7 +778,7 @@ static int i915_pci_probe(struct pci_dev *pdev,  
const struct pci_device_id *ent)

err = i915_driver_load(pdev, ent);
if (err)
-   return err;
+   return i915_error_injected() ? -ENODEV : err;
if (i915_inject_load_failure()) {
i915_pci_remove(pdev);


Hmm, since this is quite big deviation from original approach, maybe
you should sent it as new patch (with updated title and commit message)
as now this covers all injected errors (not just E2BIG from WOPCM)

Anyway, with typos fixed this is:

Reviewed-by: Michal Wajdeczko 

Thanks,
~Michal
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

[Intel-gfx] [PATCH v2] drm/i915/guc: Fixing error code for WOPCM initialization

2019-03-05 Thread Sujaritha Sundaresan
Replacing the -E2BIG error code return for WOPCM
initialization with -ENODEV. This will prevent the pci from
picking this up as a warning during fault injection testing.

v2: change the final return code in i915_pci_probe() to ENODEV
instead of the specific wopcm change. - Daniele

Cc: Daniele Ceraolo Spurio 
Cc: Michal Wajdeczko 
Signed-off-by: Sujaritha Sundaresan 
---
 drivers/gpu/drm/i915/i915_pci.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/gpu/drm/i915/i915_pci.c b/drivers/gpu/drm/i915/i915_pci.c
index c42c5ccf38fe..f962b5c0b3c1 100644
--- a/drivers/gpu/drm/i915/i915_pci.c
+++ b/drivers/gpu/drm/i915/i915_pci.c
@@ -778,7 +778,7 @@ static int i915_pci_probe(struct pci_dev *pdev, const 
struct pci_device_id *ent)
 
err = i915_driver_load(pdev, ent);
if (err)
-   return err;
+   return i915_error_injected() ? -ENODEV : err;
 
if (i915_inject_load_failure()) {
i915_pci_remove(pdev);
-- 
2.20.1

___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx