Re: [Intel-gfx] Regression on linux-next (next-20231013)

2023-10-26 Thread Christian Brauner
On Thu, Oct 26, 2023 at 10:14:23AM +, Borah, Chaitanya Kumar wrote:
> Hello Christian,
> 
> > -Original Message-
> > From: Borah, Chaitanya Kumar
> > Sent: Wednesday, October 25, 2023 7:15 PM
> > To: Christian Brauner 
> > Cc: intel-gfx@lists.freedesktop.org; Kurmi, Suresh Kumar
> > ; Saarinen, Jani 
> > Subject: RE: Regression on linux-next (next-20231013)
> > 
> > Hello Christian,
> > 
> > > -Original Message-
> > > From: Christian Brauner 
> > > Sent: Wednesday, October 25, 2023 1:02 PM
> > > To: Borah, Chaitanya Kumar 
> > > Cc: intel-gfx@lists.freedesktop.org; Kurmi, Suresh Kumar
> > > ; Saarinen, Jani
> > > 
> > > Subject: Re: Regression on linux-next (next-20231013)
> > >
> > > On Wed, Oct 25, 2023 at 06:32:01AM +, Borah, Chaitanya Kumar wrote:
> > > >  Hello Christian,
> > > >
> > > >  Hope you are doing well. I am Chaitanya from the linux graphics
> > > > team in
> > > Intel.
> > > >
> > > >  This mail is regarding a regression we are seeing in our CI runs[1]
> > > > on linux-next  repository.
> > >
> > > Any chance I can reproduce this locally?
> > 
> > Thank you for your response.
> > 
> > I see that you have already floated a patch [1] to fix the issue. We will 
> > test it
> > and get back to you ASAP.
> 
> The solution is working for us.
> 
> Also, linux-next turned green.

Great! That already has the final version of the patch.

> http://gfx-ci.igk.intel.com/tree/linux-next/igt@i915_selftest@l...@mman.html
> 
> Thank you.

Thanks for the report!


Re: [Intel-gfx] Regression on linux-next (next-20231013)

2023-10-26 Thread Borah, Chaitanya Kumar
Hello Christian,

> -Original Message-
> From: Borah, Chaitanya Kumar
> Sent: Wednesday, October 25, 2023 7:15 PM
> To: Christian Brauner 
> Cc: intel-gfx@lists.freedesktop.org; Kurmi, Suresh Kumar
> ; Saarinen, Jani 
> Subject: RE: Regression on linux-next (next-20231013)
> 
> Hello Christian,
> 
> > -Original Message-
> > From: Christian Brauner 
> > Sent: Wednesday, October 25, 2023 1:02 PM
> > To: Borah, Chaitanya Kumar 
> > Cc: intel-gfx@lists.freedesktop.org; Kurmi, Suresh Kumar
> > ; Saarinen, Jani
> > 
> > Subject: Re: Regression on linux-next (next-20231013)
> >
> > On Wed, Oct 25, 2023 at 06:32:01AM +, Borah, Chaitanya Kumar wrote:
> > >  Hello Christian,
> > >
> > >  Hope you are doing well. I am Chaitanya from the linux graphics
> > > team in
> > Intel.
> > >
> > >  This mail is regarding a regression we are seeing in our CI runs[1]
> > > on linux-next  repository.
> >
> > Any chance I can reproduce this locally?
> 
> Thank you for your response.
> 
> I see that you have already floated a patch [1] to fix the issue. We will 
> test it
> and get back to you ASAP.

The solution is working for us.

Also, linux-next turned green.

http://gfx-ci.igk.intel.com/tree/linux-next/igt@i915_selftest@l...@mman.html

Thank you.

Regards

Chaitanya

> 
> In case, you still need it.
> 
> If you happen to have a device with intel CPU on it (we are seeing it in
> machines as old as Gen3[2]), you can follow the below steps.
> 
> 1. Get the latest drm-tip from https://cgit.freedesktop.org/drm-tip/ and 
> install
> it on the machine
> 
> 2. Get IGT suite from https://gitlab.freedesktop.org/drm/igt-gpu-tools
> 
> 3. Build the test suite
> You can use the instructions in the README.md file for building the suite.
> 
> We use ubuntu and I generally do the following
> 
>   a) Make sure the packages listed in Dockerfile.build-debian-minimal
> and Dockerfile.build-debian are installed.
>   b) meson build && ninja -C build
> 
> 4. If everything goes fine, there should be a "build" folder created within 
> the
> base folder of your repository
> Then run the test using the following command.
> 
>   sudo build/tests/i915_selftest --run-subtest live
> 
> Regards
> 
> Chaitanya
> 
> 
> [1] https://lore.kernel.org/intel-gfx/20231025-formfrage-watscheln-
> 84526cd3bd7d@brauner/
> [2] http://gfx-ci.igk.intel.com/tree/linux-
> next/igt@i915_selftest@l...@mman.html



Re: [Intel-gfx] Regression on linux-next (next-20231013)

2023-10-25 Thread Borah, Chaitanya Kumar
Hello Christian,

> -Original Message-
> From: Christian Brauner 
> Sent: Wednesday, October 25, 2023 1:02 PM
> To: Borah, Chaitanya Kumar 
> Cc: intel-gfx@lists.freedesktop.org; Kurmi, Suresh Kumar
> ; Saarinen, Jani 
> Subject: Re: Regression on linux-next (next-20231013)
> 
> On Wed, Oct 25, 2023 at 06:32:01AM +, Borah, Chaitanya Kumar wrote:
> >  Hello Christian,
> >
> >  Hope you are doing well. I am Chaitanya from the linux graphics team in
> Intel.
> >
> >  This mail is regarding a regression we are seeing in our CI runs[1]
> > on linux-next  repository.
> 
> Any chance I can reproduce this locally?

Thank you for your response.

I see that you have already floated a patch [1] to fix the issue. We will test 
it and get back to you ASAP.

In case, you still need it.

If you happen to have a device with intel CPU on it (we are seeing it in 
machines as old as Gen3[2]), you can follow the below steps.

1. Get the latest drm-tip from https://cgit.freedesktop.org/drm-tip/ and 
install it on the machine

2. Get IGT suite from https://gitlab.freedesktop.org/drm/igt-gpu-tools

3. Build the test suite
You can use the instructions in the README.md file for building the suite.

We use ubuntu and I generally do the following

a) Make sure the packages listed in Dockerfile.build-debian-minimal and 
Dockerfile.build-debian are installed.
b) meson build && ninja -C build

4. If everything goes fine, there should be a "build" folder created within the 
base folder of your repository
Then run the test using the following command.

sudo build/tests/i915_selftest --run-subtest live

Regards

Chaitanya


[1] 
https://lore.kernel.org/intel-gfx/20231025-formfrage-watscheln-84526cd3bd7d@brauner/
[2] http://gfx-ci.igk.intel.com/tree/linux-next/igt@i915_selftest@l...@mman.html



Re: [Intel-gfx] Regression on linux-next (next-20231013)

2023-10-25 Thread Christian Brauner
On Wed, Oct 25, 2023 at 06:32:01AM +, Borah, Chaitanya Kumar wrote:
>  Hello Christian,
>  
>  Hope you are doing well. I am Chaitanya from the linux graphics team in 
> Intel.
>  
>  This mail is regarding a regression we are seeing in our CI runs[1] on 
> linux-next
>  repository.

Any chance I can reproduce this locally?


[Intel-gfx] Regression on linux-next (next-20231013)

2023-10-25 Thread Borah, Chaitanya Kumar
 Hello Christian,
 
 Hope you are doing well. I am Chaitanya from the linux graphics team in Intel.
 
 This mail is regarding a regression we are seeing in our CI runs[1] on 
linux-next
 repository.
 
 Since the version next-20231013 [2], we are seeing the following RCU splat
 ```
 <3> [511.395679] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
<3> [511.395716] rcu:   Tasks blocked on level-1 rcu_node (CPUs 0-9): P6238
<3> [511.395934] rcu:   (detected by 16, t=65002 jiffies, g=123977, q=439 
ncpus=20)
<6> [511.395944] task:i915_selftest   state:R  running task stack:10568 
pid:6238  tgid:6238  ppid:1001   flags:0x4002
 
`
 
 Details log can be found in [3].
 
 After bisecting the tree, the following patch [4] seems to be the first "bad" 
commit
 
 
`
commit 3a77344f50d847d51abb8629a6f181cb21684157
Author: Christian Brauner 
Date:   Fri Sep 29 08:45:59 2023 +0200

file: convert to SLAB_TYPESAFE_BY_RCU
`
 
 We also verified that if we reset the tree to the parent commit the issue is 
not seen.
 
Could you please check how the commit results in the issue?

Thank you.

Regards

Chaitanya

 [1] https://intel-gfx-ci.01.org/tree/linux-next/combined-alt.html?
 [2] 
https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/?h=next-20231013
 [3] 
https://intel-gfx-ci.01.org/tree/linux-next/next-20231013/bat-dg2-11/igt@i915_selftest@l...@mman.html
 [4] 
https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/?h=next-20231013=3a77344f50d847d51abb8629a6f181cb21684157