Re: [PATCH] drm/amdgpu: add the checking to avoid NULL pointer dereference

2018-11-30 Thread Sharma, Deepak
I found a way to reproduce the issue consistently. After applying David's patch "0001-drm-amdgpu-fix-signaled-fence-isn-t-handled" with a minor change -static struct dma_fence *drm_syncobj_get_stub_fence(void) +struct dma_fence *drm_syncobj_get_stub_fence(void) I was able to avoid the kernel panic due to NUL

Re: [Intel-gfx] [PATCH RFC 2/5] cgroup: Add mechanism to register vendor specific DRM devices

2018-11-30 Thread Matt Roper
On Wed, Nov 28, 2018 at 07:46:06PM +, Ho, Kenny wrote: > > On Wed, Nov 28, 2018 at 4:14 AM Joonas Lahtinen > wrote: > > So we can only choose the lowest common denominator, right? > > > > Any core count out of total core count should translate nicely into a > > fraction, so what would be the

[PATCH v3 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Andrey Grodzovsky
XGMI hive has some resources allocated on device init which needs to be deallocated when the device is unregistered. v2: Remove creation of dedicated wq for XGMI hive reset. v3: Use the gmc.xgmi.supported flag Signed-off-by: Andrey Grodzovsky --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 +

[PATCH v3 1/3] drm/amdgpu/psp: Update waiting in psp mode1 reset.

2018-11-30 Thread Andrey Grodzovsky
No point in using mdelay unless running from interrupt context (which we are not). This is a busy wait which will block the CPU for the entirety of the wait time. Also, reduce the wait time to 500ms as is done in the reference code, because 1s might cause PSP FW timeout issues during XGMI hive reset. Signed-off-by

[PATCH v3 3/3] drm/amdgpu: Implement concurrent asic reset for XGMI.

2018-11-30 Thread Andrey Grodzovsky
Use per hive wq to concurrently send reset commands to all nodes in the hive. v2: Switch to system_highpri_wq after dropping dedicated queue. Fix non XGMI code path KASAN error. Stop the hive reset for each node loop if there is a reset failure on any of the nodes. Signed-off-by: Andrey Grodzovs

RE: [PATCH 03/11] drm/amdgpu: remove VM fault_credit handling

2018-11-30 Thread Zeng, Oak
The credit was used to limit the number of VM (retry) faults processed in each VM. If this is removed, it is possible to get flooded by an interrupt storm. Even though you claimed in the commit message that printk_ratelimit is a better solution, I didn't see you implement it in this patch. Are you pla

RE: [PATCH 02/11] drm/amdgpu: send IVs to the KFD only after processing them v2

2018-11-30 Thread Zeng, Oak
See comment [Oak] Thanks, Oak -Original Message- From: amd-gfx On Behalf Of Christian König Sent: Friday, November 30, 2018 7:36 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH 02/11] drm/amdgpu: send IVs to the KFD only after processing them v2 This allows us to filter out VM fa

Re: [PATCH] drm/amdgpu: add a xgmi supported flag

2018-11-30 Thread Grodzovsky, Andrey
Reviewed-by: Andrey Grodzovsky Andrey On 11/30/2018 03:36 PM, Alex Deucher wrote: > On Fri, Nov 30, 2018 at 3:34 PM Grodzovsky, Andrey > wrote: >> >> >> On 11/30/2018 03:30 PM, Alex Deucher wrote: >>> Use this to track whether an asic supports xgmi rather than >>> checking the asic type everyw

Re: [PATCH] drm/amdgpu: add a xgmi supported flag

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 3:34 PM Grodzovsky, Andrey wrote: > > > > On 11/30/2018 03:30 PM, Alex Deucher wrote: > > Use this to track whether an asic supports xgmi rather than > > checking the asic type everywhere. > > > > Signed-off-by: Alex Deucher > > --- > > drivers/gpu/drm/amd/amdgpu/amdgpu_

Re: [PATCH] drm/amdgpu: add a xgmi supported flag

2018-11-30 Thread Grodzovsky, Andrey
On 11/30/2018 03:30 PM, Alex Deucher wrote: > Use this to track whether an asic supports xgmi rather than > checking the asic type everywhere. > > Signed-off-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 + > drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c | 4 ++-- > driver

[PATCH] drm/amdgpu: add a xgmi supported flag

2018-11-30 Thread Alex Deucher
Use this to track whether an asic supports xgmi rather than checking the asic type everywhere. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c| 2 +- drivers/gpu/drm/amd/a

Re: [PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 3:12 PM Grodzovsky, Andrey wrote: > > > On 11/30/2018 03:08 PM, Alex Deucher wrote: > > On Fri, Nov 30, 2018 at 3:06 PM Grodzovsky, Andrey > > wrote: > >> > >> > >> On 11/30/2018 02:49 PM, Alex Deucher wrote: > >>> On Fri, Nov 30, 2018 at 1:17 PM Andrey Grodzovsky > >>> w

Re: [PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Grodzovsky, Andrey
On 11/30/2018 03:08 PM, Alex Deucher wrote: > On Fri, Nov 30, 2018 at 3:06 PM Grodzovsky, Andrey > wrote: >> >> >> On 11/30/2018 02:49 PM, Alex Deucher wrote: >>> On Fri, Nov 30, 2018 at 1:17 PM Andrey Grodzovsky >>> wrote: XGMI hive has some resources allocated on device init which nee

Re: [PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 3:06 PM Grodzovsky, Andrey wrote: > > > > On 11/30/2018 02:49 PM, Alex Deucher wrote: > > On Fri, Nov 30, 2018 at 1:17 PM Andrey Grodzovsky > > wrote: > >> XGMI hive has some resources allocated on device init which > >> needs to be deallocated when the device is unregister

Re: [PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Grodzovsky, Andrey
On 11/30/2018 02:49 PM, Alex Deucher wrote: > On Fri, Nov 30, 2018 at 1:17 PM Andrey Grodzovsky > wrote: >> XGMI hive has some resources allocated on device init which >> needs to be deallocated when the device is unregistered. >> >> v2: Remove creation of dedicated wq for XGMI hive reset. >> >>

Re: [PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 1:17 PM Andrey Grodzovsky wrote: > > XGMI hive has some resources allocated on device init which > needs to be deallocated when the device is unregistered. > > v2: Remove creation of dedicated wq for XGMI hive reset. > > Signed-off-by: Andrey Grodzovsky > --- > drivers/gpu

[pull] amdgpu, amdkfd, ttm, scheduler, radeon drm-next-4.21

2018-11-30 Thread Alex Deucher
Hi Dave, More new features for 4.21: amdgpu and amdkfd: - Freesync support - ABM support in DC - KFD support for vega12 and polaris12 - Add sdma paging queue support for vega - Use ACPI to query backlight range on supported platforms - Clean up doorbell handling - KFD fix for pasid handling under

[PATCH v2 3/3] drm/amdgpu: Implement concurrent asic reset for XGMI.

2018-11-30 Thread Andrey Grodzovsky
Use per hive wq to concurrently send reset commands to all nodes in the hive. v2: Switch to system_highpri_wq after dropping dedicated queue. Fix non XGMI code path KASAN error. Stop the hive reset for each node loop if there is a reset failure on any of the nodes. Signed-off-by: Andrey Grodzovs

[PATCH v2 1/3] drm/amdgpu/psp: Update waiting in psp mode1 reset.

2018-11-30 Thread Andrey Grodzovsky
No point in using mdelay unless running from interrupt context (which we are not). This is a busy wait which will block the CPU for the entirety of the wait time. Also, reduce the wait time to 500ms as is done in the reference code, because it might cause PSP FW timeout issues during XGMI hive reset. Signed-o

[PATCH v2 2/3] drm/amdgpu: Handle xgmi device removal.

2018-11-30 Thread Andrey Grodzovsky
XGMI hive has some resources allocated on device init which needs to be deallocated when the device is unregistered. v2: Remove creation of dedicated wq for XGMI hive reset. Signed-off-by: Andrey Grodzovsky --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 +++ drivers/gpu/drm/amd/amdgpu/amdgp

RE: [PATCH 02/11] drm/amdgpu: send IVs to the KFD only after processing them v2

2018-11-30 Thread Kuehling, Felix
Won't this break VM fault handling in KFD? I don't see a way with the current code that you can leave some VM faults for KFD to process. If we could consider VM faults with VMIDs 8-15 as not handled in amdgpu and leave them for KFD to process, then this could work. As far as I can tell, the onl

Re: [PATCH 1/2] drm/amdgpu: Handle xgmi device removal and add reset wq.

2018-11-30 Thread Grodzovsky, Andrey
On 11/30/2018 10:53 AM, Koenig, Christian wrote: > On 30.11.18 at 16:14, Grodzovsky, Andrey wrote: >> On 11/30/2018 04:03 AM, Christian König wrote: >>> On 29.11.18 at 21:36, Andrey Grodzovsky wrote: XGMI hive has some resources allocated on device init which needs to be deallocated wh

Re: [PATCH 11/11] drm/amdgpu: disable IH ring 1 & 2 WPTR overflow on Vega10

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > That should add back pressure on the client. > > Signed-off-by: Christian König Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/vega10_ih.c | 4 > 1 file changed, 4 insertions(+) > > diff --git a/drivers/gpu/drm/amd/amd

Re: [PATCH 10/11] drm/amdgpu: add support for self irq on Vega10

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > This finally enables processing of ring 1 & 2. > > Signed-off-by: Christian König Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/vega10_ih.c | 68 -- > 1 file changed, 63 insertions(+), 5 deletion

Re: [PATCH 09/11] drm/amdgpu: add support for processing IH ring 1 & 2

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > Previously we only added the ring buffer memory, now add the handling as > well. > > Signed-off-by: Christian König Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 33 + > drivers/gpu/d

Re: [PATCH 05/11] drm/amdgpu: add IH ring to ih_get_wptr/ih_set_rptr v2

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > Let's start to support multiple rings. > > v2: decode IV is needed as well > > Signed-off-by: Christian König > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_ih.c | 6 +-- > drivers/gpu/drm/amd/amdgpu/amdgpu_ih.h | 13 +++--- > drivers/gpu/

Re: [PATCH 04/11] drm/amdgpu: move IV prescreening into the GMC code

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > The GMC/VM subsystem is causing the faults, so move the handling here as > well. > > Signed-off-by: Christian König Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_ih.h | 2 - > drivers/gpu/drm/amd/amdgpu/amdgpu_irq

Re: [PATCH 07/11] drm/amdgpu: enable IH ring 1 and ring 2 v2

2018-11-30 Thread Christian König
On 30.11.18 at 17:01, Alex Deucher wrote: On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: The entries are ignored for now, but it at least stops crashing the hardware when somebody tries to push something to the other IH rings. v2: limit ring size, add TODO comment Signed-off-by: Chri

Re: [PATCH 03/11] drm/amdgpu: remove VM fault_credit handling

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > printk_ratelimit() is much better suited to limit the number of reported > VM faults. > > Signed-off-by: Christian König Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 37 - > drivers/

Re: [PATCH 02/11] drm/amdgpu: send IVs to the KFD only after processing them v2

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > This allows us to filter out VM faults in the GMC code. > > v2: don't filter out all faults > > Signed-off-by: Christian König Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 29 +++-- > 1

Re: [PATCH 08/11] drm/amdgpu: add the IH to the IV trace

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > To distinguish on which IH ring an IV was found. > > Signed-off-by: Christian König Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 4 ++-- > drivers/gpu/drm/amd/amdgpu/amdgpu_trace.h | 11 +++ > 2

Re: [PATCH 07/11] drm/amdgpu: enable IH ring 1 and ring 2 v2

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > The entries are ignored for now, but it at least stops crashing the > hardware when somebody tries to push something to the other IH rings. > > v2: limit ring size, add TODO comment > > Signed-off-by: Christian König We may want to guard

Re: [PATCH 1/2] drm/amdgpu: Handle xgmi device removal and add reset wq.

2018-11-30 Thread Koenig, Christian
On 30.11.18 at 16:14, Grodzovsky, Andrey wrote: > > On 11/30/2018 04:03 AM, Christian König wrote: >> On 29.11.18 at 21:36, Andrey Grodzovsky wrote: >>> XGMI hive has some resources allocated on device init which >>> needs to be deallocated when the device is unregistered. >>> >>> Add per hive wq

Re: [PATCH 06/11] drm/amdgpu: simplify IH programming

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > Calculate all the addresses and pointers in amdgpu_ih.c > > Signed-off-by: Christian König Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_ih.c | 34 +++ > drivers/gpu/drm/amd/amdgpu/amdgpu_ih.

Re: [PATCH 01/11] drm/amdgpu: add missing error handling

2018-11-30 Thread Alex Deucher
On Fri, Nov 30, 2018 at 7:36 AM Christian König wrote: > > We ignored the return code here. > > Signed-off-by: Christian König Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 3 +++ > 1 file changed, 3 insertions(+) > > diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v9_

Re: [PATCH] drm/amdgpu: Fix num_doorbell calculation issue

2018-11-30 Thread Deucher, Alexander
Reviewed-by: Alex Deucher From: amd-gfx on behalf of Oak Zeng Sent: Friday, November 30, 2018 10:39:21 AM To: amd-gfx@lists.freedesktop.org Cc: Yang, Philip; Zeng, Oak; Yin, Tianci (Rico) Subject: [PATCH] drm/amdgpu: Fix num_doorbell calculation issue When pag

[PATCH] drm/amdgpu: Fix num_doorbell calculation issue

2018-11-30 Thread Oak Zeng
When the paging queue is enabled, it uses the second page of doorbells. The AMDGPU_DOORBELL64_MAX_ASSIGNMENT definition assumes all the kernel doorbells are in the first page. So with the paging queue enabled, the total kernel doorbell range should be the original num_doorbell plus one page (0x400 in dword), not
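
A minimal sketch of the arithmetic described in the message above, written as standalone C rather than driver code. The helper name kernel_doorbell_range() and the constant DOORBELL_PAGE_DWORDS are hypothetical and are not taken from the patch; the only value from the message is the page size of 0x400 dwords, and the driver's actual change may differ.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical constant: one doorbell page in dwords (0x400, as stated in the message). */
#define DOORBELL_PAGE_DWORDS 0x400u

/*
 * Hypothetical helper, not the driver's function: returns the total kernel
 * doorbell range in dwords. When the paging queue is enabled it lives in the
 * second doorbell page, so one extra page is added to the original count.
 */
static uint32_t kernel_doorbell_range(uint32_t num_doorbells, bool paging_queue_enabled)
{
    if (paging_queue_enabled)
        return num_doorbells + DOORBELL_PAGE_DWORDS;
    return num_doorbells;
}
```

With the paging queue disabled the original count is returned unchanged, so the extra page is only reserved on configurations that actually use it.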

Re: [PATCH] drm/amd/display: Fix NULL ptr deref for commit_planes_to_stream

2018-11-30 Thread Wentland, Harry
On 2018-11-30 10:09 a.m., Nicholas Kazlauskas wrote: > [Why] > With scaling, underscan and abm changes we can end up calling > commit_planes_to_stream in commit_tail. This call uses dm_state->context > which can be NULL if the commit was a fast update. > > [How] > Use dc_state instead since that c

Re: [PATCH] drm/amd/display: Fix NULL ptr deref for commit_planes_to_stream

2018-11-30 Thread Li, Sun peng (Leo)
On 2018-11-30 10:13 a.m., Deucher, Alexander wrote: > Acked-by: Alex Deucher Reviewed-by: Leo Li > > > *From:* amd-gfx on behalf of > Nicholas Kazlauskas > *Sent:* Friday, November 30, 2018 10:09:28 AM > *To:* amd-gf

Re: [PATCH 1/2] drm/amdgpu: Handle xgmi device removal and add reset wq.

2018-11-30 Thread Grodzovsky, Andrey
On 11/30/2018 04:03 AM, Christian König wrote: > On 29.11.18 at 21:36, Andrey Grodzovsky wrote: >> XGMI hive has some resources allocated on device init which >> needs to be deallocated when the device is unregistered. >> >> Add per hive wq to allow all the nodes in hive to run resets >> concuren

Re: [PATCH] drm/amd/display: Fix NULL ptr deref for commit_planes_to_stream

2018-11-30 Thread Deucher, Alexander
Acked-by: Alex Deucher From: amd-gfx on behalf of Nicholas Kazlauskas Sent: Friday, November 30, 2018 10:09:28 AM To: amd-gfx@lists.freedesktop.org Cc: Li, Sun peng (Leo); Wentland, Harry; Kazlauskas, Nicholas Subject: [PATCH] drm/amd/display: Fix NULL ptr dere

[PATCH] drm/amd/display: Fix NULL ptr deref for commit_planes_to_stream

2018-11-30 Thread Nicholas Kazlauskas
[Why] With scaling, underscan and abm changes we can end up calling commit_planes_to_stream in commit_tail. This call uses dm_state->context which can be NULL if the commit was a fast update. [How] Use dc_state instead since that can't be NULL unless the system ran out of memory. Bugzilla: https:

Re: [PATCH] drm/amdgpu: remove amdgpu_bo_backup_to_shadow

2018-11-30 Thread Deucher, Alexander
Reviewed-by: Alex Deucher From: amd-gfx on behalf of Christian König Sent: Friday, November 30, 2018 7:45:17 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH] drm/amdgpu: remove amdgpu_bo_backup_to_shadow It is unused. Signed-off-by: Christian König ---

[PATCH 04/16 v2] drm/amd/display: Add tracing to dc

2018-11-30 Thread David Francis
[Why] Tracing is a useful and cheap debug functionality [How] This creates a new trace system amdgpu_dm, currently with three trace events amdgpu_dc_rreg and amdgpu_dc_wreg report the address and value of any dc register reads and writes amdgpu_dc_performance requires at least one of those two t

[PATCH] drm/amdgpu: remove amdgpu_bo_backup_to_shadow

2018-11-30 Thread Christian König
It is unused. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 47 -- drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 5 --- 2 files changed, 52 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/am

[PATCH 05/11] drm/amdgpu: add IH ring to ih_get_wptr/ih_set_rptr v2

2018-11-30 Thread Christian König
Let's start to support multiple rings. v2: decode IV is needed as well Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ih.c | 6 +-- drivers/gpu/drm/amd/amdgpu/amdgpu_ih.h | 13 +++--- drivers/gpu/drm/amd/amdgpu/cik_ih.c | 29 +++-- drivers/gpu/drm/amd/amdgpu

[PATCH 09/11] drm/amdgpu: add support for processing IH ring 1 & 2

2018-11-30 Thread Christian König
Previously we only added the ring buffer memory, now add the handling as well. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 33 + drivers/gpu/drm/amd/amdgpu/amdgpu_irq.h | 4 ++- 2 files changed, 36 insertions(+), 1 deletion(-) diff --git

[PATCH 06/11] drm/amdgpu: simplify IH programming

2018-11-30 Thread Christian König
Calculate all the addresses and pointers in amdgpu_ih.c Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ih.c | 34 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ih.h | 23 +--- drivers/gpu/drm/amd/amdgpu/cik_ih.c | 9 +++ drivers/gpu/drm/am

[PATCH 11/11] drm/amdgpu: disable IH ring 1 & 2 WPTR overflow on Vega10

2018-11-30 Thread Christian König
That should add back pressure on the client. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/vega10_ih.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/vega10_ih.c b/drivers/gpu/drm/amd/amdgpu/vega10_ih.c index f5c5ea628fdf..dd7f52f08fd7 100644

[PATCH 07/11] drm/amdgpu: enable IH ring 1 and ring 2 v2

2018-11-30 Thread Christian König
The entries are ignored for now, but it at least stops crashing the hardware when somebody tries to push something to the other IH rings. v2: limit ring size, add TODO comment Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_irq.h | 4 +- drivers/gpu/drm/amd/amdgpu/vega10_

[PATCH 02/11] drm/amdgpu: send IVs to the KFD only after processing them v2

2018-11-30 Thread Christian König
This allows us to filter out VM faults in the GMC code. v2: don't filter out all faults Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 29 +++-- 1 file changed, 17 insertions(+), 12 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ir

[PATCH 10/11] drm/amdgpu: add support for self irq on Vega10

2018-11-30 Thread Christian König
This finally enables processing of ring 1 & 2. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/vega10_ih.c | 68 -- 1 file changed, 63 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/vega10_ih.c b/drivers/gpu/drm/amd/amdgpu/vega10_ih.

[PATCH 01/11] drm/amdgpu: add missing error handling

2018-11-30 Thread Christian König
We ignored the return code here. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 3 +++ 1 file changed, 3 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c index 3a4e5d8d5162..e329a23e1f99 100644 --- a/drivers/gp

[PATCH 04/11] drm/amdgpu: move IV prescreening into the GMC code

2018-11-30 Thread Christian König
The GMC/VM subsystem is causing the faults, so move the handling here as well. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ih.h | 2 - drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 4 -- drivers/gpu/drm/amd/amdgpu/cik_ih.c | 13 drivers/gpu/drm/amd/amdgpu/cz_ih.c

[PATCH 03/11] drm/amdgpu: remove VM fault_credit handling

2018-11-30 Thread Christian König
printk_ratelimit() is much better suited to limit the number of reported VM faults. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 37 - drivers/gpu/drm/amd/amdgpu/amdgpu_vm.h | 5 drivers/gpu/drm/amd/amdgpu/cik_ih.c | 18 +

[PATCH 08/11] drm/amdgpu: add the IH to the IV trace

2018-11-30 Thread Christian König
To distinguish on which IH ring an IV was found. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/amdgpu_trace.h | 11 +++ 2 files changed, 9 insertions(+), 6 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c

Re: [PATCH] drm/amdgpu: enlarge maximum waiting time of KIQ

2018-11-30 Thread Christian König
On 30.11.18 at 10:48, wentalou wrote: SWDEV-171843: KIQ in VF’s init delayed by another VF’s reset. late_init failed occasionally if overlapped with another VF’s reset. Enlarging MAX_KIQ_REG_TRY from 20 to 80 would fix this issue. Change-Id: I841774bdd9ebf125c5aa2046b1dcebd65e07 Signed-off-b

[PATCH] drm/amdgpu: enlarge maximum waiting time of KIQ

2018-11-30 Thread wentalou
SWDEV-171843: KIQ in VF’s init delayed by another VF’s reset. late_init failed occasionally if overlapped with another VF’s reset. Enlarging MAX_KIQ_REG_TRY from 20 to 80 would fix this issue. Change-Id: I841774bdd9ebf125c5aa2046b1dcebd65e07 Signed-off-by: wentalou --- drivers/gpu/drm/amd/amd

RE: [PATCH libdrm 4/5] wrap syncobj timeline query/wait APIs for amdgpu v3

2018-11-30 Thread Zhou, David(ChunMing)
> -Original Message- > From: Christian König > Sent: Friday, November 30, 2018 5:15 PM > To: Zhou, David(ChunMing) ; dri- > de...@lists.freedesktop.org; amd-gfx@lists.freedesktop.org > Subject: Re: [PATCH libdrm 4/5] wrap syncobj timeline query/wait APIs for > amdgpu v3 > [snip] > >> +d

Re: [PATCH libdrm 4/5] wrap syncobj timeline query/wait APIs for amdgpu v3

2018-11-30 Thread Christian König
On 30.11.18 at 08:35, zhoucm1 wrote: On 2018-11-28 22:50, Christian König wrote: From: Chunming Zhou v2: symbols are stored in lexical order. v3: drop export/import and extra query indirection Signed-off-by: Chunming Zhou Signed-off-by: Christian König --- amdgpu/amdgpu-symbol-check |

Re: [PATCH] drm/amdgpu: add VCN JPEG support amdgpu_ctx_num_entities

2018-11-30 Thread Christian König
On 30.11.18 at 03:00, Alex Deucher wrote: Looks like it was missed when setting support was added. Signed-off-by: Alex Deucher Reviewed-by: Christian König --- This is a legit bug fix. The rest of this series needs more work. drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c | 1 + 1 file chang

Re: [PATCH 1/2] drm/amdgpu: Handle xgmi device removal and add reset wq.

2018-11-30 Thread Christian König
On 29.11.18 at 21:36, Andrey Grodzovsky wrote: XGMI hive has some resources allocated on device init which needs to be deallocated when the device is unregistered. Add per hive wq to allow all the nodes in hive to run resets concurrently - this should speed up the total reset time to avoid breach