RE: [PATCH Review 2/2] drm/amdgpu: message smu to update bad channel info

2022-03-03 Thread Zhou1, Tao
[AMD Official Use Only] The series is: Reviewed-by: Tao Zhou > -Original Message- > From: Stanley.Yang > Sent: Friday, March 4, 2022 2:51 PM > To: amd-gfx@lists.freedesktop.org; Zhang, Hawking > ; Zhou1, Tao ; Joo, Maria > > Cc: Yang, Stanley > Subject: [PATCH Review 2/2]

[PATCH Review 2/2] drm/amdgpu: message smu to update bad channel info

2022-03-03 Thread Stanley . Yang
It should notice SMU to update bad channel info when detected uncorrectable error in UMC block Change-Id: I2dc8848affdb53e52891013953ae9383fff5f20f Signed-off-by: Stanley.Yang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 7 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +++

[PATCH Review 1/2] drm/amd/pm: add send bad channel info function

2022-03-03 Thread Stanley . Yang
support message SMU update bad channel info to update HBM bad channel info in OOB table Change-Id: I1e50ed8118f4c1aaefb04c040e59ae4918cdc295 Signed-off-by: Stanley.Yang --- drivers/gpu/drm/amd/pm/amdgpu_dpm.c | 12 ++ drivers/gpu/drm/amd/pm/inc/amdgpu_dpm.h | 1 +

Re: [RFC v4 02/11] drm/amdgpu: Move scheduler init to after XGMI is ready

2022-03-03 Thread Chen, JingWen
Thanks a lot Best Regards, JingWen Chen > On Mar 4, 2022, at 00:36, Grodzovsky, Andrey > wrote: > > I pushed all the changes including your patch. > > Andrey > > On 2022-03-02 22:16, Andrey Grodzovsky wrote: >> OK, i will do quick smoke test tomorrow and push all of it it then. >> >>

2022 X.Org Board of Directors Elections timeline extended, Request for nominations

2022-03-03 Thread Lyude Paul
We are seeking nominations for candidates for election to the X.org Foundation Board of Directors. However, as we presently do not have enough nominations to start the election - the decision has been made to extend the timeline by 2 weeks. Note this is a fairly regular part of the elections

Re: [PATCH][next] drm/amd/display: Fix Wstringop-overflow warnings in dc_link_dp.c

2022-03-03 Thread Gustavo A. R. Silva
On Thu, Mar 03, 2022 at 12:19:57PM -0600, Gustavo A. R. Silva wrote: > On Thu, Mar 03, 2022 at 09:43:28AM -0800, Kees Cook wrote: > > On Thu, Mar 03, 2022 at 11:25:03AM -0600, Gustavo A. R. Silva wrote: > > > Fix the following Wstringop-overflow warnings when building with GCC-11: > > > > > >

Re: [PATCH 08/10] drm/amdgpu: initialize the vmid_wait with the stub fence

2022-03-03 Thread Andrey Grodzovsky
Reviewed-by: Andrey Grodzovsky Andrey On 2022-03-03 03:23, Christian König wrote: This way we don't need to check for NULL any more. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ids.c | 2 +- drivers/gpu/drm/amd/amdgpu/amdgpu_ring.c | 1 + 2 files changed, 2

Re: [PATCH 06/10] drm/amdgpu: properly imbed the IBs into the job

2022-03-03 Thread Andrey Grodzovsky
Reviewed-by: Andrey Grodzovsky Andrey On 2022-03-03 03:23, Christian König wrote: We now have standard macros for that. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_job.c | 7 +-- drivers/gpu/drm/amd/amdgpu/amdgpu_job.h | 6 -- 2 files changed, 5

[PATCH] drm/selftests: fix a shift-out-of-bounds bug

2022-03-03 Thread Arunpravin
pass the correct size value computed using the max_order. [ 68.124177][ T1] UBSAN: shift-out-of-bounds in include/linux/log2.h:67:13 [ 68.125333][ T1] shift exponent 4294967295 is too large for 32-bit type 'long unsigned int' [ 68.126563][ T1] CPU: 0 PID: 1 Comm: swapper Not tainted

Re: [PATCH 05/10] drm/amdgpu: use job and ib structures directly in CS parsers

2022-03-03 Thread Andrey Grodzovsky
Acked-by: Andrey Grodzovsky Andrey On 2022-03-03 03:23, Christian König wrote: Instead of providing the ib index provide the job and ib pointers directly to the patch and parse functions for UVD and VCE. Also move the set/get functions for IB values to the IB declerations. Signed-off-by:

Re: [PATCH 02/10] drm/amdgpu: header cleanup

2022-03-03 Thread Andrey Grodzovsky
Acked-by: Andrey Grodzovsky Andrey On 2022-03-03 03:23, Christian König wrote: No function change, just move a bunch of definitions from amdgpu.h into separate header files. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 95 ---

Re: [PATCH 01/10] drm/amdgpu: install ctx entities with cmpxchg

2022-03-03 Thread Andrey Grodzovsky
Reviewed-by: Andrey Grodzovsky Andrey On 2022-03-03 03:22, Christian König wrote: Since we removed the context lock we need to make sure that not two threads are trying to install an entity at the same time. Signed-off-by: Christian König Fixes: e68efb27647f ("drm/amdgpu: remove ctx->lock")

Re: [PATCH][next] drm/amd/display: Fix Wstringop-overflow warnings in dc_link_dp.c

2022-03-03 Thread Harry Wentland
On 2022-03-03 12:25, Gustavo A. R. Silva wrote: > Fix the following Wstringop-overflow warnings when building with GCC-11: > > drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dpia.c:493:17: warning: > ‘dp_decide_lane_settings’ accessing 4 bytes in a region of size 1 >

Re: [PATCH][next] drm/amd/display: Fix Wstringop-overflow warnings in dc_link_dp.c

2022-03-03 Thread Gustavo A. R. Silva
On Thu, Mar 03, 2022 at 09:43:28AM -0800, Kees Cook wrote: > On Thu, Mar 03, 2022 at 11:25:03AM -0600, Gustavo A. R. Silva wrote: > > Fix the following Wstringop-overflow warnings when building with GCC-11: > > > > drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dpia.c:493:17: > > warning:

Re: [PATCH][next] drm/amd/display: Fix Wstringop-overflow warnings in dc_link_dp.c

2022-03-03 Thread Kees Cook
On Thu, Mar 03, 2022 at 11:25:03AM -0600, Gustavo A. R. Silva wrote: > Fix the following Wstringop-overflow warnings when building with GCC-11: > > drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dpia.c:493:17: warning: > ‘dp_decide_lane_settings’ accessing 4 bytes in a region of size 1 >

[PATCH][next] drm/amd/display: Fix Wstringop-overflow warnings in dc_link_dp.c

2022-03-03 Thread Gustavo A. R. Silva
Fix the following Wstringop-overflow warnings when building with GCC-11: drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dpia.c:493:17: warning: ‘dp_decide_lane_settings’ accessing 4 bytes in a region of size 1 [-Wstringop-overflow=]

Re: [RFC v4 02/11] drm/amdgpu: Move scheduler init to after XGMI is ready

2022-03-03 Thread Andrey Grodzovsky
I pushed all the changes including your patch. Andrey On 2022-03-02 22:16, Andrey Grodzovsky wrote: OK, i will do quick smoke test tomorrow and push all of it it then. Andrey On 2022-03-02 21:59, Chen, JingWen wrote: Hi Andrey, I don't have the bare mental environment, I can only test the

RE: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread Liu, Shaoyun
[AMD Official Use Only] Reviewed by : Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Thursday, March 3, 2022 11:25 AM To: amd-gfx@lists.freedesktop.org Cc: Yu, David Subject: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran Add DFC CAP support for

[PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread David Yu
Add DFC CAP support for aldebaran Initialize cap microcode in psp_init_sriov_microcode, the ta microcode will be initialized in psp_vxx_init_microcode Signed-off-by: David Yu --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c | 2 +- drivers/gpu/drm/amd/amdgpu/psp_v13_0.c | 1 + 2 files changed,

Re: [PATCH 2/2] drm/amdkfd: implement get_atc_vmid_pasid_mapping_info for gfx10.3

2022-03-03 Thread Deucher, Alexander
[Public] Series is: Reviewed-by: Alex Deucher From: amd-gfx on behalf of Yifan Zhang Sent: Thursday, March 3, 2022 3:05 AM To: amd-gfx@lists.freedesktop.org Cc: Zhang, Yifan ; Kuehling, Felix Subject: [PATCH 2/2] drm/amdkfd: implement

Re: [PATCH 1/1] drm/amdkfd: Improve concurrency of event handling

2022-03-03 Thread Felix Kuehling
Am 2022-03-03 um 02:25 schrieb Christian König: Am 02.03.22 um 21:06 schrieb Felix Kuehling: Use rcu_read_lock to read p->event_idr concurrently with other readers and writers. Use p->event_mutex only for creating and destroying events and in kfd_wait_on_events. That might not necessary

RE: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread Liu, Shaoyun
[AMD Official Use Only] Probably just described as follows : Initialize cap microcode in psp_init_sriov_microcode, the ta microcode will be initialized in psp_vxx_init_microcode -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Thursday, March 3, 2022 9:10 AM To:

RE: [PATCH 1/2] drm/amdgpu/vcn: Update fw shared data structure

2022-03-03 Thread Liu, Leo
[AMD Official Use Only] The series are: Reviewed-by: Leo Liu -Original Message- From: Dong, Ruijing Sent: March 2, 2022 4:25 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Leo ; Deucher, Alexander Subject: [PATCH 1/2] drm/amdgpu/vcn: Update fw shared data structure Add fw log in fw

[PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread David Yu
Add DFC CAP support for aldebaran Changed incorrect call to psp_init_ta_microcode in psp_init_sriov_microcode to psp_init_cap_microcode which caused it to fail even with correct CAP firmware. Signed-off-by: David Yu --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c | 2 +-

RE: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread David Laight
From: Xiaomeng Tong > Sent: 03 March 2022 07:27 > > On Thu, 3 Mar 2022 04:58:23 +, David Laight wrote: > > on 3 Mar 2022 10:27:29 +0800, Xiaomeng Tong wrote: > > > The problem is the mis-use of iterator outside the loop on exit, and > > > the iterator will be the HEAD's container_of pointer

RE: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Xiaomeng Tong
> From: Xiaomeng Tong > > Sent: 03 March 2022 07:27 > > > > On Thu, 3 Mar 2022 04:58:23 +, David Laight wrote: > > > on 3 Mar 2022 10:27:29 +0800, Xiaomeng Tong wrote: > > > > The problem is the mis-use of iterator outside the loop on exit, and > > > > the iterator will be the HEAD's

Re: [Kgdb-bugreport] [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Daniel Thompson
On Thu, Mar 03, 2022 at 03:26:57PM +0800, Xiaomeng Tong wrote: > On Thu, 3 Mar 2022 04:58:23 +, David Laight wrote: > > on 3 Mar 2022 10:27:29 +0800, Xiaomeng Tong wrote: > > > The problem is the mis-use of iterator outside the loop on exit, and > > > the iterator will be the HEAD's

Re: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Xiaomeng Tong
correct for typo: -for (struct list_head *list = head->next, cond = (struct list_head *)-1; cond == (struct list_head *)-1; cond = NULL) \ +for (struct list_head *list = head->next, *cond = (struct list_head *)-1; cond == (struct list_head *)-1; cond = NULL) \ -- Xiaomeng Tong

Re: [PATCH v9] drm/amdgpu: add drm buddy support to amdgpu

2022-03-03 Thread Christian König
Am 01.03.22 um 21:38 schrieb Arunpravin: - Remove drm_mm references and replace with drm buddy functionalities - Add res cursor support for drm buddy v2(Matthew Auld): - replace spinlock with mutex as we call kmem_cache_zalloc (..., GFP_KERNEL) in drm_buddy_alloc() function -

Re: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Dan Carpenter
On Wed, Mar 02, 2022 at 10:29:31AM +0100, Rasmus Villemoes wrote: > This won't help the current issue (because it doesn't exist and might > never), but just in case some compiler people are listening, I'd like to > have some sort of way to tell the compiler "treat this variable as > uninitialized

Re: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Dan Carpenter
On Wed, Mar 02, 2022 at 12:07:04PM -0800, Kees Cook wrote: > On Wed, Mar 02, 2022 at 10:29:31AM +0100, Rasmus Villemoes wrote: > > This won't help the current issue (because it doesn't exist and might > > never), but just in case some compiler people are listening, I'd like to > > have some sort

[PATCH] drm: Remove redundant code

2022-03-03 Thread Jiapeng Chong
Clean up the following smatch warning: drivers/gpu/drm/radeon/atom.c:400 atom_skip_src_int() warn: ignoring unreachable code. Reported-by: Abaci Robot Signed-off-by: Jiapeng Chong --- drivers/gpu/drm/radeon/atom.c | 1 - 1 file changed, 1 deletion(-) diff --git

Re: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Xiaomeng Tong
> I think this would make sense, it would mean you only assign the containing > element on valid elements. > > I was thinking something along the lines of: > > #define list_for_each_entry(pos, head, member) > \ > for (struct list_head *list =

RE: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Xiaomeng Tong
On Thu, 3 Mar 2022 04:58:23 +, David Laight wrote: > on 3 Mar 2022 10:27:29 +0800, Xiaomeng Tong wrote: > > The problem is the mis-use of iterator outside the loop on exit, and > > the iterator will be the HEAD's container_of pointer which pointers > > to a type-confused struct. Sidenote: The

Re: [PATCH 2/6] treewide: remove using list iterator after loop body as a ptr

2022-03-03 Thread Jakob Koschel
> On 3. Mar 2022, at 05:58, David Laight wrote: > > From: Xiaomeng Tong >> Sent: 03 March 2022 02:27 >> >> On Wed, 2 Mar 2022 14:04:06 +, David Laight >> wrote: >>> I think that it would be better to make any alternate loop macro >>> just set the variable to NULL on the loop exit. >>>

[PATCH 10/10] drm/amdgpu: add gang submit frontend

2022-03-03 Thread Christian König
Allows submitting jobs as gang which needs to run on multiple engines at the same time. All members of the gang get the same implicit, explicit and VM dependencies. So no gang member will start running until everything else is ready. The last job is considered the gang leader (usually a

[PATCH 07/10] drm/amdgpu: move setting the job resources

2022-03-03 Thread Christian König
Move setting the job resources into amdgpu_job.c Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 21 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_job.c | 17 + drivers/gpu/drm/amd/amdgpu/amdgpu_job.h | 2 ++ 3 files changed, 21

[PATCH 09/10] drm/amdgpu: add gang submit backend

2022-03-03 Thread Christian König
Allows submitting jobs as gang which needs to run on multiple engines at the same time. Basic idea is that we have a global gang submit fence representing when the gang leader is finally pushed to run on the hardware last. Jobs submitted as gang are never re-submitted in case of a GPU reset

[PATCH 08/10] drm/amdgpu: initialize the vmid_wait with the stub fence

2022-03-03 Thread Christian König
This way we don't need to check for NULL any more. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ids.c | 2 +- drivers/gpu/drm/amd/amdgpu/amdgpu_ring.c | 1 + 2 files changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ids.c

[PATCH 06/10] drm/amdgpu: properly imbed the IBs into the job

2022-03-03 Thread Christian König
We now have standard macros for that. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_job.c | 7 +-- drivers/gpu/drm/amd/amdgpu/amdgpu_job.h | 6 -- 2 files changed, 5 insertions(+), 8 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_job.c

[PATCH 05/10] drm/amdgpu: use job and ib structures directly in CS parsers

2022-03-03 Thread Christian König
Instead of providing the ib index provide the job and ib pointers directly to the patch and parse functions for UVD and VCE. Also move the set/get functions for IB values to the IB declerations. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 6 +-

[PATCH 04/10] drm/amdgpu: remove SRIOV and MCBP dependencies from the CS

2022-03-03 Thread Christian König
We should not have any different CS constrains based on the execution environment. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c

[PATCH 03/10] drm/amdgpu: cleanup and reorder amdgpu_cs.c

2022-03-03 Thread Christian König
Sort the functions in the order they are called and cleanup the coding style and function names to represent the data they process. Check the size of the IB chunk, initialize resulting entity and scheduler job much earlier as well. Signed-off-by: Christian König ---

[PATCH 02/10] drm/amdgpu: header cleanup

2022-03-03 Thread Christian König
No function change, just move a bunch of definitions from amdgpu.h into separate header files. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 95 --- drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c| 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_cs.h

[PATCH 01/10] drm/amdgpu: install ctx entities with cmpxchg

2022-03-03 Thread Christian König
Since we removed the context lock we need to make sure that not two threads are trying to install an entity at the same time. Signed-off-by: Christian König Fixes: e68efb27647f ("drm/amdgpu: remove ctx->lock") --- drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c | 8 +++- 1 file changed, 7

Gang submit

2022-03-03 Thread Christian König
Hi guys, this patch set implements the the requirement for so called gang submissions in the CS interface. A gang submission guarantees that multiple IBs can run on different engines at the same time. This is implemented by keeping a global per-device gang around represented by a dma_fence

[PATCH 2/2] drm/amdkfd: implement get_atc_vmid_pasid_mapping_info for gfx10.3

2022-03-03 Thread Yifan Zhang
This patch implements get_atc_vmid_pasid_mapping_info for gfx10.3 Signed-off-by: Yifan Zhang --- .../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gfx_v10_3.c | 16 +++- 1 file changed, 15 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gfx_v10_3.c

[PATCH 1/2] drm/amdkfd: judge get_atc_vmid_pasid_mapping_info before call

2022-03-03 Thread Yifan Zhang
Fix the NULL point issue: [ 3076.255609] BUG: kernel NULL pointer dereference, address: [ 3076.255624] #PF: supervisor instruction fetch in kernel mode [ 3076.255637] #PF: error_code(0x0010) - not-present page [ 3076.255649] PGD 0 P4D 0 [ 3076.255660] Oops: 0010 [#1] SMP NOPTI [