Re: [PATCH RFC 4/4] bpf,cgroup,perf: extend bpf-cgroup to support tracepoint attachment

2021-11-18 Thread Kenny Ho
On Thu, Nov 18, 2021 at 11:33 PM Alexei Starovoitov wrote: > > On Thu, Nov 18, 2021 at 03:28:40PM -0500, Kenny Ho wrote: > > + for_each_possible_cpu(cpu) { > > + /* allocate first, connect the cgroup later */ > > + events[i] = perf_event_create_kernel_counter(attr,

RE: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function

2021-11-18 Thread Liu, Shaoyun
[AMD Official Use Only] Ping -Original Message- From: Liu, Shaoyun Sent: Thursday, November 18, 2021 11:58 AM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function For sriov XGMI configuration, the host

Re: [PATCH 1/2] drm/amdgpu/gfx10: add wraparound gpu counter check for APUs as well

2021-11-18 Thread Luben Tuikov
Seems reasonable. Series is Acked-by: Luben Tuikov Regards, Luben On 2021-11-18 14:54, Alex Deucher wrote: > Apply the same check we do for dGPUs for APUs as well. > > Signed-off-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 15 +-- > 1 file changed, 13

Re: [PATCH] drm/amdgpu/pm: clean up some inconsistent indenting

2021-11-18 Thread Alex Deucher
Applied. Thanks! Alex On Thu, Nov 18, 2021 at 5:57 AM Jiapeng Chong wrote: > > Eliminate the follow smatch warning: > > drivers/gpu/drm/amd/amdgpu/../pm/powerplay/amd_powerplay.c:1554 > pp_asic_reset_mode_2() warn: inconsistent indenting. > > Reported-by: Abaci Robot > Signed-off-by: Jiapeng

Re: [radeon] connector_info_from_object_table

2021-11-18 Thread Alex Deucher
On Thu, Nov 18, 2021 at 11:37 AM Amol wrote: > > Hello, > > The function radeon_get_atom_connector_info_from_object_table, > at location [1], ends up parsing ATOM_COMMON_TABLE_HEADER > as ATOM_COMMON_RECORD_HEADER if > enc_obj->asObjects[k].usRecordOffset is zero. It is found to be zero > in the

Re: [PATCH] drm/amdgpu: Declare Unpin BO api as static

2021-11-18 Thread Deucher, Alexander
[Public] Reviewed-by: Alex Deucher From: amd-gfx on behalf of Ramesh Errabolu Sent: Thursday, November 18, 2021 5:11 PM To: amd-gfx@lists.freedesktop.org Cc: Errabolu, Ramesh ; kernel test robot Subject: [PATCH] drm/amdgpu: Declare Unpin BO api as static

[PATCH] drm/amdgpu: Declare Unpin BO api as static

2021-11-18 Thread Ramesh Errabolu
Fixes warning report from kernel test robot Reported-by: kernel test robot Signed-off-by: Ramesh Errabolu --- drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c

RE: [PATCH] drm/amdkfd: Remove unused entries in table

2021-11-18 Thread Sider, Graham
[AMD Official Use Only] Reviewed-by: Graham Sider > Remove unused entries in kfd_device_info table: num_xgmi_sdma_engines > and num_sdma_queues_per_engine. They are calculated in > kfd_get_num_sdma_engines and kfd_get_num_xgmi_sdma_engines > instead. > > Signed-off-by: Amber Lin > --- >

[PATCH RFC 0/4] Add ability to attach bpf programs to a tracepoint inside a cgroup

2021-11-18 Thread Kenny Ho
Per an earlier discussion last year[1], I have been looking for a mechanism to a) collect resource usages for devices (GPU for now but there could be other device type in the future) and b) possibly enforce some of the resource usages. An obvious mechanism was to use cgroup but there are too

[PATCH RFC 2/4] bpf, perf: add ability to attach complete array of bpf prog to perf event

2021-11-18 Thread Kenny Ho
Change-Id: Ie2580c3a71e2a5116551879358cb5304b04d3838 Signed-off-by: Kenny Ho --- include/linux/trace_events.h | 9 + kernel/trace/bpf_trace.c | 28 2 files changed, 37 insertions(+) diff --git a/include/linux/trace_events.h

[PATCH RFC 1/4] cgroup, perf: Add ability to connect to perf cgroup from other cgroup controller

2021-11-18 Thread Kenny Ho
This provides the ability to allocate cgroup specific perf_event by bpf-cgroup in later patch Change-Id: I13aa7f3dfc2883ba3663c0b94744a6169504bbd8 Signed-off-by: Kenny Ho --- include/linux/cgroup.h | 2 ++ include/linux/perf_event.h | 2 ++ kernel/cgroup/cgroup.c | 4 ++--

[PATCH] drm/amdkfd: Remove unused entries in table

2021-11-18 Thread Amber Lin
Remove unused entries in kfd_device_info table: num_xgmi_sdma_engines and num_sdma_queues_per_engine. They are calculated in kfd_get_num_sdma_engines and kfd_get_num_xgmi_sdma_engines instead. Signed-off-by: Amber Lin --- drivers/gpu/drm/amd/amdkfd/kfd_device.c | 58 -

[PATCH 1/2] drm/amdgpu/gfx10: add wraparound gpu counter check for APUs as well

2021-11-18 Thread Alex Deucher
Apply the same check we do for dGPUs for APUs as well. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 15 +-- 1 file changed, 13 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c

[PATCH 2/2] drm/amdgpu/gfx9: switch to golden tsc registers for renoir+

2021-11-18 Thread Alex Deucher
Renoir and newer gfx9 APUs have new TSC register that is not part of the gfxoff tile, so it can be read without needing to disable gfx off. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 46 --- 1 file changed, 35 insertions(+), 11 deletions(-)

Re: [PATCH v1 1/9] mm: add zone device coherent type memory support

2021-11-18 Thread Felix Kuehling
Am 2021-11-18 um 1:53 a.m. schrieb Alistair Popple: > On Tuesday, 16 November 2021 6:30:18 AM AEDT Alex Sierra wrote: >> Device memory that is cache coherent from device and CPU point of view. >> This is used on platforms that have an advanced system bus (like CAPI >> or CXL). Any page of a

[RFC 3/3] drm/amd/pm: Add debugfs info for STB

2021-11-18 Thread Andrey Grodzovsky
Add debugfs hook. Signed-off-by: Andrey Grodzovsky Reviewed-by: Lijo Lazar Reviewed-by: Luben Tuikov --- drivers/gpu/drm/amd/pm/amdgpu_pm.c| 2 + drivers/gpu/drm/amd/pm/inc/amdgpu_smu.h | 1 + drivers/gpu/drm/amd/pm/swsmu/amdgpu_smu.c | 86 +++ 3 files changed,

[RFC 2/3] drm/amd/pm: Add STB support in sienna_cichlid

2021-11-18 Thread Andrey Grodzovsky
Add STB implementation for sienna_cichlid Signed-off-by: Andrey Grodzovsky Reviewed-by: Lijo Lazar Reviewed-by: Luben Tuikov --- .../amd/include/asic_reg/mp/mp_11_0_offset.h | 7 +++ .../amd/include/asic_reg/mp/mp_11_0_sh_mask.h | 12 .../amd/pm/swsmu/smu11/sienna_cichlid_ppt.c | 55

[RFC 1/3] drm/amd/pm: Add STB accessors interface

2021-11-18 Thread Andrey Grodzovsky
Add interface to collect STB logs. Signed-off-by: Andrey Grodzovsky Reviewed-by: Lijo Lazar Reviewed-by: Luben Tuikov --- drivers/gpu/drm/amd/pm/inc/amdgpu_smu.h | 15 +++ drivers/gpu/drm/amd/pm/swsmu/amdgpu_smu.c | 18 ++ 2 files changed, 33 insertions(+) diff

[RFC 0/3] Add Smart Trace Buffers support

2021-11-18 Thread Andrey Grodzovsky
The Smart Trace Buffer (STB), is a cyclic data buffer used to log information about system execution for characterization and debug purposes. If at any point should a system encounter a functional failure the trace can be collected without need for reproducing the failure while running

RE: [PATCH] drm/amd/display: Drop config guard for DC_LOG_DP2

2021-11-18 Thread Ma, Leo
[AMD Official Use Only] Agree. Patch applied. Thanks for your review -Leo -Original Message- From: Alex Deucher Sent: Thursday, November 18, 2021 12:02 PM To: Ma, Leo Cc: Kazlauskas, Nicholas ; amd-gfx list ; Deucher, Alexander ; Choi, Nicholas ; Wentland, Harry Subject: Re:

Re: [PATCH] drm/amd/display: Drop config guard for DC_LOG_DP2

2021-11-18 Thread Alex Deucher
On Thu, Nov 18, 2021 at 11:53 AM Leo (Hanghong) Ma wrote: > > [Why & How] > It doesn't make sense to guard DC_LOG_DP2 by CONFIG_DRM_AMD_DCN, and > this also caused build failure for allmodconfig; So drop the guard > to fix the compile failure; > > Signed-off-by: Leo (Hanghong) Ma Reviewed-by:

[PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function

2021-11-18 Thread shaoyunl
For sriov XGMI configuration, the host driver will handle the hive reset, so in guest side, the reset_sriov only be called once on one device. This will make kfd post_reset unblanced with kfd pre_reset since kfd pre_reset already been moved out of reset_sriov function. Move kfd post_reset out of

[PATCH] drm/amd/display: Drop config guard for DC_LOG_DP2

2021-11-18 Thread Leo (Hanghong) Ma
[Why & How] It doesn't make sense to guard DC_LOG_DP2 by CONFIG_DRM_AMD_DCN, and this also caused build failure for allmodconfig; So drop the guard to fix the compile failure; Signed-off-by: Leo (Hanghong) Ma --- drivers/gpu/drm/amd/display/include/logger_types.h | 4 1 file changed, 4

Re: [PATCH 3/3] drm/amdkfd: simplify drain retry fault

2021-11-18 Thread philip yang
On 2021-11-18 11:39 a.m., Felix Kuehling wrote: Am 2021-11-18 um 11:19 a.m. schrieb philip yang: On 2021-11-17 7:14 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: unmap

Re: [PATCH 3/3] drm/amdkfd: simplify drain retry fault

2021-11-18 Thread Felix Kuehling
Am 2021-11-18 um 11:19 a.m. schrieb philip yang: > > > On 2021-11-17 7:14 p.m., Felix Kuehling wrote: >> >> On 2021-11-16 10:43 p.m., Philip Yang wrote: >>> unmap range always set svms->drain_pagefaults flag to simplify both >>> parent range and child range unmap. Deferred list work takes mmap

[radeon] connector_info_from_object_table

2021-11-18 Thread Amol
Hello, The function radeon_get_atom_connector_info_from_object_table, at location [1], ends up parsing ATOM_COMMON_TABLE_HEADER as ATOM_COMMON_RECORD_HEADER if enc_obj->asObjects[k].usRecordOffset is zero. It is found to be zero in the BIOS found at [2]. Thankfully, the loop that follows exits

Re: [PATCH 3/3] drm/amdkfd: simplify drain retry fault

2021-11-18 Thread philip yang
On 2021-11-17 7:14 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: unmap range always set svms->drain_pagefaults flag to simplify both parent range and child range unmap. Deferred list

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Alex Deucher
On Thu, Nov 18, 2021 at 10:01 AM Lazar, Lijo wrote: > > [Public] > > > BTW, I'm not sure if 'reset always' on resume is a good idea for GPUs in a > hive (assuming those systems also get suspended and get hiccups). At this > point the hive isn't reinitialized. Yeah, we should probably not

Re: [PATCH 2/3] drm/amdkfd: handle VMA remove race

2021-11-18 Thread Felix Kuehling
Am 2021-11-18 um 10:55 a.m. schrieb philip yang: > > > On 2021-11-18 10:07 a.m., Felix Kuehling wrote: >> Am 2021-11-18 um 10:00 a.m. schrieb philip yang: >>> On 2021-11-17 7:10 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: > VMA may be removed before unmap

Re: [PATCH 2/3] drm/amdkfd: handle VMA remove race

2021-11-18 Thread philip yang
On 2021-11-18 10:07 a.m., Felix Kuehling wrote: Am 2021-11-18 um 10:00 a.m. schrieb philip yang: On 2021-11-17 7:10 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: VMA may

Re: [PATCH v10 08/10] dyndbg: add print-to-tracefs, selftest with it - RFC

2021-11-18 Thread Pekka Paalanen
On Thu, 18 Nov 2021 09:29:27 -0500 Jason Baron wrote: > On 11/16/21 3:46 AM, Pekka Paalanen wrote: > > On Fri, 12 Nov 2021 10:08:41 -0500 > > Jason Baron wrote: > > > >> On 11/12/21 6:49 AM, Vincent Whitchurch wrote: > >>> On Thu, Nov 11, 2021 at 03:02:04PM -0700, Jim Cromie wrote: >

Re: [PATCH 2/3] drm/amdkfd: handle VMA remove race

2021-11-18 Thread Felix Kuehling
Am 2021-11-18 um 10:00 a.m. schrieb philip yang: > > > On 2021-11-17 7:10 p.m., Felix Kuehling wrote: >> On 2021-11-16 10:43 p.m., Philip Yang wrote: >>> VMA may be removed before unmap notifier callback, restore pages take >>> mmap write lock to lookup VMA to avoid race, >> >> The old code looked

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Lazar, Lijo
[Public] BTW, I'm not sure if 'reset always' on resume is a good idea for GPUs in a hive (assuming those systems also get suspended and get hiccups). At this point the hive isn't reinitialized. Thanks, Lijo

Re: [PATCH 2/3] drm/amdkfd: handle VMA remove race

2021-11-18 Thread philip yang
On 2021-11-17 7:10 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: VMA may be removed before unmap notifier callback, restore pages take mmap write lock to lookup VMA to avoid race,

RE: 回复: [PATCH Review 3/4] drm/amdgpu: add message smu to get ecc_table v2

2021-11-18 Thread Zhang, Hawking
[AMD Official Use Only] Series looks good to me. Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Lazar, Lijo Sent: Thursday, November 18, 2021 22:41 To: Yang, Stanley ; amd-gfx@lists.freedesktop.org; Zhang, Hawking ; Clements, John ; Quan, Evan ; Wang,

Re: 回复: [PATCH Review 3/4] drm/amdgpu: add message smu to get ecc_table v2

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 6:05 PM, Yang, Stanley wrote: [AMD Official Use Only] -邮件原件- 发件人: Lazar, Lijo 发送时间: Thursday, November 18, 2021 7:33 PM 收件人: Yang, Stanley ; amd- g...@lists.freedesktop.org; Zhang, Hawking ; Clements, John ; Quan, Evan ; Wang, Yang(Kevin) 主题: Re: [PATCH Review 3/4]

[PATCH] drm/amdgpu/pm: clean up some inconsistent indenting

2021-11-18 Thread Jiapeng Chong
Eliminate the follow smatch warning: drivers/gpu/drm/amd/amdgpu/../pm/powerplay/amd_powerplay.c:1554 pp_asic_reset_mode_2() warn: inconsistent indenting. Reported-by: Abaci Robot Signed-off-by: Jiapeng Chong --- drivers/gpu/drm/amd/pm/powerplay/amd_powerplay.c | 2 +- 1 file changed, 1

Re: [PATCH v10 08/10] dyndbg: add print-to-tracefs, selftest with it - RFC

2021-11-18 Thread Jason Baron
On 11/16/21 3:46 AM, Pekka Paalanen wrote: > On Fri, 12 Nov 2021 10:08:41 -0500 > Jason Baron wrote: > >> On 11/12/21 6:49 AM, Vincent Whitchurch wrote: >>> On Thu, Nov 11, 2021 at 03:02:04PM -0700, Jim Cromie wrote: Sean Paul proposed, in:

Re: [PATCH 1/3] drm/amdkfd: process exit and retry fault race

2021-11-18 Thread philip yang
On 2021-11-17 6:18 p.m., Felix Kuehling wrote: On 2021-11-16 10:43 p.m., Philip Yang wrote: kfd process mmu release notifier callback drain retry fault to ensure no retry fault comes after removing kfd process from the

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 7:55 PM, Alex Deucher wrote: On Thu, Nov 18, 2021 at 9:15 AM Lazar, Lijo wrote: On 11/18/2021 7:41 PM, Christian König wrote: Am 18.11.21 um 15:09 schrieb Lazar, Lijo: On 11/18/2021 7:36 PM, Alex Deucher wrote: On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike wrote:

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Alex Deucher
On Thu, Nov 18, 2021 at 9:15 AM Lazar, Lijo wrote: > > > > On 11/18/2021 7:41 PM, Christian König wrote: > > Am 18.11.21 um 15:09 schrieb Lazar, Lijo: > >> On 11/18/2021 7:36 PM, Alex Deucher wrote: > >>> On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike > >>> wrote: > > [Public] > >

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 7:41 PM, Christian König wrote: Am 18.11.21 um 15:09 schrieb Lazar, Lijo: On 11/18/2021 7:36 PM, Alex Deucher wrote: On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike wrote: [Public] -Original Message- From: Lazar, Lijo Sent: Thursday, November 18, 2021 4:01 PM To:

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Christian König
Am 18.11.21 um 15:09 schrieb Lazar, Lijo: On 11/18/2021 7:36 PM, Alex Deucher wrote: On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike wrote: [Public] -Original Message- From: Lazar, Lijo Sent: Thursday, November 18, 2021 4:01 PM To: Liang, Prike ; amd-gfx@lists.freedesktop.org Cc:

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 7:36 PM, Alex Deucher wrote: On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike wrote: [Public] -Original Message- From: Lazar, Lijo Sent: Thursday, November 18, 2021 4:01 PM To: Liang, Prike ; amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Huang, Ray Subject:

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Alex Deucher
On Thu, Nov 18, 2021 at 8:11 AM Liang, Prike wrote: > > [Public] > > > -Original Message- > > From: Lazar, Lijo > > Sent: Thursday, November 18, 2021 4:01 PM > > To: Liang, Prike ; amd-gfx@lists.freedesktop.org > > Cc: Deucher, Alexander ; Huang, Ray > > > > Subject: Re: [PATCH]

RE: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Liang, Prike
[Public] > -Original Message- > From: Lazar, Lijo > Sent: Thursday, November 18, 2021 4:01 PM > To: Liang, Prike ; amd-gfx@lists.freedesktop.org > Cc: Deucher, Alexander ; Huang, Ray > > Subject: Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend > aborted > > > > On

回复: [PATCH Review 3/4] drm/amdgpu: add message smu to get ecc_table v2

2021-11-18 Thread Yang, Stanley
[AMD Official Use Only] > -邮件原件- > 发件人: Lazar, Lijo > 发送时间: Thursday, November 18, 2021 7:33 PM > 收件人: Yang, Stanley ; amd- > g...@lists.freedesktop.org; Zhang, Hawking ; > Clements, John ; Quan, Evan > ; Wang, Yang(Kevin) > 主题: Re: [PATCH Review 3/4] drm/amdgpu: add message smu to

Re: [PATCH Review 3/4] drm/amdgpu: add message smu to get ecc_table v2

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 3:03 PM, Stanley.Yang wrote: support ECC TABLE message, this table include umc ras error count and error address v2: add smu version check to query whether support ecctable call smu_cmn_update_table to get ecctable directly Signed-off-by: Stanley.Yang ---

RE: [PATCH] drm/amdgpu: update the domain flags for dumb buffer creation

2021-11-18 Thread Chen, Guchun
[Public] Reviewed-by: Guchun Chen Regards, Guchun -Original Message- From: amd-gfx On Behalf Of Evan Quan Sent: Thursday, November 18, 2021 4:27 PM To: amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Quan, Evan ; Koenig, Christian Subject: [PATCH] drm/amdgpu: update the

回复: [PATCH Review 1/4] drm/amdgpu: Update smu driver interface for aldebaran

2021-11-18 Thread Yang, Stanley
[AMD Official Use Only] Thanks Evan, Will update patch 1 and 3 title before submit. Regards, Stanley > -邮件原件- > 发件人: Quan, Evan > 发送时间: Thursday, November 18, 2021 5:58 PM > 收件人: Yang, Stanley ; amd- > g...@lists.freedesktop.org; Zhang, Hawking ; > Clements, John ; Lazar, Lijo > ;

RE: [PATCH Review 1/4] drm/amdgpu: Update smu driver interface for aldebaran

2021-11-18 Thread Quan, Evan
[AMD Official Use Only] Better to update the patch title as "drm/amd/pm: Update smu driver interface for aldebaran" as all other power related patches. And please update patch3 also. Other than above, patch 1, 3 are reviewed-by: Evan Quan > -Original Message- > From: Stanley.Yang >

[PATCH Review 4/4] query umc error info from ecc_table v2

2021-11-18 Thread Stanley . Yang
if smu support ECCTABLE, driver can message smu to get ecc_table then query umc error info from ECCTABLE v2: optimize source code makes logical more reasonable Signed-off-by: Stanley.Yang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 42 +++

[PATCH Review 3/4] drm/amdgpu: add message smu to get ecc_table v2

2021-11-18 Thread Stanley . Yang
support ECC TABLE message, this table include umc ras error count and error address v2: add smu version check to query whether support ecctable call smu_cmn_update_table to get ecctable directly Signed-off-by: Stanley.Yang --- drivers/gpu/drm/amd/pm/inc/amdgpu_smu.h | 8 +++

[PATCH Review 2/4] drm/amdgpu: add new query interface for umc block v2

2021-11-18 Thread Stanley . Yang
add message smu to query error information v2: rename message_smu to ecc_info Signed-off-by: Stanley.Yang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 16 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 + drivers/gpu/drm/amd/amdgpu/umc_v6_7.c | 161 3 files

[PATCH Review 1/4] drm/amdgpu: Update smu driver interface for aldebaran

2021-11-18 Thread Stanley . Yang
update smu driver if version to 0x08 to avoid mismatch log A version mismatch can still happen with an older FW Change-Id: I97f2bc4ed9a9cba313b744e2ff6812c90b244935 Signed-off-by: Stanley.Yang --- .../drm/amd/pm/inc/smu13_driver_if_aldebaran.h | 18 +-

Re: [PATCH v1 1/9] mm: add zone device coherent type memory support

2021-11-18 Thread Alistair Popple
On Tuesday, 16 November 2021 6:30:18 AM AEDT Alex Sierra wrote: > Device memory that is cache coherent from device and CPU point of view. > This is used on platforms that have an advanced system bus (like CAPI > or CXL). Any page of a process can be migrated to such memory. However, > no one

Re: [PATCH v1 6/9] lib: test_hmm add module param for zone device type

2021-11-18 Thread Alistair Popple
On Tuesday, 16 November 2021 6:30:23 AM AEDT Alex Sierra wrote: > In order to configure device coherent in test_hmm, two module parameters > should be passed, which correspond to the SP start address of each > device (2) spm_addr_dev0 & spm_addr_dev1. If no parameters are passed, > private device

Re: [PATCH v1 2/9] mm: add device coherent vma selection for memory migration

2021-11-18 Thread Alistair Popple
On Tuesday, 16 November 2021 6:30:19 AM AEDT Alex Sierra wrote: > This case is used to migrate pages from device memory, back to system > memory. Device coherent type memory is cache coherent from device and CPU > point of view. > > Signed-off-by: Alex Sierra > --- > v2: > condition added when

[PATCH] drm/amdgpu: update the domain flags for dumb buffer creation

2021-11-18 Thread Evan Quan
After switching to generic framebuffer framework, we rely on the ->dumb_create routine for frame buffer creation. However, the different domain flags used are not optimal. Add the contiguous flag to directly allocate the scanout BO as one linear buffer. Fixes: 844612e1149d ("drm/amdgpu: use

Re: [PATCH] drm/amdgpu: reset asic after system-wide suspend aborted

2021-11-18 Thread Lazar, Lijo
On 11/18/2021 12:32 PM, Prike Liang wrote: Do ASIC reset at the moment Sx suspend aborted behind of amdgpu suspend to keep AMDGPU in a clean reset state and that can avoid re-initialize device improperly error. Signed-off-by: Prike Liang --- drivers/gpu/drm/amd/amdgpu/amdgpu.h| 1