RE: [PATCH 1/2] drm/amdgpu: add a spinlock to wb allocation

2024-04-22 Thread Liu, Shaoyun
[AMD Official Use Only - General] These two patches Looks good to me . Reviewed by Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Alex Deucher Sent: Monday, April 22, 2024 10:38 AM To: amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander Subject: [PATCH 1/2] drm/amdgpu:

RE: [PATCH 3/3] drm/amdgpu/mes11: make fence waits synchronous

2024-04-18 Thread Liu, Shaoyun
nig, Christian Sent: Thursday, April 18, 2024 1:59 AM To: Alex Deucher ; Liu, Shaoyun Cc: Chen, Horace ; amd-gfx@lists.freedesktop.org; Andrey Grodzovsky ; Kuehling, Felix ; Deucher, Alexander ; Xiao, Jack ; Zhang, Hawking ; Liu, Monk ; Xu, Feifei ; Chang, HaiJun ; Leo Liu ; Liu, Jenny (Jing) Subj

RE: [PATCH 1/3] drm/amdgpu/mes11: print MES opcodes rather than numbers

2024-04-17 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks good to me . Reviewed by Shaoyun.liu < shaoyun@amd.com> -Original Message- From: amd-gfx On Behalf Of Horace Chen Sent: Wednesday, April 17, 2024 7:30 AM To: amd-gfx@lists.freedesktop.org Cc: Andrey Grodzovsky ; Kuehling, Felix ; Chen, Horace

RE: [PATCH 3/3] drm/amdgpu/mes11: make fence waits synchronous

2024-04-17 Thread Liu, Shaoyun
[AMD Official Use Only - General] I have a discussion with Christian about this before . The conclusion is that driver should prevent multiple process from using the MES ring at the same time . Also for current MES ring usage ,driver doesn't have the logic to prevent the ring been

RE: [PATCH] drm/amdgpu/mes11: print MES opcodes rather than numbers

2024-04-01 Thread Liu, Shaoyun
[AMD Official Use Only - General] Comments inline -Original Message- From: amd-gfx On Behalf Of Alex Deucher Sent: Saturday, March 30, 2024 10:01 AM To: amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander Subject: [PATCH] drm/amdgpu/mes11: print MES opcodes rather than numbers Makes

RE: [PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature

2024-03-26 Thread Liu, Shaoyun
it and check with MES engineer. Regards Shaoyun.liu -Original Message- From: Alex Deucher Sent: Tuesday, March 26, 2024 12:50 PM To: Liu, Shaoyun Cc: amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature On Tue, Mar 26, 2024 at 11:51 AM Liu

RE: [PATCH] drm/amdgpu : Increase the mes log buffer size as per new MES FW version

2024-03-26 Thread Liu, Shaoyun
and may take some time for driver side to pick it up , but before this I'd like to have a solution that can fix the issue ASAP . Regards Shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Tuesday, March 26, 2024 2:07 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re

RE: [PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature

2024-03-26 Thread Liu, Shaoyun
[AMD Official Use Only - General] ping From: amd-gfx On Behalf Of Liu, Shaoyun Sent: Monday, March 25, 2024 8:51 AM To: amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature [AMD Official Use Only - General] [AMD Official Use Only

RE: [PATCH] drm/amdgpu : Increase the mes log buffer size as per new MES FW version

2024-03-25 Thread Liu, Shaoyun
[AMD Official Use Only - General] It can cause page fault when the log size exceed the page size . -Original Message- From: Kuehling, Felix Sent: Monday, March 25, 2024 2:58 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amdgpu : Increase the mes log

Re: [PATCH] drm/amdgpu : Increase the mes log buffer size as per new MES FW version

2024-03-25 Thread Liu, Shaoyun
[AMD Official Use Only - General] Ping Get Outlook for iOS<https://aka.ms/o0ukef> From: Liu, Shaoyun Sent: Friday, March 22, 2024 12:49:56 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amdgpu : Increase the mes log buffe

Re: [PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature

2024-03-25 Thread Liu, Shaoyun
[AMD Official Use Only - General] Ping Get Outlook for iOS<https://aka.ms/o0ukef> From: Liu, Shaoyun Sent: Friday, March 22, 2024 2:00:21 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amdgpu : Add mes_log_enable to control m

RE: [PATCH 9/9] drm/amdgpu: enable MES discovery for GC 11.5.1

2024-02-16 Thread Liu, Shaoyun
[AMD Official Use Only - General] Reviewed by shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Alex Deucher Sent: Thursday, February 15, 2024 3:40 PM To: amd-gfx@lists.freedesktop.org Cc: Zhang, Yifan ; Deucher, Alexander Subject: [PATCH 9/9] drm/amdgpu: enable MES discovery

RE: [PATCH] drm/amdgpu: Only create mes event log debugfs when mes is enabled

2024-02-01 Thread Liu, Shaoyun
[AMD Official Use Only - General] ping -Original Message- From: Liu, Shaoyun Sent: Wednesday, January 31, 2024 9:26 AM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amdgpu: Only create mes event log debugfs when mes is enabled Skip the debugfs file creation

RE: [PATCH] drm/amdgpu: move kiq_reg_write_reg_wait() out of amdgpu_virt.c

2024-01-08 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks good to me . Reviewed by : Shaoyun.liu -Original Message- From: Deucher, Alexander Sent: Monday, January 8, 2024 4:38 PM To: amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Liu, Shaoyun ; Koenig, Christian Subject: [PATCH] drm/amdgpu

RE: [PATCH] drm/amd: Add a workaround for GFX11 systems that fail to flush TLB

2023-12-14 Thread Liu, Shaoyun
Message- From: Alex Deucher Sent: Thursday, December 14, 2023 10:07 AM To: Liu, Shaoyun Cc: Christian König ; Limonciello, Mario ; Huang, Tim ; amd-gfx@lists.freedesktop.org; Koenig, Christian ; sta...@vger.kernel.org Subject: Re: [PATCH] drm/amd: Add a workaround for GFX11 systems that fail

RE: [PATCH] drm/amd: Add a workaround for GFX11 systems that fail to flush TLB

2023-12-14 Thread Liu, Shaoyun
[AMD Official Use Only - General] The gmc flush tlb function is used on both baremetal and sriov. But the function amdgpu_virt_kiq_reg_write_reg_wait is defined in amdgpu_virt.c with name 'virt' make it appear as a SRIOV only function, this sounds confusion . Will it make more sense to

RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-13 Thread Liu, Shaoyun
be discussed offline with the MES team. We're not going to gain ground discussing this here. The solution has already been merged. Feel free to propose a better solution if you're not satisfied with this one. Jon From: Liu, Shaoyun mailto:shaoyun@amd.com>> Sent: Tuesday, December 12, 2023 11

Re: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-12 Thread Liu, Shaoyun
kef> From: Kim, Jonathan Sent: Tuesday, December 12, 2023 8:19:09 PM To: Liu, Shaoyun ; Huang, JinHuiEric ; amd-gfx@lists.freedesktop.org Cc: Wong, Alice ; Kuehling, Felix ; Kasiviswanathan, Harish Subject: RE: [PATCH] drm/amdkfd: fix mes set shader debugger process mana

RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-12 Thread Liu, Shaoyun
To: Liu, Shaoyun ; Huang, JinHuiEric ; amd-gfx@lists.freedesktop.org Cc: Wong, Alice ; Kuehling, Felix ; Kasiviswanathan, Harish Subject: RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management [Public] > -Original Message- > From: Liu, Shaoyun > Sent: Tuesday, De

RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-12 Thread Liu, Shaoyun
, Jonathan Sent: Tuesday, December 12, 2023 4:48 PM To: Liu, Shaoyun ; Huang, JinHuiEric ; amd-gfx@lists.freedesktop.org Cc: Wong, Alice ; Kuehling, Felix ; Kasiviswanathan, Harish Subject: RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management [Public] > -Original Mess

RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-12 Thread Liu, Shaoyun
Message- From: Kim, Jonathan Sent: Tuesday, December 12, 2023 4:33 PM To: Liu, Shaoyun ; Huang, JinHuiEric ; amd-gfx@lists.freedesktop.org Cc: Wong, Alice ; Kuehling, Felix ; Kasiviswanathan, Harish Subject: RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management [Public

RE: [PATCH] drm/amdkfd: fix mes set shader debugger process management

2023-12-12 Thread Liu, Shaoyun
[AMD Official Use Only - General] Does this requires the new MES FW for this process_ctx_flush requirement ? Can driver side add logic to guaranty when call SET_SHADER_DEBUGGER, the process address is always valid ? Regards Shaoyun.liu -Original Message- From: amd-gfx On Behalf

RE: [PATCH] drm: Disable XNACK on SRIOV environment

2023-11-02 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks ok to me . Reviewed-by: Shaoyun.liu -Original Message- From: Kakarya, Surbhi Sent: Thursday, November 2, 2023 12:10 PM To: Kakarya, Surbhi ; amd-gfx@lists.freedesktop.org; Yang, Philip ; Liu, Shaoyun Subject: RE: [PATCH] drm: Disable XNACK

RE: [RFC 1/7] drm/amdgpu: UAPI for user queue management

2023-01-03 Thread Liu, Shaoyun
[AMD Official Use Only - General] What about the existing rocm apps that already use the hsakmt APIs for user queue ? Shaoyun.liu -Original Message- From: Alex Deucher Sent: Tuesday, January 3, 2023 2:22 PM To: Liu, Shaoyun Cc: Kuehling, Felix ; Sharma, Shashank ; amd-gfx

RE: [RFC 1/7] drm/amdgpu: UAPI for user queue management

2023-01-03 Thread Liu, Shaoyun
[AMD Official Use Only - General] Hsakmt has the interfaces for compute user queue. Do we want a unify API for both graphic and compute ? Regards Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Felix Kuehling Sent: Tuesday, January 3, 2023 1:30 PM To: Sharma, Shashank

RE: [PATCH] drm/amdgpu: remove evict_resource for sriov when suspend.

2022-12-05 Thread Liu, Shaoyun
[AMD Official Use Only - General] I agree with Christian . Although on some hypervisior with live migration support , there will be specific API between OS and PF driver to handle the FB content save/restore for VF, in this case , guest side save/restore is not necessary. On other

RE: [PATCH] drm/amdgpu: Ignore stop rlc on SRIOV environment.

2022-11-09 Thread Liu, Shaoyun
[AMD Official Use Only - General] Rewed-by: shaoyun liu -Original Message- From: amd-gfx On Behalf Of Alex Deucher Sent: Wednesday, November 9, 2022 2:07 PM To: Wan, Gavin Cc: amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amdgpu: Ignore stop rlc on SRIOV environment. On Wed,

RE: [PATCH 2/5] drm/amdgpu: stop resubmitting jobs for bare metal reset

2022-10-26 Thread Liu, Shaoyun
-submission for all kind of reset since kernel already signal the reset event to user level (at least for compute stack) ? Regard Sshaoyun.liu -Original Message- From: Koenig, Christian Sent: Wednesday, October 26, 2022 1:27 PM To: Liu, Shaoyun ; Tuikov, Luben ; Prosyak, Vitaly ; Deucher

RE: [PATCH 2/5] drm/amdgpu: stop resubmitting jobs for bare metal reset

2022-10-26 Thread Liu, Shaoyun
[AMD Official Use Only - General] The user space shouldn't care about SRIOV or not , I don't think we need to keep the re-submission for SRIOV as well. The reset from SRIOV could trigger the host do a whole GPU reset which will have the same issue as bare metal. Regards Shaoyun.liu

Re: [PATCH] drm/amdgpu: Skip put_reset_domain if it doesnt exist

2022-09-28 Thread Liu, Shaoyun
Looks OK to me. Reviewed by : shaoyun.liu From: Chander, Vignesh Sent: September 28, 2022 3:03 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun ; Chander, Vignesh Subject: [PATCH] drm/amdgpu: Skip put_reset_domain if it doesnt exist For xgmi sriov

RE: [PATCH] drm/amdgpu: Skip put_reset_domain if it doesnt exist

2022-09-28 Thread Liu, Shaoyun
ent: Wednesday, September 28, 2022 1:38 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun ; Chander, Vignesh Subject: [PATCH] drm/amdgpu: Skip put_reset_domain if it doesnt exist Change-Id: Ifd6121fb94db3fadaa1dee61d35699abe1259409 Signed-off-by: Vignesh Chander --- drivers/gpu/drm/amd/amd

RE: [PATCH 1/4] drm/amdgpu: Introduce gfx software ring(v3)

2022-09-12 Thread Liu, Shaoyun
[AMD Official Use Only - General] Just curious about what's this gfx software ring used for ? who decide the priority , can user request a higher priority or it's predefined ? Thanks Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Andrey Grodzovsky Sent: Monday,

RE: [PATCH] drm/amdgpu: Fix hive reference count leak

2022-09-09 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks good to me . -Original Message- From: Chander, Vignesh Sent: Friday, September 9, 2022 12:52 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun ; Chander, Vignesh Subject: [PATCH] drm/amdgpu: Fix hive reference count leak both get_xgmi_hive

RE: [PATCH] drm/amdgpu: Use per device reset_domain for XGMI on sriov configuration

2022-09-07 Thread Liu, Shaoyun
[AMD Official Use Only - General] ping -Original Message- From: Liu, Shaoyun Sent: Wednesday, September 7, 2022 11:38 AM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amdgpu: Use per device reset_domain for XGMI on sriov configuration For SRIOV configuration

RE: [PATCH] drm/amdgpu: skip set_topology_info for VF

2022-08-19 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks good to me . Reviewed-By : shaoyun.liu -Original Message- From: Chander, Vignesh Sent: Thursday, August 18, 2022 1:38 PM To: amd-gfx@lists.freedesktop.org Cc: Kim, Jonathan ; Liu, Shaoyun ; Chander, Vignesh Subject: [PATCH] drm/amdgpu: skip

RE: [Patch V3] drm/amdgpu: Increase tlb flush timeout for sriov

2022-08-11 Thread Liu, Shaoyun
[AMD Official Use Only - General] >From HW point of view , the maximum VF number can reach 16 instead of 12 . >Although currently no product will use the 16 VFs together, not sure about >the future. You can added Acked-by me. I will let Alex & Christion decide whether accept this change.

RE: [PATCH] drm/amdgpu: use sjt mec fw on aldebaran for sriov

2022-08-10 Thread Liu, Shaoyun
the VF with sjt version can be initialized and enabled . Regards Shaoyun.liu -Original Message- From: Alex Deucher Sent: Wednesday, August 10, 2022 12:35 PM To: Liu, Shaoyun Cc: amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amdgpu: use sjt mec fw on aldebaran for sriov On Fri

Re: [PATCH] Increase tlb flush timeout for sriov

2022-08-08 Thread Liu, Shaoyun
As I discussed with Alice ,this change is when multi-vf running compute benchmark (Luxmark) at the same time, which involves multiple vf do the tlb invalidation at the same time. They observed kiq timeout after submit the tlb invalidate command. Although each vf has the invalidate register

RE: [PATCH] drm/amdgpu: fix hive reference leak when reflecting psp topology info

2022-07-28 Thread Liu, Shaoyun
[AMD Official Use Only - General] Looks good to me . BTW , why we didn't catch it on baremetal mode ? Reviewed-by: Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Jonathan Kim Sent: Thursday, July 28, 2022 1:06 PM To: amd-gfx@lists.freedesktop.org Cc: Kim, Jonathan

RE: [PATCH] drm/amdgpu: Ta fw needs to be loaded for SRIOV aldebaran

2022-04-22 Thread Liu, Shaoyun
[AMD Official Use Only] Looks ok to me . You can add reviewed-by: Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Friday, April 22, 2022 12:09 PM To: amd-gfx@lists.freedesktop.org Cc: Yu, David Subject: [PATCH] drm/amdgpu: Ta fw needs to be loaded for SRIOV

RE: [PATCH] drm/amdgpu: Ta fw needs to be loaded for SRIOV aldebaran

2022-04-22 Thread Liu, Shaoyun
[AMD Official Use Only] Please add some more info in the description to explain why we need to add TA in SRIOV guest . Regard Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Friday, April 22, 2022 10:58 AM To: amd-gfx@lists.freedesktop.org Cc: Yu, David

RE: [PATCH] drm/amdgpu: fix aldebaran xgmi topology for vf

2022-03-09 Thread Liu, Shaoyun
, Jonathan Sent: Wednesday, March 9, 2022 6:31 PM To: Kuehling, Felix ; amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: RE: [PATCH] drm/amdgpu: fix aldebaran xgmi topology for vf [Public] > -Original Message- > From: Kuehling, Felix > Sent: March 9, 2022 6:12 PM > To: K

RE: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread Liu, Shaoyun
[AMD Official Use Only] Reviewed by : Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Thursday, March 3, 2022 11:25 AM To: amd-gfx@lists.freedesktop.org Cc: Yu, David Subject: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran Add DFC CAP support for

RE: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-03 Thread Liu, Shaoyun
[AMD Official Use Only] Probably just described as follows : Initialize cap microcode in psp_init_sriov_microcode, the ta microcode will be initialized in psp_vxx_init_microcode -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Thursday, March 3, 2022 9:10 AM To:

RE: [PATCH] drm/amdgpu: Add DFC CAP support for aldebaran

2022-03-02 Thread Liu, Shaoyun
[AMD Official Use Only] Can you added more information in the description ? Like why we should not load ta for Aldebaran here. Regards Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of David Yu Sent: Wednesday, March 2, 2022 10:20 PM To: amd-gfx@lists.freedesktop.org Cc:

RE: [PATCH] drm/amdgpu: Fix wait for RLCG command completion

2022-02-15 Thread Liu, Shaoyun
; Liu, Shaoyun Subject: RE: [PATCH] drm/amdgpu: Fix wait for RLCG command completion [AMD Official Use Only] +Shaoyun -Original Message- From: Zhang, Bokun Sent: Monday, February 14, 2022 4:09 PM To: Skvortsov, Victor ; amd-gfx@lists.freedesktop.org Cc: Skvortsov, Victor Subject: RE

RE: [RFC v4 04/11] drm/amd/virt: For SRIOV send GPU reset directly to TDR queue.

2022-02-08 Thread Liu, Shaoyun
, Andrey Sent: Tuesday, February 8, 2022 7:23 PM To: dri-de...@lists.freedesktop.org; amd-gfx@lists.freedesktop.org Cc: Koenig, Christian ; dan...@ffwll.ch; Liu, Monk ; Chen, Horace ; Lazar, Lijo ; Chen, JingWen ; Grodzovsky, Andrey ; Liu, Shaoyun Subject: [RFC v4 04/11] drm/amd/virt: For SRIOV

RE: [PATCH] drm/amdgpu: Fix kernel compilation; style

2022-01-20 Thread Liu, Shaoyun
[AMD Official Use Only] Good catch . Thanks . Reviewed by : shaoyun.liu -Original Message- From: Tuikov, Luben Sent: Thursday, January 20, 2022 6:52 PM To: amd-gfx@lists.freedesktop.org Cc: Tuikov, Luben ; Deucher, Alexander ; Liu, Shaoyun ; Russell, Kent Subject: [PATCH] drm

RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection for SRIOV

2022-01-04 Thread Liu, Shaoyun
: Grodzovsky, Andrey Sent: Tuesday, January 4, 2022 3:55 PM To: Liu, Shaoyun ; Koenig, Christian ; Liu, Monk ; Chen, JingWen ; Christian König ; Deng, Emily ; dri-de...@lists.freedesktop.org; amd-gfx@lists.freedesktop.org; Chen, Horace Cc: dan...@ffwll.ch Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop

RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection for SRIOV

2022-01-04 Thread Liu, Shaoyun
[AMD Official Use Only] I mostly agree with the sequences Christian described . Just one thing might need to discuss here. For FLR notified from host, in new sequenceas described , driver need to reply the READY_TO_RESET in the workitem from a reset work queue which means inside

RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection for SRIOV

2021-12-23 Thread Liu, Shaoyun
[AMD Official Use Only] I have a discussion with Andrey about this offline. It seems dangerous to remove the in_gpu_reset and reset_semm directly inside the flr_work. In the case when the reset is triggered from host side , gpu need to be locked while host perform reset after flr_work

RE: [RFC 0/6] Define and use reset domain for GPU recovery in amdgpu

2021-12-20 Thread Liu, Shaoyun
[AMD Official Use Only] Hi , Andrey I actually has some concerns about this change . 1. on SRIOV configuration , the reset notify coming from host , and driver already trigger a work queue to handle the reset (check xgpu_*_mailbox_flr_work) , is it a good idea to trigger another work

RE: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

2021-12-17 Thread Liu, Shaoyun
[AMD Official Use Only] Reviewed by: Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of sashank saye Sent: Friday, December 17, 2021 1:56 PM To: amd-gfx@lists.freedesktop.org Cc: Saye, Sashank Subject: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr

RE: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

2021-12-17 Thread Liu, Shaoyun
[AMD Official Use Only] Comment inline . -Original Message- From: amd-gfx On Behalf Of sashank saye Sent: Friday, December 17, 2021 1:19 PM To: amd-gfx@lists.freedesktop.org Cc: Saye, Sashank Subject: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

RE: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

2021-12-17 Thread Liu, Shaoyun
[AMD Official Use Only] >From your explanation , seems SMU always need this special handling for SBR >on passthrough mode , but in the code , that only apply to XGMI >configuration. Should you change that as well ? Two comments inline. Regards Shaoyun.liu -Original Message-

RE: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

2021-12-17 Thread Liu, Shaoyun
the SBR on VM on/off and SMU will handle the reset. Can you check after this reset , will SMU still alive ? If it's alive , the driver will trigger the reset again . Regards Shaoyun.liu -Original Message- From: Saye, Sashank Sent: Friday, December 17, 2021 11:53 AM To: Liu, Shaoyun

RE: [PATCH] drm/amdgpu: Send Message to SMU on aldebaran passthrough for sbr handling

2021-12-17 Thread Liu, Shaoyun
[AMD Official Use Only] First , the name of heavy SBR is confusing when you need to go through light SBR code path. Secondary, originally we introduce the light SBR is because on older asic, FW can not synchronize the reset on the devices within the hive, so it depends on driver to sync

RE: [PATCH v3 4/5] drm/amdgpu: get xgmi info before ip_init

2021-12-16 Thread Liu, Shaoyun
[AMD Official Use Only] Reviewed by: shaoyun.liu -Original Message- From: Skvortsov, Victor Sent: Thursday, December 16, 2021 2:43 PM To: amd-gfx@lists.freedesktop.org; Deng, Emily ; Liu, Monk ; Ming, Davis ; Liu, Shaoyun ; Zhou, Peng Ju ; Chen, JingWen ; Chen, Horace ; Nieto

RE: [PATCH 4/5] drm/amdgpu: Initialize Aldebaran RLC function pointers

2021-12-16 Thread Liu, Shaoyun
early_init . Regards Shaoyun.liu -Original Message- From: Skvortsov, Victor Sent: Thursday, December 16, 2021 9:28 AM To: Alex Deucher Cc: amd-gfx list ; Deng, Emily ; Liu, Monk ; Ming, Davis ; Liu, Shaoyun ; Zhou, Peng Ju ; Chen, JingWen ; Chen, Horace ; Nieto, David M Subject: RE:

RE: [PATCH v2] drm/amdgpu: Separate vf2pf work item init from virt data exchange

2021-12-16 Thread Liu, Shaoyun
[AMD Official Use Only] This one looks better and more logical . Reviewed By :Shaoyun.liu -Original Message- From: Skvortsov, Victor Sent: Thursday, December 16, 2021 10:39 AM To: amd-gfx@lists.freedesktop.org; Liu, Shaoyun ; Nieto, David M Cc: Skvortsov, Victor Subject: [PATCH

RE: [PATCH 1/2] drm/amdgpu: Separate vf2pf work item init from virt data exchange

2021-12-15 Thread Liu, Shaoyun
[AMD Official Use Only] Looks ok to me . This serial is Reviewed by: Shaoyun.liu Regards Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Victor Skvortsov Sent: Thursday, December 9, 2021 11:48 AM To: amd-gfx@lists.freedesktop.org Cc: Skvortsov, Victor Subject: [PATCH 1/2]

RE: [PATCH v2 1/2] drm/amd/amdgpu: fix psp tmr bo pin count leak in SRIOV

2021-12-14 Thread Liu, Shaoyun
[AMD Official Use Only] These workaround code looks confusing. For PSP TMR , I think guest side should avoid to load it totally since it's loaded in host side. For gart table , in current code path probably it's ok, but I think if we have a correct sequence in SRIOV , we shouldn't have

RE: [PATCH] drm/amdgpu: SRIOV flr_work should use down_write

2021-12-09 Thread Liu, Shaoyun
[AMD Official Use Only] Sounds reasonable. This patch is Reviewed by : Shaoyun.liu Regards Shaoyun.liu -Original Message- From: Skvortsov, Victor Sent: Thursday, December 9, 2021 1:33 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: RE: [PATCH] drm/amdgpu: SRIOV

RE: [PATCH] drm/amdgpu: SRIOV flr_work should use down_write

2021-12-09 Thread Liu, Shaoyun
[AMD Official Use Only] I think it's a good catch for reset_sem, any reason to change the adev->in_gpu_reset ? Regards Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Victor Skvortsov Sent: Thursday, December 9, 2021 12:02 PM To: amd-gfx@lists.freedesktop.org Cc:

RE: [PATCH 1/4] drm/amdgpu: skip reset other device in the same hive if it's SRIOV VF

2021-12-07 Thread Liu, Shaoyun
[AMD Official Use Only] Ok , sounds reasonable. With the suggested modification Patch 1, 2, 3, are Reviewed by : Shaoyun.liu . Patch4 is Acked by : Shaoyun.liu . Regards Shaoyun.liu -Original Message- From: Luo, Zhigang Sent: Tuesday, December 7, 2021 4:55 PM To: Liu, Shaoyun

RE: [PATCH 1/4] drm/amdgpu: skip reset other device in the same hive if it's SRIOV VF

2021-12-07 Thread Liu, Shaoyun
[AMD Official Use Only] This patch looks ok to me . Patch 2 is actually add the PSP xgmi init not the whole XGMI init, can you change the description according to this ? Patch 3, You take the hive lock inside the reset sriov function , but the hive lock already be took before this

RE: [PATCH] drm/amdgpu: skip reset other device in the same hive if it's sriov vf

2021-12-03 Thread Liu, Shaoyun
[AMD Official Use Only] I think you need to describe more details on why the hive reset on guest side is not necessary and how host and guest driver will work together to handle the hive reset . You should have 2 patches together as a serials to handle the FLR and mode 1 reset on XGMI

RE: [PATCH] drm/amdgpu: adjust the kfd reset sequence in reset sriov function

2021-11-30 Thread Liu, Shaoyun
Thanks for the review , change the description as suggested and submitted. Shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Tuesday, November 30, 2021 1:19 AM To: amd-gfx@lists.freedesktop.org; Liu, Shaoyun Subject: Re: [PATCH] drm/amdgpu: adjust the kfd reset sequence

RE: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function

2021-11-22 Thread Liu, Shaoyun
, Felix Sent: Monday, November 22, 2021 10:40 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function Am 2021-11-18 um 11:57 a.m. schrieb shaoyunl: > For sriov XGMI configuration, the host driver will handle the h

RE: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function

2021-11-22 Thread Liu, Shaoyun
[AMD Official Use Only] ping -Original Message- From: Liu, Shaoyun Sent: Thursday, November 18, 2021 10:08 PM To: amd-gfx@lists.freedesktop.org Subject: RE: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function [AMD Official Use Only] Ping -Original Message

RE: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function

2021-11-18 Thread Liu, Shaoyun
[AMD Official Use Only] Ping -Original Message- From: Liu, Shaoyun Sent: Thursday, November 18, 2021 11:58 AM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amd/amdgpu: move kfd post_reset out of reset_sriov function For sriov XGMI configuration, the host

RE: [PATCH] drm/amd/amdkfd: Fix kernel panic when reset failed and been triggered again

2021-11-15 Thread Liu, Shaoyun
[AMD Official Use Only] Om, sounds reasonable Thanks Shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Monday, November 15, 2021 11:07 AM To: amd-gfx@lists.freedesktop.org; Liu, Shaoyun Subject: Re: [PATCH] drm/amd/amdkfd: Fix kernel panic when reset failed and been

RE: [PATCH] drm/amd/amdgpu: fix the kfd pre_reset sequence in sriov

2021-11-05 Thread Liu, Shaoyun
, thanks to bring this up, it will remind us to verify on the XGMI configuration on SRIOV. Regards shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Friday, November 5, 2021 1:48 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH] drm/amd/amdgpu: fix

RE: [PATCH] drm/amdgpu: Get atomicOps info from Host for sriov setup

2021-09-10 Thread Liu, Shaoyun
[AMD Official Use Only] Good catch . my editor seems has auto complete feature and I just select the first one . ☹ Thanks Shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Friday, September 10, 2021 10:19 AM To: amd-gfx@lists.freedesktop.org; Liu, Shaoyun Subject: Re

RE: [PATCH v3 1/1] drm/amdkfd: make needs_pcie_atomics FW-version dependent

2021-09-10 Thread Liu, Shaoyun
[AMD Official Use Only] Looks good to me . Reviewed by Shaoyun.liu < shaoyun@amd.com> -Original Message- From: Kuehling, Felix Sent: Friday, September 10, 2021 1:10 AM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: Re: [PATCH v3 1/1] drm/amdkfd

RE: [PATCH] drm/amdgpu: Get atomicOps info from Host for sriov setup

2021-09-09 Thread Liu, Shaoyun
[AMD Official Use Only] Thanks for the review . I accepted your comments and will sent another change list for review once your change is in. Regards Shaoyun.liu -Original Message- From: Kuehling, Felix Sent: Thursday, September 9, 2021 12:18 PM To: Liu, Shaoyun ; amd-gfx

RE: [PATCH] drm/amdgpu: correct MMSCH 1.0 version

2021-08-16 Thread Liu, Shaoyun
[AMD Official Use Only] Looks ok to me . Reviewed by Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Zhigang Luo Sent: Monday, August 16, 2021 11:04 AM To: amd-gfx@lists.freedesktop.org Cc: Luo, Zhigang Subject: [PATCH] drm/amdgpu: correct MMSCH 1.0 version MMSCH 1.0

RE: [PATCH] drm/amdgpu: correct MMSCH version

2021-08-16 Thread Liu, Shaoyun
[AMD Official Use Only] Is that information from MM team ? Please make sure it won't break the ASICs that use the same code path. Also If this is true for all mmsch_v1.0 , you need to specify this is mmSCH v1.0 , since other MMSCH version will still use this major and minor. Shaoyun.liu

RE: [PATCH 5/5] drm/amdgpu: allocate psp fw private buffer from VRAM for sriov vf

2021-06-03 Thread Liu, Shaoyun
[AMD Official Use Only] I will leave Hawking to comment on this serial . Thanks Shaoyun.liu -Original Message- From: Luo, Zhigang Sent: Thursday, June 3, 2021 11:48 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: RE: [PATCH 5/5] drm/amdgpu: allocate psp fw private

RE: [PATCH 4/5] drm/amdgpu: add psp microcode init for arcturus and aldebaran sriov vf

2021-06-03 Thread Liu, Shaoyun
[AMD Official Use Only] This one doesn't looks apply to XGMI TA only , it's for whole PSP init , can you double check it ? Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Zhigang Luo Sent: Thursday, June 3, 2021 10:13 AM To: amd-gfx@lists.freedesktop.org Cc: Luo,

RE: [PATCH 3/5] drm/amdgpu: remove sriov vf mmhub system aperture and fb location programming

2021-06-03 Thread Liu, Shaoyun
[AMD Official Use Only] Looks ok to me . -Original Message- From: amd-gfx On Behalf Of Zhigang Luo Sent: Thursday, June 3, 2021 10:13 AM To: amd-gfx@lists.freedesktop.org Cc: Luo, Zhigang Subject: [PATCH 3/5] drm/amdgpu: remove sriov vf mmhub system aperture and fb location

RE: [PATCH 2/5] drm/amdgpu: remove sriov vf gfxhub fb location programming

2021-06-03 Thread Liu, Shaoyun
[AMD Official Use Only] This looks will affect other ASIC , Can you double check that ? -Original Message- From: amd-gfx On Behalf Of Zhigang Luo Sent: Thursday, June 3, 2021 10:13 AM To: amd-gfx@lists.freedesktop.org Cc: Luo, Zhigang Subject: [PATCH 2/5] drm/amdgpu: remove sriov vf

RE: [PATCH 1/5] drm/amdgpu: remove sriov vf checking from getting fb location

2021-06-03 Thread Liu, Shaoyun
[AMD Official Use Only] Looks ok to me . Reviewed-By : Shaoyun.liu -Original Message- From: amd-gfx On Behalf Of Zhigang Luo Sent: Thursday, June 3, 2021 10:13 AM To: amd-gfx@lists.freedesktop.org Cc: Luo, Zhigang Subject: [PATCH 1/5] drm/amdgpu: remove sriov vf checking from

RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration

2021-03-12 Thread Liu, Shaoyun
valid our original design. Regards Shaoyun.liu From: Lazar, Lijo Sent: Friday, March 12, 2021 11:54 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org; Quan, Evan ; Zhang, Hawking Subject: RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration [AMD Public Use] Looks like

RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration

2021-03-12 Thread Liu, Shaoyun
d Quan for comments . Regards Shaoyun.liu From: Lazar, Lijo Sent: Friday, March 12, 2021 8:55 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration [AMD Public Use] We want to keep ppt_funcs minimal. Adding ever

Re: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration

2021-03-12 Thread Liu, Shaoyun
will have this support without further code change. Thanks Shaoyun.liu From: Lazar, Lijo Sent: March 11, 2021 10:42 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough

RE: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration

2021-03-11 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] Ping . -Original Message- From: Liu, Shaoyun Sent: Thursday, March 11, 2021 12:16 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH] drm/amdgpu: Enable light SBR in XGMI+passthrough configuration This is to fix

RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-03-08 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] Hi, Andrey. The first 3 patches in this serial already been acked by Alex. D, can you help review the rest two ? Thanks Shaoyun.liu -Original Message- From: Grodzovsky, Andrey Sent: Monday, March 8, 2021 10:53 AM To: Liu, Shaoyun

RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-03-08 Thread Liu, Shaoyun
estroy the hive , build the device_list for amdgpu_do_asic_reset without the hive . Hope I explain it clearly . Thanks Shaoyun.liu -Original Message- From: Grodzovsky, Andrey Sent: Monday, March 8, 2021 1:28 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH 5/

RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-03-06 Thread Liu, Shaoyun
- From: amd-gfx On Behalf Of Liu, Shaoyun Sent: Saturday, March 6, 2021 3:41 PM To: Grodzovsky, Andrey ; amd-gfx@lists.freedesktop.org Subject: RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe [AMD Official Use Only - Internal Distribution Only]

RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-03-06 Thread Liu, Shaoyun
all the GPUs in the system which means when this delayed thread starts , we can assume all the devices already been populated in mgpu_info. Regards Shaoyun.liu -Original Message- From: Grodzovsky, Andrey Sent: Saturday, March 6, 2021 1:09 AM To: Liu, Shaoyun ; amd-gfx

RE: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-03-05 Thread Liu, Shaoyun
verify it. Regards Shaoyun.liu -Original Message- From: Grodzovsky, Andrey Sent: Friday, March 5, 2021 2:27 PM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH 5/5] drm/amdgpu: Reset the devices in the XGMI hive duirng probe On 2021-03-05 12:52 p.m., shaoyunl

RE: [PATCH 4/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-02-24 Thread Liu, Shaoyun
can not guarantee that . For none-XGMI configuration , there is no this requirement , SBR can reset the GPU correctly . Regards Shaoyun.liu -Original Message- From: Lazar, Lijo Sent: Wednesday, February 24, 2021 8:58 AM To: Liu, Shaoyun ; amd-gfx@lists.freedesktop.org Cc: Liu

RE: [PATCH 2/4] drm/amdgpu: get xgmi info at eary_init

2021-02-23 Thread Liu, Shaoyun
-Original Message- From: Alex Deucher Sent: Tuesday, February 23, 2021 11:26 AM To: Liu, Shaoyun Cc: amd-gfx list Subject: Re: [PATCH 2/4] drm/amdgpu: get xgmi info at eary_init On Thu, Feb 18, 2021 at 8:19 PM shaoyunl wrote: > > Driver need to get XGMI info function earlier before i

RE: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-02-23 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] Comments inline , -Original Message- From: Alex Deucher Sent: Tuesday, February 23, 2021 11:47 AM To: Liu, Shaoyun Cc: amd-gfx list Subject: Re: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe On Thu, Feb

RE: [PATCH 4/4] drm/amdgpu: Init the cp MQD if it's not be initialized before

2021-02-22 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] -Original Message- From: Liu, Shaoyun Sent: Thursday, February 18, 2021 8:20 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH 4/4] drm/amdgpu: Init the cp MQD if it's not be initialized before The MQD might

RE: [PATCH 3/4] drm/amdgpu: Add kfd init_complete flag to check from amdgpu side

2021-02-22 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] -Original Message- From: Liu, Shaoyun Sent: Thursday, February 18, 2021 8:20 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH 3/4] drm/amdgpu: Add kfd init_complete flag to check from amdgpu side amdgpu

RE: [PATCH 2/4] drm/amdgpu: get xgmi info at eary_init

2021-02-22 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] -Original Message- From: Liu, Shaoyun Sent: Thursday, February 18, 2021 8:19 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH 2/4] drm/amdgpu: get xgmi info at eary_init Driver need to get XGMI info function

RE: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-02-22 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] Ping. -Original Message- From: Liu, Shaoyun Sent: Thursday, February 18, 2021 8:19 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

RE: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

2021-02-19 Thread Liu, Shaoyun
[AMD Official Use Only - Internal Distribution Only] Ping . -Original Message- From: Liu, Shaoyun Sent: Thursday, February 18, 2021 8:19 PM To: amd-gfx@lists.freedesktop.org Cc: Liu, Shaoyun Subject: [PATCH 1/4] drm/amdgpu: Reset the devices in the XGMI hive duirng probe

  1   2   3   >