RE: [PATCH] drm/amdgpu: update GC golden setting for navy_flounder

2020-07-28 Thread Zhou1, Tao
[AMD Official Use Only - Internal Distribution Only] Reviewed-by: Tao Zhou > -Original Message- > From: Jiansong Chen > Sent: Wednesday, July 29, 2020 12:02 PM > To: amd-gfx@lists.freedesktop.org > Cc: Zhou1, Tao ; Chen, Jiansong (Simon) > > Subject: [PATCH] drm/amdgpu: update GC

[PATCH] drm/amdgpu: update GC golden setting for navy_flounder

2020-07-28 Thread Jiansong Chen
Update GC golden setting for navy_flounder. Signed-off-by: Jiansong Chen Change-Id: Ia7e82616b0be48f397c73b015823ac10ef907f08 --- drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c

[PATCH 11/12] drm/amdgpu: disable page reservation when amdgpu_bad_page_threshold = 0

2020-07-28 Thread Guchun Chen
When amdgpu_bad_page_threshold = 0, bad page reservation is skipped in both the UMC ECC irq and the page retirement call of the sync flood isr. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 +++-- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 5 +++-- 2 files changed, 6

[PATCH 09/12] drm/amdgpu: add one definition for RAS's sysfs/debugfs name

2020-07-28 Thread Guchun Chen
Add one definition for the RAS module's FS name. It's used in both debugfs and sysfs cases. v2: Use static variable instead of macro definition. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 - 1 file changed, 8 insertions(+), 5 deletions(-) diff

[PATCH 12/12] drm/amdgpu: update eeprom once specifying one bigger threshold

2020-07-28 Thread Guchun Chen
During the driver's probe, when it hits the bad gpu tag in the eeprom i2c init call (the tag was set when the reported bad pages reached the bad page threshold in the driver's last working loop), there are some strategies to deal with the cases: 1. when the module parameter amdgpu_bad_page_threshold = 0, that means page

[PATCH 08/12] drm/amdgpu: restore ras flags when user resets eeprom

2020-07-28 Thread Guchun Chen
RAS flags need to be cleaned as well when the user requests a clean eeprom. v2: RAS flags shall be restored after eeprom reset succeeds. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 ++--- 1 file changed, 10 insertions(+), 3 deletions(-) diff --git

[PATCH 10/12] drm/amdgpu: decouple sysfs creating of bad page node

2020-07-28 Thread Guchun Chen
Bad page information should not be exposed by sysfs when bad page retirement is disabled, so decouple it from ras sysfs group creation, and add one guard before creating it. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 71 - 1 file changed, 46

[PATCH 02/12] drm/amdgpu: validate bad page threshold in ras

2020-07-28 Thread Guchun Chen
The bad page threshold value is valid only in the range between -1 and the max records length of the eeprom. It determines when the saved bad pages exceed the threshold value, so that the corresponding actions can be taken. v2: When using the default typical value, it should be the min value between the typical value and eeprom
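The range check and the v2 default described in this snippet can be sketched in plain C. This is only an illustration under stated assumptions, not the code from the patch: the helper names and the `typical` and `max_records` parameters are hypothetical, the upper bound is assumed to be inclusive, and the truncated v2 sentence is assumed to refer to the eeprom record capacity.

```c
#include <stdbool.h>

/* Hypothetical helper: true when the requested threshold lies in the
 * range the commit message describes. The inclusive upper bound is an
 * assumption of this sketch. */
static bool threshold_in_range(int threshold, int max_records)
{
    return threshold >= -1 && threshold <= max_records;
}

/* Hypothetical helper for the v2 note: the default is taken to be the
 * smaller of the typical value and the eeprom record capacity. */
static int default_threshold(int typical, int max_records)
{
    return typical < max_records ? typical : max_records;
}
```

What the driver does with an out-of-range value is not visible in the snippet, so the sketch only reports whether the value is in range.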

[PATCH 06/12] drm/amdgpu: schedule ras recovery when reaching bad page threshold

2020-07-28 Thread Guchun Chen
Once the bad pages saved to eeprom reach the configured threshold, ras recovery will be issued to notify the user. v2: Fix spelling typo. Signed-off-by: Guchun Chen --- .../gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c| 37 ++- 1 file changed, 36 insertions(+), 1 deletion(-) diff

[PATCH 04/12] drm/amdgpu: break driver init process when it's bad GPU

2020-07-28 Thread Guchun Chen
When the bad gpu tag is retrieved from eeprom, GPU init should fail, as the GPU needs to be retired for further checks. v2: Fix spelling typo, correct the condition to detect bad gpu tag and refine error message. v3: Refine function argument name. v4: Fix missing check of returning value of i2c

[PATCH 07/12] drm/amdgpu: break GPU recovery once it's in bad state

2020-07-28 Thread Guchun Chen
When the GPU executes recovery and retrieves the bad GPU tag from the external eeprom device, the recovery is aborted and an error message is printed for the user's awareness. v2: Refine warning message in threshold reaching case, and fix spelling typo. v3: Fix explicit calling of bad gpu. v4:

[PATCH 05/12] drm/amdgpu: skip bad page reservation once issuing from eeprom write

2020-07-28 Thread Guchun Chen
Once the ras recovery is issued from the eeprom write itself, bad page reservation should be skipped; otherwise, recursive writes to eeprom would happen. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 -- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 14

[PATCH 03/12] drm/amdgpu: add bad gpu tag definition

2020-07-28 Thread Guchun Chen
This tag will be used for bad gpu detection during eeprom access. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c | 3 +++ 1 file changed, 3 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c

[PATCH 00/12] BAD GPU retirement policy by total bad pages

2020-07-28 Thread Guchun Chen
This series enables/disables the bad page feature and applies different bad page reservation strategies according to the bad page threshold configuration. When the saved bad pages written to eeprom reach the threshold, one ras recovery will be issued immediately and the recovery will fail to tell user

[PATCH 01/12] drm/amdgpu: add bad page count threshold in module parameter

2020-07-28 Thread Guchun Chen
bad_page_threshold can be configured to enable/disable the associated bad page retirement feature in RAS. When it's -1, ras will use the typical bad page failure value to handle bad page retirement. When it's 0, bad page retirement is disabled, and no bad page will be recorded or saved. For other
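The two special values named in this snippet can be told apart with a small classifier. This is a hedged sketch only: the enum and function names are hypothetical, and because the snippet is cut off at "For other", the sketch deliberately does not say how any other value is handled.

```c
/* Hypothetical classification of the module parameter value. */
enum demo_threshold_mode {
    DEMO_USE_TYPICAL, /* -1: use the typical bad page failure value */
    DEMO_DISABLED,    /*  0: bad page retirement disabled */
    DEMO_OTHER        /* anything else: handling not shown in the snippet */
};

static enum demo_threshold_mode demo_classify(int bad_page_threshold)
{
    if (bad_page_threshold == -1)
        return DEMO_USE_TYPICAL;
    if (bad_page_threshold == 0)
        return DEMO_DISABLED;
    return DEMO_OTHER;
}
```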

Re: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Yin, Tianci (Rico)
[AMD Official Use Only - Internal Distribution Only] Thanks very much Luben and Guchun! Regards, Rico From: Tuikov, Luben Sent: Wednesday, July 29, 2020 2:44 To: Yin, Tianci (Rico) ; amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Zhang, Hawking ; Xu,

Re: [PATCH 1/2] drm/amdgpu: add interface amdgpu_gfx_init_spm_golden for Navi1x

2020-07-28 Thread Yin, Tianci (Rico)
[AMD Official Use Only - Internal Distribution Only] Thanks very much Luben! Regards, Rico From: Tuikov, Luben Sent: Wednesday, July 29, 2020 2:29 To: Yin, Tianci (Rico) ; amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Zhang, Hawking ; Xu, Feifei ;

Re: [PATCH] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit(v2)

2020-07-28 Thread Yin, Tianci (Rico)
[AMD Public Use] Hi Alex, amdgpu_gfx_off_ctrl() is invoked by a few other functions, like amdgpu_info_ioctl(), so putting the code into amdgpu_gfx_off_ctrl() would waste time on needless SPM golden reconfiguration. amdgpu_gfx_off_ctrl(adev, false); amdgpu_asic_read_register(adev, se_num,

RE: [PATCH] drm/amd/powerplay: update driver if version for navy_flounder

2020-07-28 Thread Zhou1, Tao
[AMD Official Use Only - Internal Distribution Only] Reviewed-by: Tao Zhou > -Original Message- > From: Jiansong Chen > Sent: Tuesday, July 28, 2020 7:21 PM > To: amd-gfx@lists.freedesktop.org > Cc: Zhou1, Tao ; Feng, Kenneth > ; Chen, Jiansong (Simon) > Subject: [PATCH]

Re: [PATCH 01/14] drm/amdgpu: handle bo size 0 in amdgpu_bo_create_kernel_at

2020-07-28 Thread Felix Kuehling
On 2020-07-28 at 6:45 p.m., Alex Deucher wrote: > Just return early to match other bo_create functions. > > Signed-off-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 5 + > 1 file changed, 5 insertions(+) > > diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c >

Re: [PATCH 14/14] drm/amdgpu/gmc: disable keep_stolen_vga_memory on arcturus

2020-07-28 Thread Felix Kuehling
On 2020-07-28 at 6:46 p.m., Alex Deucher wrote: > I suspect the only reason this was set was to avoid touching > the display related registers on arcturus. Someone should > double check this on arcturus with S3. Sounds reasonable, given that the other offenders here are all APUs. AFAIK, we

[PATCH 01/14] drm/amdgpu: handle bo size 0 in amdgpu_bo_create_kernel_at

2020-07-28 Thread Alex Deucher
Just return early to match other bo_create functions. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 5 + 1 file changed, 5 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c index

[PATCH 13/14] drm/amdgpu: drop the CPU pointers for the stolen vga bos

2020-07-28 Thread Alex Deucher
We never use them. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c | 16 +--- 1 file changed, 5 insertions(+), 11 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c index ec975251b171..3df9d5a53741

[PATCH 03/14] drm/amdgpu: use a define for the memory size of the vga emulator

2020-07-28 Thread Alex Deucher
Rather than open coding it everywhere. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu.h| 2 ++ drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c | 2 +- drivers/gpu/drm/amd/amdgpu/gmc_v6_0.c | 2 +- drivers/gpu/drm/amd/amdgpu/gmc_v7_0.c | 2 +-

[PATCH 05/14] drm/amdgpu: move keep stolen memory check into gmc core

2020-07-28 Thread Alex Deucher
Rather than leaving this as a gmc v9 specific hack. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c | 9 - drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 11 +++ 3 files changed, 12 insertions(+), 9 deletions(-)

[PATCH 00/14] rework stolen pre-OS fb allocation handling

2020-07-28 Thread Alex Deucher
Split the allocations into two so we can still support the S3 workarounds required on some platforms while also avoiding any artifacts when transitioning from bios to driver. In the future we could integrate handling of the ip discovery data and other vbios allocations into this helper function

[PATCH 12/14] drm/amdgpu/gmc10: switch to using amdgpu_gmc_get_vbios_allocations

2020-07-28 Thread Alex Deucher
The new helper centralizes the logic in one place. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c | 57 +++--- 1 file changed, 25 insertions(+), 32 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c

[PATCH 11/14] drm/amdgpu/gmc9: switch to using amdgpu_gmc_get_vbios_allocations

2020-07-28 Thread Alex Deucher
The new helper centralizes the logic in one place. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 112 +- 1 file changed, 38 insertions(+), 74 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c

[PATCH 04/14] drm/amdgpu: move stolen vga bo from amdgpu to amdgpu.gmc

2020-07-28 Thread Alex Deucher
Since that is where we store the other data related to the stolen vga memory. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 1 - drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 3 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c | 6 +++---

[PATCH 09/14] drm/amdgpu/gmc7: switch to using amdgpu_gmc_get_vbios_allocations

2020-07-28 Thread Alex Deucher
The new helper centralizes the logic in one place. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gmc_v7_0.c | 9 - 1 file changed, 4 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v7_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v7_0.c index

[PATCH 06/14] drm/amdgpu: add support for extended stolen vga memory

2020-07-28 Thread Alex Deucher
This will allow us to split the allocation for systems where we have to keep the stolen memory around to avoid S3 issues. This way we don't waste as much memory and still avoid any screen artifacts during the bios to driver transition. Signed-off-by: Alex Deucher ---

[PATCH 02/14] drm/amdgpu: use create_at for the stolen pre-OS buffer

2020-07-28 Thread Alex Deucher
Should be functionally the same since nothing else is allocated at that point, but let's be exact. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c | 8 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c

[PATCH 14/14] drm/amdgpu/gmc: disable keep_stolen_vga_memory on arcturus

2020-07-28 Thread Alex Deucher
I suspect the only reason this was set was to avoid touching the display related registers on arcturus. Someone should double check this on arcturus with S3. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 1 - 1 file changed, 1 deletion(-) diff --git

[PATCH 08/14] drm/amdgpu/gmc6: switch to using amdgpu_gmc_get_vbios_allocations

2020-07-28 Thread Alex Deucher
The new helper centralizes the logic in one place. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gmc_v6_0.c | 6 ++ 1 file changed, 2 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v6_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v6_0.c index

[PATCH 07/14] drm/amdgpu/gmc: add new helper to get the FB size used by pre-OS console

2020-07-28 Thread Alex Deucher
This adds a new gmc callback to get the size reserved by the pre-OS console and provides a helper function for use by gmc IP drivers. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 43 + drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 5 +++ 2 files

[PATCH 10/14] drm/amdgpu/gmc8: switch to using amdgpu_gmc_get_vbios_allocations

2020-07-28 Thread Alex Deucher
The new helper centralizes the logic in one place. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gmc_v8_0.c | 9 - 1 file changed, 4 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v8_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v8_0.c index

Re: [PATCH] amdgpu_dm: fix nonblocking atomic commit use-after-free

2020-07-28 Thread daniel
On Tue, Jul 28, 2020 at 01:07:13PM -0400, Kazlauskas, Nicholas wrote: > On 2020-07-28 5:22 a.m., Paul Menzel wrote: > > Dear Linux folks, > > > > > > On 25.07.20 at 07:20, Mazin Rezk wrote: > > > On Saturday, July 25, 2020 12:59 AM, Duncan wrote: > > > > > > > On Sat, 25 Jul 2020 03:03:52

Re: [PATCH 4/4] radeon: fall back to ACPI EDID retrieval

2020-07-28 Thread Daniel Dadap
On 7/28/20 1:50 AM, Christian König wrote: On 27.07.20 at 22:53, Daniel Dadap wrote: Fall back to retrieving the EDID via the ACPI _DDC method, when present for notebook internal panels, when retrieving BIOS-embedded EDIDs. Signed-off-by: Daniel Dadap ---

[Linux-kernel-mentees] [PATCH] drm/amdgpu: Prevent kernel-infoleak in amdgpu_info_ioctl()

2020-07-28 Thread Peilin Ye
Compiler leaves a 4-byte hole near the end of `dev_info`, causing amdgpu_info_ioctl() to copy uninitialized kernel stack memory to userspace when `size` is greater than 356. In 2015 we tried to fix this issue by doing `= {};` on `dev_info`, which unfortunately does not initialize that 4-byte

Re: [PATCH] drm/amdgpu: fix PSP autoload twice in FLR

2020-07-28 Thread Luben Tuikov
On 2020-07-28 2:04 p.m., Luben Tuikov wrote: > Thanks for removing the braces. > > On 2020-07-27 10:29 p.m., Liu ChengZhe wrote: >> the block->status.hw = false assignment will overwrite PSP's previous > ^^ > You want to start a sentence here. Capitalize "The". > Also don't use future tense

Re: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Luben Tuikov
On 2020-07-28 1:27 a.m., Tianci Yin wrote: > From: "Tianci.Yin" > > On Navi1x, the SPM golden settings will be lost after GFXOFF enter/exit, " are lost " > reconfigure the golden settings after GFXOFF exit. " so reconfigure ..." > > Change-Id: I9358ba9c65f241c36f8a35916170b19535148ee9 >

Re: [PATCH 1/2] drm/amdgpu: add interface amdgpu_gfx_init_spm_golden for Navi1x

2020-07-28 Thread Luben Tuikov
On 2020-07-28 1:27 a.m., Tianci Yin wrote: > From: "Tianci.Yin" > > On Navi1x, the SPM golden settings will be lost after GFXOFF enter/exit, Use present tense:... " are lost after " > reconfiguration is needed. Make the configuration code as an interface for Add "so a

[PATCH 1/1] drm/ttm: fix offset in VMAs with a pg_offs in ttm_bo_vm_access

2020-07-28 Thread Felix Kuehling
VMAs with a pg_offs that's offset from the start of the vma_node need to adjust the offset within the BO accordingly. This matches the offset calculation in ttm_bo_vm_fault_reserved. Signed-off-by: Felix Kuehling Tested-by: Laurent Morichetti --- drivers/gpu/drm/ttm/ttm_bo_vm.c | 4 +++- 1
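The offset adjustment described here can be written out with plain integers. This is a sketch under stated assumptions, not the kernel code: the function and parameter names are hypothetical stand-ins for the VMA start address, the VMA page offset, and the start of the BO's vma_node, and 4 KiB pages are assumed.

```c
#include <stdint.h>

#define DEMO_PAGE_SHIFT 12 /* assumes 4 KiB pages */

/* Byte offset within the buffer object for an access at addr.
 * vm_start:   first address covered by the VMA
 * vm_pgoff:   page offset the VMA was mapped at
 * node_start: page offset at which the BO's vma_node begins */
static uint64_t demo_bo_offset(uint64_t addr, uint64_t vm_start,
                               uint64_t vm_pgoff, uint64_t node_start)
{
    return (addr - vm_start) + ((vm_pgoff - node_start) << DEMO_PAGE_SHIFT);
}
```

When the VMA starts exactly at the node, the second term is zero and the result is simply the distance from the start of the VMA.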

Re: [PATCH 1/2] drm amdgpu: Skip tmr load for SRIOV

2020-07-28 Thread Luben Tuikov
On 2020-07-28 1:36 a.m., Liu ChengZhe wrote: > 1. For Navi12, CHIP_SIENNA_CICHLID, skip tmr load operation; > 2. Check pointer before release firmware. > > v2: use CHIP_SIENNA_CICHLID instead > v3: remove local "bool ret"; fix grammar issue > v4: use my name instead of "root"

Re: [PATCH 1/2] drm amdgpu: Skip tmr load for SRIOV

2020-07-28 Thread Luben Tuikov
Thanks for this patch. On 2020-07-28 1:12 a.m., Liu ChengZhe wrote: > From: root You should fix your Git setup to show proper user name, not "root". I've prepared a Confluence page which shows a way to do it, and a few other things along the way:

Re: [PATCH] drm/amdgpu: fix PSP autoload twice in FLR

2020-07-28 Thread Luben Tuikov
Thanks for removing the braces. On 2020-07-27 10:29 p.m., Liu ChengZhe wrote: > the block->status.hw = false assignment will overwrite PSP's previous ^^ You want to start a sentence here. Capitalize "The". Also don't use future tense in commit descriptions (and commit titles). Simply use

Re: [PATCH] amdgpu_dm: fix nonblocking atomic commit use-after-free

2020-07-28 Thread Kazlauskas, Nicholas
On 2020-07-28 5:22 a.m., Paul Menzel wrote: Dear Linux folks, On 25.07.20 at 07:20, Mazin Rezk wrote: On Saturday, July 25, 2020 12:59 AM, Duncan wrote: On Sat, 25 Jul 2020 03:03:52 + Mazin Rezk wrote: On 24.07.20 at 19:33, Kees Cook wrote: There was a fix to disable the async

Re: [PATCH] drm/amd/display: parse ta firmware for navy_flounder

2020-07-28 Thread Alex Deucher
On Tue, Jul 28, 2020 at 11:43 AM Bhawanpreet Lakha wrote: > > Use the same case as sienna_cichlid > > Signed-off-by: Bhawanpreet Lakha Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/psp_v11_0.c | 3 +-- > 1 file changed, 1 insertion(+), 2 deletions(-) > > diff --git

[PATCH] drm/amd/display: parse ta firmware for navy_flounder

2020-07-28 Thread Bhawanpreet Lakha
Use the same case as sienna_cichlid Signed-off-by: Bhawanpreet Lakha --- drivers/gpu/drm/amd/amdgpu/psp_v11_0.c | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/psp_v11_0.c b/drivers/gpu/drm/amd/amdgpu/psp_v11_0.c index d488d250805d..e16874f30d5d

Re: [PATCH] amdgpu_dm: fix nonblocking atomic commit use-after-free

2020-07-28 Thread Paul Menzel
Dear Linux folks, On 25.07.20 at 07:20, Mazin Rezk wrote: On Saturday, July 25, 2020 12:59 AM, Duncan wrote: On Sat, 25 Jul 2020 03:03:52 + Mazin Rezk wrote: On 24.07.20 at 19:33, Kees Cook wrote: There was a fix to disable the async path for this driver that worked around the bug

Re: [PATCH] drm/amdgpu: introduce a new parameter to configure how many KCQ we want(v3)

2020-07-28 Thread Felix Kuehling
On 2020-07-28 at 5:00 a.m., Monk Liu wrote: > what: > the MQD's save and restore of KCQ (kernel compute queue) > cost lots of clocks during world switch which impacts a lot > to multi-VF performance > > how: > introduce a parameter to control the number of KCQ to avoid > performance drop if there

Re: [PATCH] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit(v2)

2020-07-28 Thread Deucher, Alexander
[AMD Public Use] Would it be better to put this code into amdgpu_gfx_off_ctrl()? Then we'll handle this in all cases where we disable gfx off. Alex From: Tianci Yin Sent: Tuesday, July 28, 2020 3:04 AM To: amd-gfx@lists.freedesktop.org Cc: Tuikov, Luben ;

RE: [PATCH 04/12] drm/amdgpu: break driver init process when it's bad GPU

2020-07-28 Thread Chen, Guchun
[AMD Public Use] Hi Dennis, Please check my response after yours. Regards, Guchun -Original Message- From: Li, Dennis Sent: Tuesday, July 28, 2020 5:43 PM To: Chen, Guchun ; amd-gfx@lists.freedesktop.org; Deucher, Alexander ; Zhang, Hawking ; Grodzovsky, Andrey ; Zhou1, Tao ;

Re: [PATCH] drm/amd/powerplay: update driver if version for navy_flounder

2020-07-28 Thread Feng, Kenneth
[AMD Official Use Only - Internal Distribution Only] Reviewed-by: Kenneth Feng On 2020/7/28 at 7:21 PM, "Jiansong Chen" wrote: It's in accordance with pmfw 65.5.0 for navy_flounder. Signed-off-by: Jiansong Chen Change-Id: I984a1147030264adbc02230e2e1dd416d4ad63b0 ---

Re: [PATCH] drm/amd/display: Clear dm_state for fast updates

2020-07-28 Thread Mazin Rezk
On Monday, July 27, 2020 7:42 PM, Mazin Rezk wrote: > On Monday, July 27, 2020 5:32 PM, Daniel Vetter wrote: > > > On Mon, Jul 27, 2020 at 11:11 PM Mazin Rezk wrote: > > > > > > On Monday, July 27, 2020 4:29 PM, Daniel Vetter wrote: > > > > > > > On Mon, Jul 27, 2020 at 9:28 PM Christian

Re: [PATCH] drm/amd/display: Clear dm_state for fast updates

2020-07-28 Thread Mazin Rezk
On Monday, July 27, 2020 4:29 PM, Daniel Vetter wrote: > On Mon, Jul 27, 2020 at 9:28 PM Christian König > wrote: > > > > On 27.07.20 at 16:05, Kazlauskas, Nicholas wrote: > > > On 2020-07-27 9:39 a.m., Christian König wrote: > > >> On 27.07.20 at 07:40, Mazin Rezk wrote: > > >>> This patch

Re: [PATCH] drm/amd/display: Clear dm_state for fast updates

2020-07-28 Thread Mazin Rezk
On Monday, July 27, 2020 5:32 PM, Daniel Vetter wrote: > On Mon, Jul 27, 2020 at 11:11 PM Mazin Rezk wrote: > > > > On Monday, July 27, 2020 4:29 PM, Daniel Vetter wrote: > > > > > On Mon, Jul 27, 2020 at 9:28 PM Christian König > > > wrote: > > > > > > > > Am 27.07.20 um 16:05 schrieb

[PATCH] drm/amd/powerplay: update driver if version for navy_flounder

2020-07-28 Thread Jiansong Chen
It's in accordance with pmfw 65.5.0 for navy_flounder. Signed-off-by: Jiansong Chen Change-Id: I984a1147030264adbc02230e2e1dd416d4ad63b0 --- drivers/gpu/drm/amd/powerplay/inc/smu_v11_0.h | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git

Re: [PATCH 4/4] drm/amdgpu: assign the cpu/gpu address of fence from ring

2020-07-28 Thread Christian König
On 28.07.20 at 12:21, Jack Xiao wrote: Assign the cpu/gpu address of fence for the normal or mes ring from ring structure. Signed-off-by: Jack Xiao Reviewed-by: Hawking Zhang Acked-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c | 4 ++-- 1 file changed, 2

Re: [PATCH 2/4] drm/amdgpu: initialize the cpu/gpu address of rptr/wptr/fence

2020-07-28 Thread Christian König
On 28.07.20 at 12:21, Jack Xiao wrote: Initialize the cpu/gpu address of rptr/wptr/fence. Signed-off-by: Jack Xiao --- drivers/gpu/drm/amd/amdgpu/amdgpu_ring.c | 37 1 file changed, 32 insertions(+), 5 deletions(-) diff --git

RE: [PATCH 4/4] drm/amdgpu: assign the cpu/gpu address of fence from ring

2020-07-28 Thread Ma, Le
[AMD Public Use] Series is Reviewed-by: Le Ma Regards, Ma Le -Original Message- From: Xiao, Jack Sent: Tuesday, July 28, 2020 6:22 PM To: amd-gfx@lists.freedesktop.org; Deucher, Alexander ; Zhang, Hawking ; Koenig, Christian ; Ma, Le Cc: Xiao, Jack ; Koenig, Christian Subject:

[PATCH 2/4] drm/amdgpu: initialize the cpu/gpu address of rptr/wptr/fence

2020-07-28 Thread Jack Xiao
Initialize the cpu/gpu address of rptr/wptr/fence. Signed-off-by: Jack Xiao --- drivers/gpu/drm/amd/amdgpu/amdgpu_ring.c | 37 1 file changed, 32 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ring.c

[PATCH 3/4] drm/amdgpu: use ring structure to access rptr/wptr v2

2020-07-28 Thread Jack Xiao
Use ring structure to access the cpu/gpu address of rptr/wptr. v2: merge gfx10/sdma5/sdma5.2 patches Signed-off-by: Jack Xiao Reviewed-by: Christian König Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/cik_sdma.c | 8 +++--- drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 37

[PATCH 4/4] drm/amdgpu: assign the cpu/gpu address of fence from ring

2020-07-28 Thread Jack Xiao
Assign the cpu/gpu address of fence for the normal or mes ring from ring structure. Signed-off-by: Jack Xiao Reviewed-by: Hawking Zhang Acked-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git

[PATCH 1/4] drm/amdgpu: define ring structure to access rptr/wptr/fence

2020-07-28 Thread Jack Xiao
Define ring structure to access the cpu/gpu address of rptr/wptr/fence instead of dynamic calculation. Cc: Christian König Suggested-by: Christian König Signed-off-by: Jack Xiao Reviewed-by: Hawking Zhang Acked-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 6 ++ 1

RE: [PATCH 04/12] drm/amdgpu: break driver init process when it's bad GPU

2020-07-28 Thread Li, Dennis
[AMD Official Use Only - Internal Distribution Only] Hi, Guchun, Please see my below comments. Best Regards Dennis Li -Original Message- From: Chen, Guchun Sent: Tuesday, July 28, 2020 3:49 PM To: amd-gfx@lists.freedesktop.org; Deucher, Alexander ; Zhang, Hawking ; Li, Dennis ;

RE: FW: [PATCH] drm/amdgpu: introduce a new parameter to configure how many KCQ we want(v3)

2020-07-28 Thread Liu, Monk
[AMD Official Use Only - Internal Distribution Only] I repeated the patch broadcast through git-send-email _ Monk Liu|GPU Virtualization Team |AMD -Original Message- From: Koenig, Christian Sent: Tuesday, July 28, 2020 5:04 PM To: Liu, Monk ;

Re: [PATCH] drm/amd/display: Clear dm_state for fast updates

2020-07-28 Thread daniel
On Mon, Jul 27, 2020 at 10:49:48PM -0400, Kazlauskas, Nicholas wrote: > On 2020-07-27 5:32 p.m., Daniel Vetter wrote: > > On Mon, Jul 27, 2020 at 11:11 PM Mazin Rezk wrote: > > > > > > On Monday, July 27, 2020 4:29 PM, Daniel Vetter wrote: > > > > > > > On Mon, Jul 27, 2020 at 9:28 PM

Re: FW: [PATCH] drm/amdgpu: introduce a new parameter to configure how many KCQ we want(v3)

2020-07-28 Thread Christian König
The patch looks totally mangled to me, e.g. some spaces and new lines are missing. Probably because it was forwarded. Christian. On 28.07.20 at 10:59, Liu, Monk wrote: [AMD Official Use Only - Internal Distribution Only] -Original Message- From: Monk Liu Sent: Tuesday, July 28,

[PATCH] drm/amdgpu: introduce a new parameter to configure how many KCQ we want(v3)

2020-07-28 Thread Monk Liu
what: the MQD's save and restore of KCQ (kernel compute queue) costs lots of clocks during world switch, which significantly impacts multi-VF performance how: introduce a parameter to control the number of KCQ to avoid a performance drop if no kernel compute queue is needed notes: this parameter only

RE: [PATCH 09/12] drm/amdgpu: define one macro for RAS's sysfs/debugfs name

2020-07-28 Thread Chen, Guchun
[AMD Public Use] Thanks Christian. Your suggestion looks better, let me improve it. Regards, Guchun -Original Message- From: Koenig, Christian Sent: Tuesday, July 28, 2020 3:55 PM To: Chen, Guchun ; amd-gfx@lists.freedesktop.org; Deucher, Alexander ; Zhang, Hawking ; Li, Dennis ;

Re: [PATCH 09/12] drm/amdgpu: define one macro for RAS's sysfs/debugfs name

2020-07-28 Thread Christian König
On 28.07.20 at 09:49, Guchun Chen wrote: Add one definition for the RAS module's FS name. It's used in both debugfs and sysfs case. Maybe better do this with a "static const char*". Christian. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 - 1

[PATCH 09/12] drm/amdgpu: define one macro for RAS's sysfs/debugfs name

2020-07-28 Thread Guchun Chen
Add one definition for the RAS module's FS name. It's used in both debugfs and sysfs case. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 - 1 file changed, 8 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH 05/12] drm/amdgpu: skip bad page reservation once issuing from eeprom write

2020-07-28 Thread Guchun Chen
Once the ras recovery is issued from the eeprom write itself, bad page reservation should be skipped; otherwise, recursive writes to eeprom would happen. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 -- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 14

[PATCH 11/12] drm/amdgpu: disable page reservation when amdgpu_bad_page_threshold = 0

2020-07-28 Thread Guchun Chen
When amdgpu_bad_page_threshold = 0, bad page reservation is skipped in both the UMC ECC irq and the page retirement call of the sync flood isr. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 +++-- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 5 +++-- 2 files changed, 6

[PATCH 10/12] drm/amdgpu: decouple sysfs creating of bad page node

2020-07-28 Thread Guchun Chen
Bad page information should not be exposed by sysfs when bad page retirement is disabled, so decouple it from ras sysfs group creation, and add one guard before creating it. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 71 - 1 file changed, 46

[PATCH 03/12] drm/amdgpu: add bad gpu tag definition

2020-07-28 Thread Guchun Chen
This tag will be used for bad gpu detection during eeprom access. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c | 3 +++ 1 file changed, 3 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c

[PATCH 12/12] drm/amdgpu: reset eeprom once specifying one bigger threshold

2020-07-28 Thread Guchun Chen
During the driver's probe, when it hits the bad gpu tag in the eeprom i2c init call (the tag was set when the reported bad pages reached the bad page threshold in the driver's last working loop), there are some strategies to deal with the cases: 1. when the module parameter amdgpu_bad_page_threshold = 0, that means page

[PATCH 08/12] drm/amdgpu: restore ras flags when user resets eeprom

2020-07-28 Thread Guchun Chen
RAS flags need to be cleaned as well when the user requests a clean eeprom. v2: RAS flags shall be restored after eeprom reset succeeds. Signed-off-by: Guchun Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 ++--- 1 file changed, 10 insertions(+), 3 deletions(-) diff --git

[PATCH 07/12] drm/amdgpu: break GPU recovery once it's in bad state

2020-07-28 Thread Guchun Chen
When the GPU executes recovery and retrieves the bad GPU tag from the external eeprom device, the recovery is aborted and an error message is printed for the user's awareness. v2: Refine warning message in threshold reaching case, and fix spelling typo. v3: Fix explicit calling of bad gpu. v4:

[PATCH 06/12] drm/amdgpu: schedule ras recovery when reaching bad page threshold

2020-07-28 Thread Guchun Chen
Once the bad pages saved to eeprom reach the configured threshold, ras recovery will be issued to notify the user. v2: Fix spelling typo. Signed-off-by: Guchun Chen --- .../gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c| 37 ++- 1 file changed, 36 insertions(+), 1 deletion(-) diff

[PATCH 04/12] drm/amdgpu: break driver init process when it's bad GPU

2020-07-28 Thread Guchun Chen
When the bad gpu tag is retrieved from eeprom, GPU init should fail, as the GPU needs to be retired for further checks. v2: Fix spelling typo, correct the condition to detect bad gpu tag and refine error message. v3: Refine function argument name. Signed-off-by: Guchun Chen ---

[PATCH 00/12] BAD GPU retirement policy by total bad pages

2020-07-28 Thread Guchun Chen
This series enables/disables the bad page feature and applies different bad page reservation strategies according to the bad page threshold configuration. When the saved bad pages written to eeprom reach the threshold, one ras recovery will be issued immediately and the recovery will fail to tell user

[PATCH 02/12] drm/amdgpu: validate bad page threshold in ras

2020-07-28 Thread Guchun Chen
The bad page threshold value is valid only in the range between -1 and the max records length of the eeprom. It determines when the saved bad pages exceed the threshold value, so that the corresponding actions can be taken. v2: When using the default typical value, it should be the min value between the typical value and eeprom

[PATCH 01/12] drm/amdgpu: add bad page count threshold in module parameter

2020-07-28 Thread Guchun Chen
bad_page_threshold could be configured to enable/disable the associated bad page retirement feature in RAS. When it's -1, ras will use typical bad page failure value to handle bad page retirement. When it's 0, disable bad page retirement, and no bad page will be recorded and saved. For other
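
The cases listed in this snippet can be sketched as a small mapping from the parameter value to a policy. The enum and function names are hypothetical. The snippet is truncated at "For other", so the sketch only labels other values as a separate case and makes no claim about what the driver does with them.

```c
/* Hypothetical policy labels, invented for illustration only. */
enum bad_page_policy {
	BAD_PAGE_USE_TYPICAL,	/* -1: use the typical bad page failure value */
	BAD_PAGE_DISABLED,	/*  0: retirement off, no bad page recorded or saved */
	BAD_PAGE_OTHER_VALUE	/* other values: behavior elided in the snippet */
};

/* Map the module parameter value to one of the labels above. */
static enum bad_page_policy bad_page_policy_from_param(int bad_page_threshold)
{
	switch (bad_page_threshold) {
	case -1:
		return BAD_PAGE_USE_TYPICAL;
	case 0:
		return BAD_PAGE_DISABLED;
	default:
		return BAD_PAGE_OTHER_VALUE;
	}
}
```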

[PATCH 1/4] drm: retrieve EDID via ACPI _DDC method

2020-07-28 Thread Daniel Dadap
Some notebook computer systems expose the EDID for the internal panel via the ACPI _DDC method. On some systems this is because the panel does not populate the hardware DDC lines, and on some systems with dynamic display muxes, _DDC is implemented to allow the internal panel's EDID to be read at

[PATCH 0/4] drm: add support for retrieving EDID via ACPI _DDC

2020-07-28 Thread Daniel Dadap
Some notebook systems provide the EDID for the internal panel via the _DDC method in ACPI, instead of or in addition to providing the EDID via DDC on LVDS/eDP. Add a DRM helper to search for an ACPI _DDC method under the ACPI namespace for each VGA/3D controller, and return the first EDID

[PATCH 2/4] i915: fall back to ACPI EDID retrieval

2020-07-28 Thread Daniel Dadap
Fall back to retrieving the EDID via the ACPI _DDC method, when present for notebook internal panels, when EDID retrieval via the standard EDID paths is unsuccessful. Signed-off-by: Daniel Dadap --- drivers/gpu/drm/i915/display/intel_dp.c | 8 +++-

[PATCH 4/4] radeon: fall back to ACPI EDID retrieval

2020-07-28 Thread Daniel Dadap
Fall back to retrieving the EDID via the ACPI _DDC method, when present for notebook internal panels, when retrieving BIOS-embedded EDIDs. Signed-off-by: Daniel Dadap --- drivers/gpu/drm/radeon/radeon_combios.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git

[PATCH 3/4] nouveau: fall back to ACPI EDID retrieval

2020-07-28 Thread Daniel Dadap
Fall back to retrieving the EDID via the ACPI _DDC method, when present for notebook internal panels, when EDID retrieval via the standard EDID paths is unsuccessful. Signed-off-by: Daniel Dadap --- drivers/gpu/drm/nouveau/nouveau_connector.c | 6 ++ 1 file changed, 6 insertions(+) diff

RE: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Chen, Guchun
[AMD Public Use] Hi Tianci, My point is that, since the new patch introduces a new local adev variable, the other places in the same function where smu->adev is used should be replaced by the new local adev as well. Otherwise, it does not look ideal from a coding-style perspective. Regards,

Re: [PATCH] drm/amdgpu/dc: Stop dma_resv_lock inversion in commit_tail

2020-07-28 Thread Christian König
On 27.07.20 at 23:30, Daniel Vetter wrote: Trying to grab dma_resv_lock while in commit_tail before we've done all the code that leads to the eventual signalling of the vblank event (which can be a dma_fence) is deadlock-y. Don't do that. Here the solution is easy because just grabbing locks

RE: [PATCH] drm/amdgpu: introduce a new parameter to configure how many KCQ we want(v2)

2020-07-28 Thread Liu, Monk
[AMD Official Use Only - Internal Distribution Only] Thanks Felix, I reworked my patch with your suggestion and I can get queues spread evenly across pipes, e.g.: modprobe amdgpu num_kcq=6 [ 409.878557] amdgpu :00:07.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0 [ 409.878559] amdgpu

Re: [PATCH 4/4] radeon: fall back to ACPI EDID retrieval

2020-07-28 Thread Christian König
On 27.07.20 at 22:53, Daniel Dadap wrote: Fall back to retrieving the EDID via the ACPI _DDC method, when present for notebook internal panels, when retrieving BIOS-embedded EDIDs. Signed-off-by: Daniel Dadap --- drivers/gpu/drm/radeon/radeon_combios.c | 6 +++--- 1 file changed, 3

RE: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Yin, Tianci (Rico)
Hi Guchun, Since the adev variable is used a few times, a local adev makes the code more concise. Thanks! Rico -Original Message- From: Chen, Guchun Sent: Tuesday, July 28, 2020 2:26 PM To: Yin, Tianci (Rico) ; amd-gfx@lists.freedesktop.org Cc: Xu, Feifei ; Yin, Tianci (Rico) ; Tuikov,

RE: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Chen, Guchun
[AMD Public Use] One minor comment. Regards, Guchun -Original Message- From: amd-gfx On Behalf Of Tianci Yin Sent: Tuesday, July 28, 2020 1:27 PM To: amd-gfx@lists.freedesktop.org Cc: Xu, Feifei ; Yin, Tianci (Rico) ; Tuikov, Luben ; Hesik, Christopher ; Deucher, Alexander ; Swamy,

RE: [PATCH 2/2] drm/amdgpu: reconfigure spm golden settings on Navi1x after GFXOFF exit

2020-07-28 Thread Yin, Tianci (Rico)
[AMD Official Use Only - Internal Distribution Only] Thanks Feifei! Rico -Original Message- From: Xu, Feifei Sent: Tuesday, July 28, 2020 2:21 PM To: Yin, Tianci (Rico) ; amd-gfx@lists.freedesktop.org Cc: Tuikov, Luben ; Deucher, Alexander ; Zhang, Hawking ; Hesik, Christopher ;