[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-10-03 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

Arek Ruśniak  changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution|--- |FIXED

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-10-03 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #22 from Vedran Miletić  ---
The patch has been included in amd-staging-drm-next for a while, should this
bug be closed?

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-09 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

charlie  changed:

   What|Removed |Added

 CC||bug0xa...@hushmail.com

--- Comment #21 from charlie  ---
*** Bug 102598 has been marked as a duplicate of this bug. ***

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-09 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #20 from charlie  ---
I confirm that bug 102500 and bug 102598 are the same.

I split up the patch into 3 parts and they applied cleanly with offsets to
drm-next-4.15-wip.

I then reverted mesa to commit 214b565bc28bc4419f3eec29ab7bbe34080459fe
(winsys/amdgpu: set AMDGPU_GEM_CREATE_VM_ALWAYS_VALID if possible v2) compiled
and started X and corruption and lockups are gone.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-09 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #19 from charlie  ---
Bug 102500 might be related to bug 102598.

I tried to apply patch attachment 134082 to  amd-staging-4.12 (~agd5f/linux)
kernel and drm-next-4.15-wip but it does not apply cleanly.  I applied it
manually to drm-next-4.15-wip and that kernel would not finish compiling.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-09 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #18 from Vedran Miletić  ---
(In reply to Arek Ruśniak from comment #16)
> Patch fixes issue.
> I've tried both staging-4.12 and staging-drm-next branches.
> Thanks Christian
> 
> PS. It will be nice if Vedran could confirmed this for Vega before we close.

I can confirm that after applying the patch the issue doesn't occur for me. (I
hope that's enough, I can't claim more than that since I have done 2-3 upgrades
of mesa/llvm since I last tested the broken kernel.)

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #17 from Dieter Nützel  ---
(In reply to Christian König from comment #15)
> Created attachment 134082 [details] [review]
> Possible fix
> 
> Please try the attached kernel patch.

Hello Christian,

you've made your 'homework'...;-)

> To repeat my question: Does patch "drm/amdgpu: fix moved list handling in
> the VM" fix the issue?
>
> Do you guys have this in your kernel branch yet? If not that lockup is
> expected.

No, I haven't.
It was fallen into the cranks of the repeated DC rebase of Alex's
'amd-staging-drm-next' tree (didn't noticed it for the last 7 days, Alex
vacation). 
I'll make it short. NO that didn't solve it for me, too.

But _this_ patch is GOLD:
drm-amdgpu-fix-VM-sync-with-always-valid-BOs.mbox

Tested-by: Dieter Nützel 

Best 'glmark2' Score I've ever seen.
RX580, 8 GB
Xeon X3470, 4/8, 3 GHz
24 GB

glmark2 Score: 6428

with additional load on the gfx cores through parallel running
'opencl-example/run_tests.sh' I got

glmark2 Score: 7574

Good job!

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #16 from Arek Ruśniak  ---
Patch fixes issue.
I've tried both staging-4.12 and staging-drm-next branches.
Thanks Christian

PS. It will be nice if Vedran could confirmed this for Vega before we close.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #15 from Christian König  ---
Created attachment 134082
  --> https://bugs.freedesktop.org/attachment.cgi?id=134082=edit
Possible fix

Please try the attached kernel patch.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #14 from Christian König  ---
(In reply to Arek Ruśniak from comment #13)
> Christian sorry, I thought that was clear. 

No problem, that just means that this is the same issue I'm still hunting for.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #13 from Arek Ruśniak  ---
Christian sorry, I thought that was clear. 
Yes, I updated ASAP so it contains:
https://cgit.freedesktop.org/~agd5f/linux/commit/?h=amd-staging-4.12=8bd2cc0ab44b00346cc41f3ac828cbf992f6bc61
Doesn't help for vm-faults

Every test right before and after your comment is for: 
linux-amd-staging-4.12-c5def4cbdb61

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-08 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #12 from Christian König  ---
To repeat my question: Does patch "drm/amdgpu: fix moved list handling in the
VM" fix the issue?

Do you guys have this in your kernel branch yet? If not that lockup is
expected.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-07 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #11 from Arek Ruśniak  ---
on mesa side looks like this is it:

214b565bc28bc4419f3eec29ab7bbe34080459fe is the first bad commit
commit 214b565bc28bc4419f3eec29ab7bbe34080459fe
Author: Christian König 
Date:   Tue Aug 29 16:45:46 2017 +0200

winsys/amdgpu: set AMDGPU_GEM_CREATE_VM_ALWAYS_VALID if possible v2

When the kernel supports it set the local flag and
stop adding those BOs to the BO list.

Can probably be optimized much more.

v2: rename new flag to AMDGPU_GEM_CREATE_VM_ALWAYS_VALID

Reviewed-by: Marek Olšák 

:04 04 2e4b2737f37ede2bbdbbe6815fe0fa562177c2b7
3482c86ed92116adff7ab12b2d4de870746a1df6 M  src

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-07 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #10 from Arek Ruśniak  ---
additional info:
I try figure out why in my earlier test everything went ok and probably mesa is
the trigger, 

Linux-amd-staging + Mesa-git + LLVM-svn - failure
Linux-amd-staging + Mesa-git + LLVM 4.0.1 - failure
Linux-amd-staging + Mesa 17.1.8 + LLVM 4.0.1 - works ok. 
I try later some bisecting, we will see.

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel


[Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup

2017-09-04 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=102500

Vedran Miletić  changed:

   What|Removed |Added

 CC||ved...@miletic.net
Summary|[polaris10][amd-staging-4.1 |[polaris10,
   |2] GPU fault detected,  |vega10][amd-staging-4.12,
   |somethimes lockup   |amd-staging-drm-next] GPU
   ||fault detected, somethimes
   ||lockup

-- 
You are receiving this mail because:
You are the assignee for the bug.___
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel