On Thu, 2017-06-22 at 23:57 +0200, Joerg Roedel wrote: > On Thu, Jun 22, 2017 at 11:13:09AM -0400, Jan Vesely wrote: > > It looks like I tested different patches. > > linux-4.10.17 with both > > "iommu/amd: Optimize iova queue flushing" > > This patch isn't in my tree and will not go upstream. > > > and > > "iommu/amd: Disable previously enabled IOMMUs at boot" > > This patch solves a different problem. > > > (I haven't tested the series independently) > > > > works OK. The machine booted successfully and I was able to test clover > > based OpenCL and simple OpenGL on both iGPU(carrizo) and dGPU(iceland). > > For a conclusive test please use what is in the iommu-tree, as this is > what I plan to send upstream. You can use the 'next' branch of > > git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu.git
Tested commit c71bf5f133056aae71e8ae7ea66240574bd44f54. The machine boots and runs OK, although it takes few minutes to boot up (looks USB related). OpenGL and OpenCL run OK on both GPUs. I was able to trigger "Completion-Wait loop timed out" messages in the following situation: Hung OpenCL task running on dGPU. dGPU goes to sleep. sigterm to hung task. it seems to recover OK after the dGPU is powered back on dmesg: [ 1628.049683] amdgpu: [powerplay] VI should always have 2 performance levels [ 1628.845195] amdgpu 0000:07:00.0: GPU pci config reset [ 1667.270351] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.270437] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.270491] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.270505] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.270556] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.270607] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.270614] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.270664] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.270714] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.270721] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.270770] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.270846] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.270868] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.270922] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.270982] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.270992] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.271043] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.271096] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.271109] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.271164] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.271230] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.271245] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.271338] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.271394] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.271403] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.271458] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.271518] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.271533] amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0> [ 1667.271588] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) [ 1667.271644] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job! [ 1667.426742] AMD-Vi: Completion-Wait loop timed out [ 1667.570025] AMD-Vi: Completion-Wait loop timed out [ 1667.713326] AMD-Vi: Completion-Wait loop timed out [ 1667.867561] AMD-Vi: Completion-Wait loop timed out [ 1668.010886] AMD-Vi: Completion-Wait loop timed out [ 1668.154207] AMD-Vi: Completion-Wait loop timed out [ 1668.283193] AMD-Vi: Event logged [ [ 1668.283201] IOTLB_INV_TIMEOUT device=07:00.0 address=0x000000040ce6e240] [ 1668.430357] AMD-Vi: Completion-Wait loop timed out [ 1668.581169] AMD-Vi: Completion-Wait loop timed out [ 1668.718046] AMD-Vi: Completion-Wait loop timed out [ 1668.854914] AMD-Vi: Completion-Wait loop timed out [ 1668.991774] AMD-Vi: Completion-Wait loop timed out [ 1669.128638] AMD-Vi: Completion-Wait loop timed out [ 1669.272391] AMD-Vi: Completion-Wait loop timed out [ 1669.285193] AMD-Vi: Event logged [ [ 1669.285200] IOTLB_INV_TIMEOUT device=07:00.0 address=0x000000040ce6e2b0] [ 1669.285756] [drm] PCIE GART of 3072M enabled (table at 0x0000000000040000). [ 1669.288274] amdgpu: [powerplay] can't get the mac of 5 [ 1669.302600] [drm] ring test on 0 succeeded in 16 usecs [ 1669.302987] [drm] ring test on 1 succeeded in 17 usecs [ 1669.303037] [drm] ring test on 2 succeeded in 21 usecs [ 1669.303063] [drm] ring test on 3 succeeded in 10 usecs [ 1669.303088] [drm] ring test on 4 succeeded in 10 usecs [ 1669.303114] [drm] ring test on 5 succeeded in 10 usecs [ 1669.303142] [drm] ring test on 6 succeeded in 11 usecs [ 1669.303167] [drm] ring test on 7 succeeded in 10 usecs [ 1669.303195] [drm] ring test on 8 succeeded in 11 usecs [ 1669.303229] [drm] ring test on 9 succeeded in 3 usecs [ 1669.303235] [drm] ring test on 10 succeeded in 3 usecs [ 1675.029247] amdgpu: [powerplay] VI should always have 2 performance levels [ 1675.823322] amdgpu 0000:07:00.0: GPU pci config reset lspci: 07:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Topaz XT [Radeon R7 M260/M265 / M340/M360 / M440/M445] (rev ff) Jan > > to get all patches, including my flush optimization series. > > > Thanks, > > Joerg >
signature.asc
Description: This is a digitally signed message part
_______________________________________________ iommu mailing list [email protected] https://lists.linuxfoundation.org/mailman/listinfo/iommu
