csullivan opened a new pull request #7834:
URL: https://github.com/apache/tvm/pull/7834


   I have encountered a few pathological bugs in the OpenCL compiler provided 
on the Snapdragon Android platform (e.g. the compiler hanging for 5+ hours in a 
call to clBuildProgram, and non-deterministic emission of 
`cl_a6x_cmdbuf_mgr_submit_ibs`). I've isolated them into a minimal reproducible 
example and found that they occur only when all kernels are created from a 
single cl_program. If a separate cl_program is created for each kernel instead, 
these issues are avoided. 
   
   This PR proposes adding a kernel primitive delimiter to the OpenCL code 
generation, and having the OpenCL module runtime use this delimiter to build 
and cache a separate cl_program for each generated kernel source.
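
   To illustrate the idea, here is a minimal Python sketch of the splitting and 
caching logic. It is not the code in this PR: the delimiter string, the 
`split_kernels` function, and the `ProgramCache` class are hypothetical names 
chosen for illustration, and the `build` callback only stands in for the 
clCreateProgramWithSource / clBuildProgram calls that the real runtime would 
make.

```python
# Hypothetical marker that code generation would emit before each kernel.
DELIMITER = "// Function: "


def split_kernels(source, delimiter=DELIMITER):
    """Split one generated source string into {kernel_name: kernel_source}.

    Assumes each kernel is preceded by a line "<delimiter><kernel_name>".
    Any text before the first delimiter is ignored in this sketch.
    """
    kernels = {}
    for chunk in source.split(delimiter)[1:]:
        # The first line of the chunk is the kernel name, the rest is its body.
        name, _, body = chunk.partition("\n")
        kernels[name.strip()] = body.strip() + "\n"
    return kernels


class ProgramCache:
    """Build one program per kernel on first use and cache the result."""

    def __init__(self, sources, build):
        self._sources = sources  # {kernel_name: kernel_source}
        self._build = build      # stand-in for the OpenCL program build step
        self._programs = {}      # {kernel_name: built program}

    def get(self, name):
        # Build only the requested kernel, and only once.
        if name not in self._programs:
            self._programs[name] = self._build(self._sources[name])
        return self._programs[name]
```

   With this layout, a compiler failure or hang triggered by one kernel source 
is confined to that kernel's program, and kernels that are never launched are 
never built.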

