Re: [pocl-devel] NVIDIA device backend for POCL

Peter Colberg Fri, 22 Mar 2013 17:23:03 -0700

On Sat, Mar 23, 2013 at 01:55:30AM +0200, Pekka Jääskeläinen wrote:
> Also, the kernel uses "automatic locals" in contrast to the
> "host allocated" kernel local args, if that makes any difference.


That is the point. Automatic local memory arrays (with static size)
are supported by both CUDA and OpenCL, so in this case cuModuleLoad*
will have no problem loading PTX generated from OpenCL using LLVM
NVPTX.

The issue is local and constant memory passed by parameter. The CUDA
driver rejects these cases, despite supporting them in OpenCL.

This gives the impression that the real issue with this pocl backend
will not be the LLVM NVPTX backend, but the CUDA driver, which is
outside the realm of LLVM. In the above case, the CUDA driver rejects
valid PTX code without giving any reason.

Peter

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_mar
_______________________________________________
pocl-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/pocl-devel

Re: [pocl-devel] NVIDIA device backend for POCL

Reply via email to