Hi,

I am creating a custom target which will run just the kernels, I want 
the kernels to be compiled and linked statically with everything needed 
to run the kernel included. So I believe the standalone method of 
compilation is right, is this correct?

I am using a configurable VLIW so I will want to combine work items to 
expose ILP but the device is also multi-core. I am compiling the simple 
standalone example and just would like to know how get_global_id is 
calculated in the produced bytecode? And how I should use the 
_*kernel*_workgroup and _*kernel*_workgroup_fast functions? I'm trying 
to go through the code, but without comments, I am not making very fast 
progress.

Many thanks,

-- 
Sam Parker
Research Student
Electronic System Design Group
School of Electronic, Electrical and Systems Engineering
Loughborough University


------------------------------------------------------------------------------
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
_______________________________________________
pocl-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/pocl-devel

Reply via email to