Hi,

 > Yes, the default NVidia profile for double precision uses a work group
> size of 1024... All this is checked during the autotuning procedure so
> that it will work for the hardware it's tunned for...
> Meh, seems like we need a couple additional levels of abstraction to
> reach safety.

In this case it's supposed to be only a matter of detecting the device 
generation properly (pre-Fermi).

Best regards,
Karli


------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite!
It's a free troubleshooting tool designed for production.
Get down to code-level detail for bottlenecks, with <2% overhead. 
Download for free and get started troubleshooting in minutes. 
http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
_______________________________________________
ViennaCL-devel mailing list
ViennaCL-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/viennacl-devel

Reply via email to