Hi,
> Yes, the default NVidia profile for double precision uses a work group > size of 1024... All this is checked during the autotuning procedure so > that it will work for the hardware it's tunned for... > Meh, seems like we need a couple additional levels of abstraction to > reach safety. In this case it's supposed to be only a matter of detecting the device generation properly (pre-Fermi). Best regards, Karli ------------------------------------------------------------------------------ Get 100% visibility into Java/.NET code with AppDynamics Lite! It's a free troubleshooting tool designed for production. Get down to code-level detail for bottlenecks, with <2% overhead. Download for free and get started troubleshooting in minutes. http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk _______________________________________________ ViennaCL-devel mailing list ViennaCL-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/viennacl-devel