Hi,

 > I've already told this on IRC. The GEMV uses very conservative profiles
> with very few threads. Now that I have ported a simple version of GEMM
> (when only full matrices are used), I'll re-bind the generator into
> pyviennacl and will try to get an auto-tuning up and runing in python.
> Then, I'll update the profiles to something better :)

What about using a default local workgroup size of 128 and a default 
global workgroup size of 128*128? This has worked fairly well for years 
now and will get him fairly close to peak on current hardware?

Best regards,
Karli

------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
ViennaCL-devel mailing list
ViennaCL-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/viennacl-devel

Reply via email to