Hi guys, the scheduler for kernel fusion makes good progress. Toby, you should be able to use all of the fundamental dense linear algebra operations now. There should be only be two blocks of functionality missing: - Sparse matrices (i.e. matrix-vector products) - In some cases where += and -= may not work (e.g. matrix-vector product)
Compilation times are moderate, but there is also some room for improvement left. Matrix-matrix products are unnecessarily heavy on the compiler. The good news for today is that things are finally growing together: Via the scheduler Toby can make the fast kernels from Philippe's generator available to the Python community :-) Best regards, Karli ------------------------------------------------------------------------------ Get 100% visibility into Java/.NET code with AppDynamics Lite! It's a free troubleshooting tool designed for production. Get down to code-level detail for bottlenecks, with <2% overhead. Download for free and get started troubleshooting in minutes. http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk _______________________________________________ ViennaCL-devel mailing list ViennaCL-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/viennacl-devel