Hi Albert,

> The standard CPU backend doesn't seem to use multiple CPUs (e.g. for
> matrix multiplication).

This depends on the operation. Some are OpenMP-enabled, others aren't yet. So far we have had to focus on OpenCL and CUDA, with the CPU/OpenMP backends serving mostly as a fallback. If you're limited by matrix-matrix multiplications, you might want to give the OpenCL backend on the CPU a shot.

> Is this planned? In some cases, I see some OpenMP pragmas in the code,
> but not everywhere, esp. not on matrix-matrix products, which is my
> most used operation. Has that just not been tested yet or is there a
> reason?

It has not been thoroughly tested and optimized yet. Our observation was that many people use vendor libraries for big dense matrix-matrix multiplications anyway, so we focused instead on the many 'other' linear algebra operations and the API. Adding OpenMP pragmas to the current matrix-matrix multiplication would be easy, but we would rather improve cache utilization through blocking first. If there is user demand, we are happy to assign higher priority to this :-)

Best regards,
Karli

_______________________________________________
ViennaCL-devel mailing list
ViennaCL-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/viennacl-devel
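
To make the remark about blocking and OpenMP pragmas concrete, here is a minimal sketch of a cache-blocked dense product with a single `parallel for` over row blocks. It is not ViennaCL code and does not reflect how the library implements its kernels: the function name `blocked_gemm`, the default block size of 64, and the square row-major `std::vector<double>` layout are all assumptions made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of ViennaCL: returns C = A * B for
// square row-major n x n matrices, computed tile by tile.
std::vector<double> blocked_gemm(const std::vector<double>& A,
                                 const std::vector<double>& B,
                                 std::size_t n,
                                 std::size_t block = 64)
{
  assert(A.size() == n * n && B.size() == n * n && block > 0);
  std::vector<double> C(n * n, 0.0);
  const long num_row_blocks = static_cast<long>((n + block - 1) / block);

  // Each iteration owns a distinct band of rows of C, so no two threads
  // write to the same entry. Without OpenMP enabled the pragma is
  // ignored and the loop runs serially.
  #pragma omp parallel for
  for (long bi = 0; bi < num_row_blocks; ++bi) {
    const std::size_t i0 = static_cast<std::size_t>(bi) * block;
    const std::size_t i1 = std::min(i0 + block, n);
    for (std::size_t k0 = 0; k0 < n; k0 += block) {
      const std::size_t k1 = std::min(k0 + block, n);
      for (std::size_t j0 = 0; j0 < n; j0 += block) {
        const std::size_t j1 = std::min(j0 + block, n);
        // Multiply one tile of A with one tile of B and accumulate
        // into the matching tile of C.
        for (std::size_t i = i0; i < i1; ++i) {
          for (std::size_t k = k0; k < k1; ++k) {
            const double a = A[i * n + k];
            for (std::size_t j = j0; j < j1; ++j) {
              C[i * n + j] += a * B[k * n + j];
            }
          }
        }
      }
    }
  }
  return C;
}
```

With GCC or Clang, building with `-fopenmp` enables the pragma. A suitable block size depends on the cache sizes of the target machine, so 64 is only a starting point for experiments.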