Hi Albert,

> The standard CPU backend doesn't seem to use multiple CPUs (e.g. for
> matrix multiplication).

This depends on the operation. Some are OpenMP-enabled, others aren't 
yet. So far we needed to focus on OpenCL and CUDA, with the CPU/OpenMP 
backends being used mostly for fallback purposes. If you're limited by 
matrix-matrix multiplications, you might want to give the OpenCL backend 
on the CPU a shot.


> Is this planned? In some cases, I see some OpenMP pragmas in the code,
> but not everywhere, esp. not on matrix-matrix products, which is my
> most used operation. Has that just not been tested yet or is there a
> reason?

It has not been thoroughly tested and optimized yet. Our observation was 
that many people use vendor libraries for big dense matrix-matrix 
multiplications anyway, so we focused on the many 'other' linear 
algebra operations and the API instead. Adding OpenMP pragmas to the 
current matrix-matrix multiplication would be easy, but we should first 
improve cache utilization through blocking. If there is user demand, we 
are happy to assign higher priority to this :-)
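
To illustrate what blocking plus an OpenMP pragma could look like, here is 
a minimal sketch. It is not ViennaCL code: the function name blocked_gemm, 
the row-major flat storage, and the default block size of 64 are 
assumptions made only for this example, and nothing here is tuned.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of ViennaCL): computes C = A * B for
// row-major n x n matrices stored in flat vectors of length n * n.
void blocked_gemm(const std::vector<double>& A,
                  const std::vector<double>& B,
                  std::vector<double>& C,
                  std::size_t n,
                  std::size_t block = 64)  // untuned guess, pick per cache size
{
  assert(block > 0);
  assert(A.size() == n * n && B.size() == n * n && C.size() == n * n);

  std::fill(C.begin(), C.end(), 0.0);
  const long num_row_blocks = static_cast<long>((n + block - 1) / block);

  // Each thread handles whole row-blocks of C, so no two threads write to
  // the same entry. Without OpenMP enabled the pragma is ignored and the
  // loop simply runs serially.
  #pragma omp parallel for
  for (long bi = 0; bi < num_row_blocks; ++bi) {
    const std::size_t i0 = static_cast<std::size_t>(bi) * block;
    const std::size_t i1 = std::min(i0 + block, n);
    for (std::size_t k0 = 0; k0 < n; k0 += block) {
      const std::size_t k1 = std::min(k0 + block, n);
      for (std::size_t j0 = 0; j0 < n; j0 += block) {
        const std::size_t j1 = std::min(j0 + block, n);
        // Work on one tile of A and one tile of B while they are in cache.
        for (std::size_t i = i0; i < i1; ++i) {
          for (std::size_t k = k0; k < k1; ++k) {
            const double a = A[i * n + k];
            for (std::size_t j = j0; j < j1; ++j)
              C[i * n + j] += a * B[k * n + j];
          }
        }
      }
    }
  }
}
```

With GCC or Clang, OpenMP is typically enabled via -fopenmp. A vendor BLAS 
will most likely still be faster for large dense products.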

Best regards,
Karli


_______________________________________________
ViennaCL-devel mailing list
ViennaCL-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/viennacl-devel
