Re: Mir GLAS vs Intel MKL: which is faster?

dextorious via Digitalmars-d Sat, 24 Sep 2016 10:11:37 -0700

First of all, awesome work. It's great to see that it's possibleto match or even exceed the performance of hand-crafted assemblyimplementations with generic code.

I would suggest adding more information on how the Eigen resultswere obtained. Unlike OpenBLAS, Eigen performance does often varyby compiler and varies greatly depending on the kind ofpreprocessor macros that are defined. In particular,EIGEN_NO_DEBUG is defined by default and reduces performance,EIGEN_FAST_MATH is not defined by default but can often increaseperformance and EIGEN_STACK_ALLOCATION_LIMIT matters greatly forperformance on very small matrices (where MKL and especiallyOpenBLAS are very inefficient). It's been a while since I've usedEigen, so I may have forgotten one or two.

It may also be worth noting in the blog post that these are allsingle threaded comparisons and multithreaded implementations areon the way. This is obvious to anyone who's followed thedevelopment of Mir, but a general audience on Reddit will likelypoint it out as a deficiency unless stated upfront.

Re: Mir GLAS vs Intel MKL: which is faster?

Reply via email to