On Monday, 14 November 2016 at 08:41:36 UTC, Ilya Yaroshenko
wrote:
Yes, I will use libFlame approach, it is a really good. In the
same time Dlang is more user-friendly for SIMD optimization.
GLAS routines probably will follow hypothetical (it is closed
source) Intel MKL approach but without unrolled loops. LibFLAME
has 2 kinds of algorithms: blocking and unblocking. GLAS where
it is possible will have 3 kinds: tiny unblocking, register
(SIMD) blocking, and normal blocking.
That is great!
I hope being able to use your lib soon,
Again thank you for sharing and for these impressive results!
Vincent