Hey,
2014-08-17 11:52 GMT+02:00 Karl Rupp r...@iue.tuwien.ac.at:
Hi,
So it seems like most of the features are ready for ViennaCL 1.6. My
merge from a few days ago (finally) fully integrated the use of
device-specific kernels for BLAS1, BLAS2, BLAS3.
hurray! :-)
The reduction API
Hi,
* nmf only implements matrix<T>, but in principle matrix_base<T> should
work (since no custom kernel is called, I believe)
NMF uses a custom kernel and thus only works with OpenCL. A
generalization to matrix_base should be straightforward, yes. I
should be able to do
Hey,
The nasty bug on strided GEMV got solved.
I'm available on Wednesday for the code uniformization session. We should
be on IRC at the same time, though, in case we face a situation we had not
discussed. I have a couple of questions regarding a standardized way of
naming the numeric type of a
Hi all,
I just pushed the first working version of expert (custom) benchmark mode.
Selecting custom sparse matrices is yet to be implemented, but all other
benchmark configs are working.
Except blas3, that is. I think I got the sizes wrong. I'd appreciate it if
someone could check if I did it
Hi Namik,
I just pushed the first working version of expert (custom) benchmark
mode. Selecting custom sparse matrices is yet to be implemented, but all
other benchmark configs are working.
thanks! I'll comment on it tomorrow.
Except blas3, that is. I think I got the sizes wrong. I'd
Namik,
I just tried to build it and got a couple of errors. I made sure to run 'git
submodule update --init' and I started with a fresh build folder for
cmake. Since I know this is in active development, I thought I would ask
first before trying to fix it and submitting a pull request.
Here are
Hey Matt,
'uint64_t' : undeclared identifier
This looks like a compiler hiccup. I suppose this occurred with MSVC? Feel
free to submit a fix.
Regards, Namik
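
An 'undeclared identifier' error for uint64_t commonly means the header that declares the type was never included in that translation unit. The sketch below assumes that is the cause here, which the thread does not confirm. The helper name elapsed_ns is hypothetical and not taken from the benchmark code.

```cpp
#include <cassert>
#include <cstdint>  // declares std::uint64_t; without it the name may be undeclared

// Compile-time check that the type really is 64 bits wide.
static_assert(sizeof(std::uint64_t) == 8, "uint64_t must be 64 bits wide");

// Hypothetical helper (an assumption for illustration): difference between
// two tick counts, kept in 64 bits so that large values do not wrap.
inline std::uint64_t elapsed_ns(std::uint64_t start, std::uint64_t stop) {
  assert(stop >= start);  // precondition: the stop tick is not before the start tick
  return stop - start;
}
```

Including the header keeps the 64-bit type, so counts above the 32-bit limit of 4294967295 stay exact.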
On Mon, Aug 18, 2014 at 3:54 AM, Matthew Musto matthew.mu...@gmail.com
wrote:
Namik,
I just tried to build it and got a couple
I forgot my Visual Studio with Qt is broken :) so I can't test MSVC
CMake... I did, however, test MinGW CMake, and it works fine.
On Mon, Aug 18, 2014 at 4:33 AM, Namik Karovic namik.karo...@gmail.com
wrote:
I just got it to work with MSVC10 Qt5.3.1. Changed uint64_t to uint.
Don't think
Hey Namik,
The code looks fine. As a small tip, I would recommend using
blas3MatrixSize{A,B,C} = {M, N, K}; it's much more conventional. I would
also suggest removing LU from the benchmark. I only achieve 11 GFLOP/s on
my machine (GEMM peaks at 120 GFLOP/s). It will smash the overall score if
you