Hello again, I compiled one of te examples. I used sparse matix called 02-raefsky3. I used -vec_type cuda and -mat_type seqaijcuda.
When I see summary of the operations performed by program there is MatMult 1 1.0 2.0237e-02 1.0 2.98e+06 1.0 0.0e+00 0.0e+00 0.0e+00 2100 0 0 0 2100 0 0 0 147 Does time of performing MatMult includes memory transfer for loading matrix in GPU memory or just exact computation time? Thanks in advance. Kuba.
