Karl Rupp r...@iue.tuwien.ac.at writes:
Okay, so this shows ~150 us overhead for representing the types as in
pyviennacl. Compared with a pure CPU-implementation, this is certainly a
lot.
However, when comparing this with OpenCL, then the overhead is actually
not too bad and becomes
Hi Philippe,
Philippe Tillet phil.til...@gmail.com writes:
Well, it really depends on applications. 2**15 is actually still fairly
small, since it is not exactly big enough to be bandwidth-limited (as
opposed to kernel-launch overhead limited). I'd say considering the low
bandwidth of PCI-E