On Sat, 19 Feb 2011 18:13:44 +0100, Sebastian Haase wrote:
> Thanks a lot. Very informative. I guess what you say about "cache line
> is dirtied" is related to the info I got with valgrind (see my email in
> this thread: L1 Data Write Miss 3636). Can one assume that the cache
> line is always a few mega bytes ?

Cache lines are typically much smaller, 16-512 bytes.

In this specific case, since the stride of the `i` loop is only 
2*sizeof(float) = 16 bytes << cache line size, threads running with 
different `i` tend to write to the same cache lines.

-- 
Pauli Virtanen

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

Reply via email to