Vladimir Panteleev wrote:
On Monday, 26 December 2011 at 17:37:17 UTC, Piotr Szturmaj wrote:
Yes. Here are the results: http://pastebin.com/rD8kiaQy. This is
observed only with Windows DMD.
I'd be more interested in seeing the code.
Sorry for late answer. For memcpy cases code is the same as in my github
Phobos fork. Here is the change to slice copying:
http://pastebin.com/EteqEper
I've done some more research on this. In release builds, DMD on Windows
emits a memcpy call for a slice copy. However, the auto-generated memcpy
call has slightly less overhead (register/stack shuffling) than a manual
memcpy call, which explains the performance difference I was seeing.