On Fri, Feb 9, 2018 at 11:02 AM, Joerg Roedel <jroe...@suse.de> wrote:
>
> Okay, I used movsb because I remembered that being the recommendation
> for the most efficient memcpy, and it saves me an instruction. But that
> is probably only true on modern CPUs.

Yeah, it's only true on the very latest uarchs, and even there it's not
perfect for small copies.

On the older machines that are relevant for 32-bit code, it's often
tens of cycles just for the ucode overhead, I think, and "rep movsb"
actually does things literally a byte at a time.

                Linus
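
For reference, a minimal sketch of the two copy styles being contrasted,
assuming GCC/Clang extended asm on x86. The function names
copy_rep_movsb() and copy_rep_movsl() are hypothetical and are not taken
from the patch under discussion. The second variant spends the extra
instruction to move 32-bit words first and leaves only the last 0-3
bytes to "rep movsb". Non-x86 builds fall back to a plain byte loop so
the sketch still compiles.

```c
#include <stddef.h>

/* Hypothetical helper: the whole copy as a single "rep movsb". */
static void copy_rep_movsb(void *dst, const void *src, size_t len)
{
#if defined(__i386__) || defined(__x86_64__)
	/* DI/SI/CX are updated by the string instruction, hence "+" */
	__asm__ volatile("rep movsb"
			 : "+D" (dst), "+S" (src), "+c" (len)
			 : : "memory");
#else
	unsigned char *d = dst;
	const unsigned char *s = src;

	while (len--)
		*d++ = *s++;
#endif
}

/* Hypothetical helper: 32-bit words first, then the 0-3 byte tail. */
static void copy_rep_movsl(void *dst, const void *src, size_t len)
{
#if defined(__i386__) || defined(__x86_64__)
	size_t dwords = len >> 2;	/* number of 4-byte units */
	size_t tail = len & 3;		/* leftover bytes */

	__asm__ volatile("rep movsl"
			 : "+D" (dst), "+S" (src), "+c" (dwords)
			 : : "memory");
	/* dst/src already point past the words copied above */
	__asm__ volatile("rep movsb"
			 : "+D" (dst), "+S" (src), "+c" (tail)
			 : : "memory");
#else
	copy_rep_movsb(dst, src, len);
#endif
}
```

Both helpers produce the same result as memcpy() for non-overlapping
buffers. Which one is faster depends on the microarchitecture, so this
only illustrates the shape of the code, not a measured win.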