On Wed, Nov 10, 2010 at 22:08, e-mail number.cruncher < number.crunc...@ntlworld.com> wrote:
> In short, someone from Intel submitted a glibc patch that does faster > memcpy's on e.g. Intel i7, respects the ISO C definition, but does > things backwards. > However, the commit message and mailing list, as far as I can tell, does not explain how the implementations were benchmarked. Linus claims that his (entirely trivial) implementation matches or beats the new one. If indeed the performance gains claimed by Lu (2X to 4X) are real, then the old implementation must have been truly horrible (as stated by Agner Fog in http://sourceware.org/ml/libc-help/2008-08/msg00007.html). I'd like to see the benchmark results demonstrating that the backward memcpy is really faster than forward. > I think any software that ignores the ISO warning > "If copying takes place between objects that overlap, the behavior is > undefined" needs fixing. > Absolutely, it is incorrect and should be fixed. Jed