On Thu, Feb 22, 2007 at 10:09:28PM -0800, Roland Dreier wrote: > (BTW, one thing I did notice while looking at the i386 assembly is > that one micro-optimization that might make sense to use something > like __attribute__((regparm(3))) for internal function calls within > libibverbs and libmthca on i386, since otherwise we waste instructions > pushing stuff on the stack for no reason other than compliance with > the crufty old i386 ABI. Something like a FASTCALL macro in > <infiniband/arch.h> perhaps... if anyone really cares about 32-bit > i386 performance any more)
Newer gccs have the -fwhole-program --combine options that address this and more. One of the things that happens is that all internal functions are made 'static' and all compilation units are optimized in one go. gcc will optimize calling convention and alot of other things for static functions. That should provide an across the board micro-improvement even on x86-64. Jason _______________________________________________ openib-general mailing list [email protected] http://openib.org/mailman/listinfo/openib-general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
