On 6/2/06, Canqun Yang <[EMAIL PROTECTED]> wrote:
This patch results a performance increase of 4% for SPECfp2000 and 13% for NAS 
benchmark suite on
Itanium-2 system, respectively. More performance increase is hopeful by further 
tuning the
parameters and improving the prefetch algorithm at tree level.

Bravo.

--- ia64.h (revision 114307)
+++ ia64.h (working copy)
@@ -1985,13 +1985,18 @@
   ??? This number is bogus and needs to be replaced before the value is
   actually used in optimizations.  */

-#define SIMULTANEOUS_PREFETCHES 6
+#define SIMULTANEOUS_PREFETCHES 18

Is the number still bogus as the comment suggests, or is there a
rationale for 18?  It looks quite high.

+/* A number that should roughly corresponding to the nunmber of instructions
+   executed before the prefetch is completed.  */
+
+#define PREFETCH_LATENCY 400

Likewise.  Is 400 cycles the memory latency on itanium-2?

Gr.
Steven

Reply via email to