On 04/23/12 15:32, Torbjorn Granlund wrote: > Richard Henderson <[email protected]> writes: > > On 04/23/12 07:49, Torbjorn Granlund wrote: > > Do you know the repeat rate of umull, umlal, umaal, assuming no reg > > dependencies? > > For a8: 3 cycles. > > For a9 it seems to be 2 cycles, so 3.25 c/l for the current addmul_1 is > not very good. > I have found no timing docs, so I measured it myself:
arm.com has them, free registration required. Table B.5. Multiplication instruction cycle timings Instruction Cycles Result latency MUL(S), MLA(S) 2 4 SMULL(S), UMULL(S), SMLAL(S), UMLAL(S) 3 4 for the first written register 5 for the second written register UMAAL 3 4 for the first written register 5 for the second written register There doesn't seem to be any clear indication of the repeat rate. One is lead to believe that the multiplier is fully pipelined, but your experiment suggests that it isn't. r~ _______________________________________________ gmp-devel mailing list [email protected] http://gmplib.org/mailman/listinfo/gmp-devel
