On 18 Aug 99, at 13:37, Simon Burge wrote:

> % ./MacLucasUNIX -C -S 10 1400001
> speed: 10 iters in  0.152 seconds, 0.015 iters/sec (fft len   64k)
> % ./MacLucasUNIX -C -S 10 2900001
> speed: 10 iters in  0.362 seconds, 0.036 iters/sec (fft len  128k)
> % ./MacLucasUNIX -C -S 10 5800001
> speed: 10 iters in  0.782 seconds, 0.078 iters/sec (fft len  256k)
> % ./MacLucasUNIX -C -S 10 11600001
> speed: 10 iters in  1.777 seconds, 0.178 iters/sec (fft len  512k)
> % ./MacLucasUNIX -C -S 10 23200001
> speed: 10 iters in  4.606 seconds, 0.461 iters/sec (fft len 1024k)
> % ./MacLucasUNIX -C -S 10 46400001
> speed: 10 iters in 13.601 seconds, 1.360 iters/sec (fft len 2048k)

Interesting.

Could I suggest that your figures may be a bit misleading. The point 
is that, when the remaindering operation kicks in, roundoff errors 
start to take effect & MacLucasUNIX generally restarts with the next 
higher FFT size. You should really be running at least 100 iterations 
to be sure that you have the appropriate FFT size for the exponent, 
and that the timing isn't distorted by not running any code needed to 
implement the remaindering operation.

I find, running MLU on a Alpha 21164-533, 128K FFT works up to about 
exponent 2.35 million, & pro rata. MLU on a Sparc seems to be able to 
run a bit higher, somewhere around 2.45 million seems to be OK for a 
128K FFT. Mind you, a Ultra IIi-300 is only about 0.4x the speed of a 
Alpha 21164-533, running MLU compiled using gcc 2.8.1 on both 
systems.

The timings I have - from complete double tests - are 
128K FFT, 25000 iters/27 minutes = 0.065 sec/iter
256K FFT, 10000 iters/31 minutes = 0.186 sec/iter
512K FFT, 5000 iters/27 minutes = 0.324 sec/iter

(Ultra IIi-300, 256K FFT, 5000 iters/33 mins = 0.396 sec/iter)

512K FFT runs nicely in only 64MB, but 1024K wouldn't.

For short tests of 400 iterations (for QA testing) I've run lucdwt 
(from Richard Crandall's giantint package, with minor modifications 
to output) on exponents up to nearly 80 million i.e. 4096K FFT. This 
just about fitted into 256MB; I was unable to proceed to 8192K FFT 
since I have only 320 MB on my system (& can't justify buying any 
more - in any case, Prime95 v19 gives up at ~79,600,000). These tests 
were running at 90 to 100 iterations per hour for 4096K FFT. See 
ftp://lettuce.edsc.ulst.ac.uk/gimps/PrimeQA/QADATA.TXT

BTW here in the UK you can purchase a complete Alpha 21164-533 system 
with a decent hard drive & 128MB RAM, preloaded with RedHat linux, 
for under 1500 pounds sterling. See http://www.compusys.co.uk/
(please forgive me for "advertising"; I have no connection to this 
company except as a customer).

Regards
Brian Beesley
_________________________________________________________________
Unsubscribe & list info -- http://www.scruz.net/~luke/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers

Reply via email to