"Brian J. Beesley" wrote:

> Could I suggest that your figures may be a bit misleading. The point 
> is that, when the remaindering operation kicks in, roundoff errors 
> start to take effect & MacLucasUNIX generally restarts with the next 
> higher FFT size. You should really be running at least 100 iterations 
> to be sure that you have the appropriate FFT size for the exponent, 
> and that the timing isn't distorted by not running any code needed to 
> implement the remaindering operation.

I actually ran all tests (except for 332192831 - the 16M FFT was just
too slow :) for about 170 iterations, but just picked a single result.
Maybe the data would have been more meaningful if I'd left the exponent
out and just reported the FFT length.

> I find, running MLU on a Alpha 21164-533, 128K FFT works up to about 
> exponent 2.35 million, & pro rata. MLU on a Sparc seems to be able to 
> run a bit higher, somewhere around 2.45 million seems to be OK for a 
> 128K FFT. Mind you, a Ultra IIi-300 is only about 0.4x the speed of a 
> Alpha 21164-533, running MLU compiled using gcc 2.8.1 on both 
> systems.

I've got static MLU ev5 and ev6 binaries if Linux can run them under
some sort of Digital Unix emulation.  Might be interesting to see how
gcc 2.8.1 compares with DEC's C compiler.

Look at ftp://melanoma.cs.rmit.edu.au/pub/simonb/MLU-ALPHA.tar.gz for
the binaries.

> The timings I have - from complete double tests - are 
> 128K FFT, 25000 iters/27 minutes = 0.065 sec/iter
> 256K FFT, 10000 iters/31 minutes = 0.186 sec/iter
> 512K FFT, 5000 iters/27 minutes = 0.324 sec/iter

No matter which way you look at it, the 21264 is fast :-)

> For short tests of 400 iterations (for QA testing) I've run lucdwt 
> (from Richard Crandall's giantint package, with minor modifications 
> to output) on exponents up to nearly 80 million i.e. 4096K FFT. This 
> just about fitted into 256MB; I was unable to proceed to 8192K FFT 
> since I have only 320 MB on my system (& can't justify buying any 
> more - in any case, Prime95 v19 gives up at ~79,600,000). These tests 
> were running at 90 to 100 iterations per hour for 4096K FFT. See 
> ftp://lettuce.edsc.ulst.ac.uk/gimps/PrimeQA/QADATA.TXT

The file format seems to be

        exponent,iter-count,residue,??,??

Does the "lucdwt" mean that lucdwt was used to generate the file (and
Prime95 is tested against it)?  And what's the last field (which some
lines don't have)?

> BTW here in the UK you can purchase a complete Alpha 21164-533 system 
> with a decent hard drive & 128MB RAM, preloaded with RedHat linux, 
> for under 1500 pounds sterling.

A few of us Aussies recently purchased some PC164-500 motherboards
(500MHz 21164) for $US250 - easily the fastest computer I own now!

Simon.
_________________________________________________________________
Unsubscribe & list info -- http://www.scruz.net/~luke/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers

Reply via email to