----------------------------
227-digit SNFS factorization
------------------------------------------------
This announcement can be ftp-ed from:
ftp://ftp.cwi.nl/pub/herman/SNFSgiants/SNFS-227
------------------------------------------------

The factorization of the smallest composite Mersenne number without
known factor, M751 = 2^751 -1 = N, was completed on January 22, 2002,
using the Special Number Field Sieve (SNFS).  The sieving was primarily
done at CWI, at Lehigh University, the Institute for Pure Math, Bonn
University and the Institute for Applied Math, Bonn University;
with additional contributions from Arjen Lenstra, Citicorp and Paul Leyland,
Microsoft Research.  Two notable features as compared to previous large
collaborative SNFS factorizations are the use of three distinct sievers
(rather than two;  the new addition being Jens Franke's lattice siever
with rational special primes q) and further exploration of the fast linear
algebra provided by parallel Lanczos on the SARA Computing Center's Origin
3800 (an upgrade of the previous standard; our previous program used a
single CPU of the 12-processor Cray C90).

A Table of current factorization dates is given below. This is the 3rd
(200+)-digit SNFS factorization done subsequent to the factorization
of RSA-512 (ftp://ftp.cwi.nl/pub/herman/GNFSrecords/GNFS-512).
First, we found three prime factors from the 227-digit composite cofactor
(2^773+1)/(3*533371), the smallest having 55-digits.
Next was M727 = 2^727-1, which we found to have just two prime factors, the
smallest with 98-digits (cf. Peter Montgomery's post, for example at

 http://www.lehigh.edu/~bad0/msg06332.html

taken from the Mersenne archive).  The present factorization, of the
just slightly smaller 227-digit M751 = 2^751-1, is

**************
c227 = p1.p2.p3

p1 = 22764024512532445092774588186840266769462045797638178267254980648\
     7
     (66 digits)

p2 = 64935003199370114500077042183338517573218246028593057485230891498
     97
     (67 digits)

p3 = 80130680840326532624075176541872828138563939790501465580817976318\
     12307901576375946587821853073
     (94 digits)
**************

TABLE of Factorization Dates:

Number     2,773+   10,211- 2,727-  2,751- RSA-140 RSA-155a RSA-155b RSA-158
Factored by                                 Cabal   Cabal   Granlund Franke
                                                             et al.  et al.
Method      SNFS      SNFS    SNFS   SNFS   GNFS    GNFS     GNFS    GNFS
#decimals   233       211     219    227    140     155      155     158
p sizes   55,71,102 93,118  98,122 66,67,94 70,70   78,78    78,78   73,86
Factored    Nov       Apr     Aug    Jan    Feb     Aug      Oct     Jan
in          2000      1999    2001   2000   1999    1999     2000    2002


For 2,751- we used the two polynomials

            f(X) = 2X^6 - 1
            g(X) = X - 2^125

with common root m = 2^125 (mod N).

All relations used had large primes bounded by 10^9 (although some
relations were found by sieving with a bound of 4*10^8).  The relations
found at CWI used large memory line-sieving jobs, with factor base bounds
of 110M.  Lattice sieving with algebraic special-q was done by Dodson,
Lenstra and Leyland, with factor base bounds 16.7M.  (These relations
are referred to below as "Dodson"'s.)  Lattice sieving with rational
special-q was done by Franke (who acknowledges the contribution of computing
time from members of the scientific computing department at Bonn and useful
suggestions by T. Kleinjung for improving the siever code).  Franke's
initial relations were found with a small sieving region and factor bases;
but most were found with a sieving region of 16385x8192 (similar to the
region used on the algebraic side), and factor base bounds 32M.

Use of three sievers introduces additional opportunity for a single
relation to be found more than once, which complicates both reliably
determining when sieving is complete and evaluating the relative
performance of the sievers.  We describe the numbers of relations
found so as to clarify this point.  First, line sieving produces no
self-duplicates, and CWI's total of 24.058 million relations were all
distinct.  Dodson's relations were filtered at Lehigh to remove duplicates
(which occur when a single relation occurs with more than one algebraic
special-q in the range of q's that were sieved), with 32.500 million
distinct relations. Franke still had data from the recent GNFS 158-digit
record occupying disk space, and sent most of his raw data by mail to CWI,
on 3 CD's. After removing duplicates (with more than one rational special-q)
Franke had 44.442 million distinct relations.  We analyzed duplications
among these three sets of relations, corresponding to the three sieving
programs, a total of 101 million relations.
There were a total of 84.314 million distinct relations, which occurred
as follows:

       13.361 M by CWI only
       24.408 M by Dodson only
       30.928 M by Franke only
        2.100 M by both CWI and Dodson
        7.523 M by both CWI and Franke
        4.917 M by both Dodson and Franke
        1.073 M by all three sieving programs
      -------
       84.314 M total distinct (exact values available from Peter,
                                there was a perfect match in the counts)

which turns out to have been about 10% more than we needed, allowing
additional heavy rows to be removed (in the filtering step) before starting
the matrix.  We observe that, in this instance, Franke's relations
(rational q's) duplicated CWI's (line sieving) more than Dodson's
(algebraic q's);  a phenomenon that may be explained by the small sieving
region used to find the initial c. 20% of the relations.

A filter program [Stefania Cavallar, Strategies in filtering in the number
field sieve, pp. 209-231 in: Wieb Bosma (Ed.), Algorithmic Number Theory
(4th International Symposium, ANTS-IV, Leiden, The Netherlands, July 2000),
LNCS # 1838, Berlin, Springer, 2000] transformed the 84.3M relations into
5.1 million sets, with 4.9 million ideals of norm > 2 million appearing
somewhere with odd exponents.  These sets had a combined 28.0 million
relations (some appearing in multiple sets), with at most 13 relations per
set.  After appending extra columns for free relations and extra rows for
prime ideals < 2 million (but above 180), the matrix had 5165790 rows and
5185901 columns with 359525164 1's, 69 entries/row.

The Block Lanczos program took 64 hours elapsed time on 25 processors, as
compared to 25 hours on 25 processors for the M727-matrix. The processors
are 500 MHz MIPS R14000.  Total calendar time for processing the sieving
data and getting the factorization, after its arrival at CWI, was under two
weeks.

We ran two concurrent copies of the square root program on CWI's
32-processor Origin 2000, each copy using one processor.
The first copy got a trivial factorization on its first dependency, later
finding the p66 on its second dependency and p67 on its third dependency.
This copy took 10.5 hours/dependency plus 3.6 hours for quadratic
character tests (35 hours overall).
The second copy, which took only 6.1 hours/dependency due to a different
tuning parameter, found the p94 three times before the first copy had found
the p66. Later the second copy found p94 a fourth time, followed by a
trivial factorization and finally p66 (40 hours overall).

* P +-1 factors

227640245125324450927745881868402667694620457976381782672549806487
    p66
227640245125324450927745881868402667694620457976381782672549806486
    2.9.751.294641.p56
227640245125324450927745881868402667694620457976381782672549806488
    8.6651821853617.136241126419733160532013
    31398595528539042286410894991

6493500319937011450007704218333851757321824602859305748523089149897
    p67
6493500319937011450007704218333851757321824602859305748523089149896
    8.3.7.17.17.647.751.38273.257021957.2537261620168202723
    11028070721134373040809603
6493500319937011450007704218333851757321824602859305748523089149898
    2.71.26292652373563702220343453802747.1739226413630785782490582253958377

8013068084032653262407517654187282813856393979050146558081797631812307901576375946587821853073
    p94
8013068084032653262407517654187282813856393979050146558081797631812307901576375946587821853072
    16.3.751.1433.19777.2111446855467627361.40440513178908580709
    5972626530720172492999.15379736065763355855479
8013068084032653262407517654187282813856393979050146558081797631812307901576375946587821853074
    2.121.29.6941503.26561393.194251754724023.1047717590961575751282643.p38


Acknowledgments
----------------
We thank the Dutch National Computing Facilities Foundation NCF for
providing access to the SGI Origin 3800 at SARA (the Academic Computing
Centre Amsterdam); Lehigh University for use of SGI workstations and an SGI
Origin 2000;  the Institute for Pure Mathematics, Bonn University and the
Institute for Applied Mathematics, Bonn University (especially the
Scientific Computing Department) for use of a large number of PCs.

Bruce Dodson, Lehigh University
Jens Franke, Institute for Pure Mathematics, Bonn University
Arjen K. Lenstra, Citicorp
Paul Leyland, Microsoft Research
Peter L. Montgomery, Microsoft Research and CWI
Herman te Riele, CWI
_________________________________________________________________________
Unsubscribe & list info -- http://www.ndatech.com/mersenne/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers

Reply via email to