The Elliptic Curve Factorization Method¶
The elliptic curve factorization method (ECM) is the fastest way to
factor a known composite integer if one of the factors is
relatively small (up to approximately 80 bits / 25 decimal digits). To
factor an arbitrary integer it must be combined with a primality
test. The ECM.factor()
method is an example for how to combine
ECM with a primality test to compute the prime factorization of integers.
Sage includes GMP-ECM, which is a highly optimized implementation of Lenstra’s elliptic curve factorization method. See https://gitlab.inria.fr/zimmerma/ecm for more about GMP-ECM.
AUTHORS:
These people wrote GMP-ECM: Pierrick Gaudry, Jim Fougeron, Laurent Fousse, Alexander Kruppa, Dave Newman, Paul Zimmermann
BUGS:
Output from ecm is non-deterministic. Doctests should set the random seed, but currently there is no facility to do so.
- class sage.interfaces.ecm.ECM(B1=10, B2=None, **kwds)[source]¶
Bases:
SageObject
Create an interface to the GMP-ECM elliptic curve method factorization program.
See https://gitlab.inria.fr/zimmerma/ecm
INPUT:
B1
– integer; Stage 1 boundB2
– integer; Stage 2 bound (or interval B2min-B2max)
In addition the following keyword arguments can be used:
x0
– integer \(x\); use \(x\) as initial pointsigma
– integer \(s\); use s as curve generator [ecm]A
– integer \(a\); use a as curve parameter [ecm]k
– integer \(n\); perform \(>= n\) steps in stage 2power
– integer \(n\); use \(x^n\) for Brent-Suyama’s extensiondickson
– integer \(n\); use \(n\)-th Dickson’s polynomial for Brent-Suyama’s extensionc
– integer \(n\); perform \(n\) runs for each inputpm1
– boolean; perform P-1 instead of ECMpp1
– boolean; perform P+1 instead of ECMq
– boolean; quiet modev
– boolean; verbose modetimestamp
– boolean; print a time stamp with each numbermpzmod
– boolean; use GMP’s mpz_mod for mod reductionmodmuln
– boolean; use Montgomery’s MODMULN for mod reductionredc
– boolean; use Montgomery’s REDC for mod reductionnobase2
– boolean; disable special base-2 codebase2
– integer \(n\); force base 2 mode with 2^n+1 (n>0) or 2^n-1 (n<0)save
– string filename; save residues at end of stage 1 to filesavea
– string filename; Like -save, appends to existing filesresume
– string filename; resume residues from file, reads from stdin if file is “-”primetest
– boolean; perform a primality test on inputtreefile
– string; store product tree of F in files f.0 f.1 …i
– integer; increment B1 by this constant on each runI
– integer \(f\); auto-calculated increment for B1 multiplied by \(f\) scale factorinp
– string; use file as input (instead of redirecting stdin)b
– boolean; use breadth-first mode of file processingd
– boolean; use depth-first mode of file processing (default)one
– boolean; stop processing a candidate if a factor is found (looping mode )n
– boolean; run ecm in ‘nice’ mode (below normal priority)nn
– boolean; run ecm in ‘very nice’ mode (idle priority)t
– integer \(n\); trial divide candidates before P-1, P+1 or ECM up to \(n\)ve
– integer \(n\); verbosely show short (\(< n\) character) expressions on each loopB2scale
– integer; multiplies the default B2 valuego
– integer; preload with group order val, which can be a simple expression, or can use N as a placeholder for the number being factoredprp
– string; use shell command cmd to do large primality testsprplen
– integer; only candidates longer than this number of digits are ‘large’prpval
– integer; value>=0 which indicates the prp command foundnumber to be PRPprptmp
– file; outputs n value to temp file prior to running (NB. gets deleted)prplog
– file; otherwise get PRP results from this file (NB. gets deleted)prpyes
– string; literal string found in prplog file when number is PRPprpno
– string; literal string found in prplog file when number is composite
- factor(n, factor_digits=None, B1=2000, proof=False, **kwds)[source]¶
Return a probable prime factorization of \(n\).
Combines GMP-ECM with a primality test, see
is_prime()
. The primality test is provable or probabilistic depending on the \(proof\) flag.Moreover, for small \(n\) PARI is used directly.
Warning
There is no mathematical guarantee that the factors returned are actually prime if
proof=False
(default). It is extremely likely, though. Currently, there are no known examples where this fails.INPUT:
n
– positive integerfactor_digits
– integer orNone
(default); optional guess at how many digits are in the smallest factorB1
– initial lower bound, defaults to 2000 (15 digit factors); used iffactor_digits
is not specifiedproof
– boolean (default:False
); whether to prove that the factors are primekwds
– keyword arguments to pass to ecm-gmp; see help forECM
for more details
OUTPUT: list of integers whose product is \(n\)
Note
Trial division should typically be performed, but this is not implemented (yet) in this method.
If you suspect that n is the product of two similarly-sized primes, other methods (such as a quadratic sieve – use the qsieve command) will usually be faster.
The best known algorithm for factoring in the case where all factors are large is the general number field sieve. This is not implemented in Sage; You probably want to use a cluster for problems of this size.
EXAMPLES:
sage: ecm.factor(602400691612422154516282778947806249229526581) [45949729863572179, 13109994191499930367061460439] sage: ecm.factor((2^197 + 1)/3) # long time [197002597249, 1348959352853811313, 251951573867253012259144010843] sage: ecm.factor(179427217^13) == [179427217] * 13 True
>>> from sage.all import * >>> ecm.factor(Integer(602400691612422154516282778947806249229526581)) [45949729863572179, 13109994191499930367061460439] >>> ecm.factor((Integer(2)**Integer(197) + Integer(1))/Integer(3)) # long time [197002597249, 1348959352853811313, 251951573867253012259144010843] >>> ecm.factor(Integer(179427217)**Integer(13)) == [Integer(179427217)] * Integer(13) True
- find_factor(n, factor_digits=None, B1=2000, **kwds)[source]¶
Return a factor of n.
See also
factor()
if you want a prime factorization of \(n\).INPUT:
n
– positive integerfactor_digits
– integer orNone
(default); decimal digits estimate of the wanted factorB1
– integer; Stage 1 bound (default: 2000). This is used as bound iffactor_digits
is not specifiedkwds
– optional keyword parameters
OUTPUT:
List of integers whose product is n. For certain lengths of the factor, this is the best algorithm to find a factor.
Note
ECM is not a good primality test. Not finding a factorization is only weak evidence for \(n\) being prime. You should run a good primality test before calling this function.
EXAMPLES:
sage: f = ECM() sage: n = 508021860739623467191080372196682785441177798407961 sage: f.find_factor(n) [79792266297612017, 6366805760909027985741435139224233]
>>> from sage.all import * >>> f = ECM() >>> n = Integer(508021860739623467191080372196682785441177798407961) >>> f.find_factor(n) [79792266297612017, 6366805760909027985741435139224233]
Note that the input number cannot have more than 4095 digits:
sage: f = 2^2^14+1 sage: ecm.find_factor(f) Traceback (most recent call last): ... ValueError: n must have at most 4095 digits
>>> from sage.all import * >>> f = Integer(2)**Integer(2)**Integer(14)+Integer(1) >>> ecm.find_factor(f) Traceback (most recent call last): ... ValueError: n must have at most 4095 digits
- get_last_params()[source]¶
Return the parameters (including the curve) of the last ecm run.
In the case that the number was factored successfully, this will return the parameters that yielded the factorization.
OUTPUT:
A dictionary containing the parameters for the most recent factorization.
EXAMPLES:
sage: ecm.factor((2^197 + 1)/3) # long time [197002597249, 1348959352853811313, 251951573867253012259144010843] sage: ecm.get_last_params() # random output {'poly': 'x^1', 'sigma': '1785694449', 'B1': '8885', 'B2': '1002846'}
>>> from sage.all import * >>> ecm.factor((Integer(2)**Integer(197) + Integer(1))/Integer(3)) # long time [197002597249, 1348959352853811313, 251951573867253012259144010843] >>> ecm.get_last_params() # random output {'poly': 'x^1', 'sigma': '1785694449', 'B1': '8885', 'B2': '1002846'}
- interact()[source]¶
Interactively interact with the ECM program.
EXAMPLES:
sage: ecm.interact() # not tested
>>> from sage.all import * >>> ecm.interact() # not tested
- one_curve(n, factor_digits=None, B1=2000, algorithm='ECM', **kwds)[source]¶
Run one single ECM (or P-1/P+1) curve on input n.
Note that trying a single curve is not particularly useful by itself. One typically needs to run over thousands of trial curves to factor \(n\).
INPUT:
n
– positive integerfactor_digits
– integer; decimal digits estimate of the wanted factorB1
– integer; Stage 1 bound (default: 2000)algorithm
– either “ECM” (default); “P-1” or “P+1”
OUTPUT:
A list
[p, q]
where p and q are integers and n = p * q. If no factor was found, then p = 1 and q = n.Warning
Neither p nor q in the output is guaranteed to be prime.
EXAMPLES:
sage: f = ECM() sage: n = 508021860739623467191080372196682785441177798407961 sage: f.one_curve(n, B1=10000, sigma=11) [1, 508021860739623467191080372196682785441177798407961] sage: f.one_curve(n, B1=10000, sigma=1022170541) [79792266297612017, 6366805760909027985741435139224233] sage: n = 432132887883903108009802143314445113500016816977037257 sage: f.one_curve(n, B1=500000, algorithm='P-1') [67872792749091946529, 6366805760909027985741435139224233] sage: n = 2088352670731726262548647919416588631875815083 sage: f.one_curve(n, B1=2000, algorithm='P+1', x0=5) [328006342451, 6366805760909027985741435139224233]
>>> from sage.all import * >>> f = ECM() >>> n = Integer(508021860739623467191080372196682785441177798407961) >>> f.one_curve(n, B1=Integer(10000), sigma=Integer(11)) [1, 508021860739623467191080372196682785441177798407961] >>> f.one_curve(n, B1=Integer(10000), sigma=Integer(1022170541)) [79792266297612017, 6366805760909027985741435139224233] >>> n = Integer(432132887883903108009802143314445113500016816977037257) >>> f.one_curve(n, B1=Integer(500000), algorithm='P-1') [67872792749091946529, 6366805760909027985741435139224233] >>> n = Integer(2088352670731726262548647919416588631875815083) >>> f.one_curve(n, B1=Integer(2000), algorithm='P+1', x0=Integer(5)) [328006342451, 6366805760909027985741435139224233]
- recommended_B1(factor_digits)[source]¶
Return recommended
B1
setting.INPUT:
factor_digits
– integer; number of digits
OUTPUT:
Integer. Recommended settings from http://www.mersennewiki.org/index.php/Elliptic_Curve_Method
EXAMPLES:
sage: ecm.recommended_B1(33) 1000000
>>> from sage.all import * >>> ecm.recommended_B1(Integer(33)) 1000000
- time(n, factor_digits, verbose=False)[source]¶
Print a runtime estimate.
BUGS:
This method should really return something and not just print stuff on the screen.
INPUT:
n
– positive integerfactor_digits
– the (estimated) number of digits of the smallest factor
OUTPUT:
An approximation for the amount of time it will take to find a factor of size factor_digits in a single process on the current computer. This estimate is provided by GMP-ECM’s verbose option on a single run of a curve.
EXAMPLES:
sage: n = next_prime(11^23)*next_prime(11^37) sage: ecm.time(n, 35) # random output Expected curves: 910, Expected time: 23.95m sage: ecm.time(n, 30, verbose=True) # random output GMP-ECM 6.4.4 [configured with MPIR 2.6.0, --enable-asm-redc] [ECM] Running on localhost.localdomain Input number is 304481639541418099574459496544854621998616257489887231115912293 (63 digits) Using MODMULN [mulredc:0, sqrredc:0] Using B1=250000, B2=128992510, polynomial Dickson(3), sigma=3244548117 dF=2048, k=3, d=19110, d2=11, i0=3 Expected number of curves to find a factor of n digits: 35 40 45 50 55 60 65 70 75 80 4911 70940 1226976 2.5e+07 5.8e+08 1.6e+10 2.7e+13 4e+18 5.4e+23 Inf Step 1 took 230ms Using 10 small primes for NTT Estimated memory usage: 4040K Initializing tables of differences for F took 0ms Computing roots of F took 9ms Building F from its roots took 16ms Computing 1/F took 9ms Initializing table of differences for G took 0ms Computing roots of G took 8ms Building G from its roots took 16ms Computing roots of G took 7ms Building G from its roots took 16ms Computing G * H took 6ms Reducing G * H mod F took 5ms Computing roots of G took 7ms Building G from its roots took 17ms Computing G * H took 5ms Reducing G * H mod F took 5ms Computing polyeval(F,G) took 34ms Computing product of all F(g_i) took 0ms Step 2 took 164ms Expected time to find a factor of n digits: 35 40 45 50 55 60 65 70 75 80 32.25m 7.76h 5.60d 114.21d 7.27y 196.42y 337811y 5e+10y 7e+15y Inf Expected curves: 4911, Expected time: 32.25m
>>> from sage.all import * >>> n = next_prime(Integer(11)**Integer(23))*next_prime(Integer(11)**Integer(37)) >>> ecm.time(n, Integer(35)) # random output Expected curves: 910, Expected time: 23.95m >>> ecm.time(n, Integer(30), verbose=True) # random output GMP-ECM 6.4.4 [configured with MPIR 2.6.0, --enable-asm-redc] [ECM] Running on localhost.localdomain Input number is 304481639541418099574459496544854621998616257489887231115912293 (63 digits) Using MODMULN [mulredc:0, sqrredc:0] Using B1=250000, B2=128992510, polynomial Dickson(3), sigma=3244548117 dF=2048, k=3, d=19110, d2=11, i0=3 Expected number of curves to find a factor of n digits: 35 40 45 50 55 60 65 70 75 80 4911 70940 1226976 2.5e+07 5.8e+08 1.6e+10 2.7e+13 4e+18 5.4e+23 Inf Step 1 took 230ms Using 10 small primes for NTT Estimated memory usage: 4040K Initializing tables of differences for F took 0ms Computing roots of F took 9ms Building F from its roots took 16ms Computing 1/F took 9ms Initializing table of differences for G took 0ms Computing roots of G took 8ms Building G from its roots took 16ms Computing roots of G took 7ms Building G from its roots took 16ms Computing G * H took 6ms Reducing G * H mod F took 5ms Computing roots of G took 7ms Building G from its roots took 17ms Computing G * H took 5ms Reducing G * H mod F took 5ms Computing polyeval(F,G) took 34ms Computing product of all F(g_i) took 0ms Step 2 took 164ms Expected time to find a factor of n digits: 35 40 45 50 55 60 65 70 75 80 32.25m 7.76h 5.60d 114.21d 7.27y 196.42y 337811y 5e+10y 7e+15y Inf <BLANKLINE> Expected curves: 4911, Expected time: 32.25m