A note on the convergence of the secant method for simple and multiple roots∗ Pedro D´IEZ† To appear in Applied Mathematics Letters

Abstract The secant method is one of the most popular methods for root finding. Standard text books in numerical analysis state that the secant method is super linear: the rate of convergence is set by the gold number. Nevertheless, this property holds only for simple roots. If the multiplicity of the root is larger than one, the convergence of the secant method becomes linear. This communication includes a detailed analysis of the secant method when it is used to approximate multiple roots. Thus, a proof of the linear convergence is shown. Moreover, the values of the corresponding asymptotic convergence factors are determined and are found to be also related with the golden ratio.

Keywords: Secant method, Convergence rate, Asymptotic behavior AMS subject classification: 65H05

1

Introduction

The secant method is a variation of the Newton-Raphson method due to Newton himself [1]. The Newton-Raphson method converges quadratically to simple roots. Nevertheless, the convergence degenerates to linear when approximating a root with multiplicity m larger than 1. On the other hand, it is well known that the secant method is superlinear for simple roots. It has also been noticed that the super linear behavior is lost for multiple roots. This note recalls a compact proof of the convergence properties of the Newton-Raphson method and performs an accurate analysis of the secant method in front of multiple roots. ∗ This

work was supported by the Ministerio de Ciencia y Tecnolog´ıa, Spain, DPI2001-2204 Matem` atica Aplicada III, Universitat Polit` ecnica de Catalunya, Campus Nord E-08034 Barcelona, Spain ([email protected]). † Dept.

1

1.1

Assessing the convergence

The goal of an iterative solver for nonlinear scalar equations is to approximate α such that f (α) = 0, being f ©a given function. ª The iterative method furnishes a sequence of approximations x0 , x1 , x2 , . . . . The analysis of the convergence of method © 0such ª is based on characterizing the evolution of the error sequence e , e1 , e2 , . . . , where ek := α − xk . The method is said to be convergent if ¯ ¯ lim ¯ek ¯ = 0. (1) k→∞

More precisely, the method is said to have convergence of order p if it exists λ such that ¯ k+1 ¯ ¯e ¯ (2) lim p ≤ λ, k k→∞ |e |

where λ is denoted Asymptotic Error Constant (AEC) [3]. When p = 1, convergence is reached only if λ ≤ 1 (linear convergence). Assuming that the method is convergent, that is Eq. (1) holds, the order of convergence of the method is studied using the following truncated Taylor expansions: ³¯ ¯ 3 ´ ¡ ¢ 1 f xk = −f 0 (α)ek + f 00 (α)(ek )2 + O ¯ek ¯ (3) 2 (recall that f (α) = 0) and ³¯ ¯ 2 ´ ¡ ¢ f 0 xk = f 0 (α) − f 00 (α)ek + O ¯ek ¯ (4) If α is of multiplicity m > 1 (that is, f (α) = f 0 (α) = · · · = f m−1) (α) = 0), Eq. (3) is replaced by ³¯ ¯m+1 ´ ¡ ¢m ¡ ¢ (−1)m m) f (α) ek + O ¯ ek ¯ (5) f xk = m! and Eq. (4) is replaced by

³¯ ¯ m ´ ¡ ¢ (−1)m−1 m) ¡ ¢m−1 f 0 xk = f (α) ek + O ¯ ek ¯ . (m − 1)!

1.2

(6)

Convergence of Newton-Raphson method

The analysis of the Newton-Raphson method is recalled because both the framework and the results are very similar to those of the secant method. NewtonRaphson iteration generates a sequence of approximations using the following expression: ¡ ¢ f xk xk+1 = xk − 0 k . (7) f (x ) The corresponding expression for the error is ¡ ¢ f xk k+1 k e =e + 0 k f (x ) 2

(8)

The analysis of the behavior of the method for simple roots (f 0 (α) 6= 0) is performed replacing Eqs. (3) and (4) in Eq. (8), that is, ³¯ ¯ ´ 3 − 12 f 00 (α)(ek )2 + O ¯ek ¯ k+1 ´. ³ e = (9) 2 f 0 (α) − f 00 (α)ek + O |ek | Assuming that the method is convergent, in the asymptotic range (for k → ∞) the higher order terms in Eq. (9) are neglected. In the following the “asymptotic equality” is denoted by ³ and is used when the equality holds for k → ∞. Then, ¯ k+1 ¯ ¯e ¯ − 1 f 00 (α)(ek )2 |f 00 (α)| = and lim . (10) ek+1 ³ 2 0 2 k→∞ |ek | f (α) 2 |f 0 (α)| Therefore the convergence is quadratic (p = 2). On the contrary, when the root is of multiplicity m > 1, the analysis is performed replacing Eqs. (5) and (6) in Eq. (8), that is, ³¯ ¯ ´ ¡ ¢m m+1 ¡¯ k ¯¢ (−1)m m) f (α) ek + O ¯ ek ¯ ¯ ¯ m! k+1 k k 1 − 1/m + O e e = e + (−1)m−1 = e . ¡ ¢ m−1 m 1 + O (|ek |) f m) (α) (ek ) + O |ek | (m−1)!

(11)

Then, in the asymptotic range, ¶ µ 1 k+1 ek e ³ 1− m

that is

lim

k→∞

¯ k+1 ¯ ¯e ¯ |ek |

=1−

1 . m

(12)

Consequently, for roots of multiplicity m, the method exhibits linear convergence with AEC equal to 1 − 1/m. In particular, for double roots (m = 2) the AEC is 1/2. The same kind of analysis is performed to derive the convergence properties of the secant method for both simple and multiple roots.

2

Analysis of the secant method

The secant method may be seen as a variation of the Newton-Raphson method. The derivative in Eq. (7) is replaced by an incremental rate: ¡ ¢ ¡ ¢ ¡ k ¢ f xk − f xk−1 0 f x ≈ . (13) xk − xk−1

Thus, the secant iteration is obtained replacing Eq. (13) in Eq. (7), that is ¡ ¢ ¡ ¢ xk−1 f xk − xk f xk+1 k+1 . (14) x = f (xk ) − f (xk−1 )

The corresponding sequence of errors is generated by the following expression: ¡ ¢ ¡ ¢ ek−1 f xk − ek f xk+1 ek+1 = . (15) f (xk ) − f (xk−1 ) 3

2.1

Behavior for simple roots

Let α be a simple root (f 0 (α) 6= 0). Assume that the method is convergent (that is, Eq. (1) holds). Introducing the expansion of Eq. (3) and the corresponding ¡ ¢ version for f xk−1 in Eq. (15) one gets ³¯ ¯ ´ 4 1/2f 00 (α)ek ek−1 + O ¯ek ¯ k+1 ³ ´, e = (16) 3 f 0 (α) + 1/2f 00 (α)(ek + ek−1 ) + O |ek | ¯r ¢ ¡¯ where, since convergence is assumed, the term O ¯ek−1 ¯ dominates the term ¯ ¡¯ r ¢ O ¯ek ¯ for r = 3 and r = 4. Then, the asymptotic version of Eq. (16) (for k → ∞) is f 00 (α) k k−1 e e . (17) ek+1 ³ 0 2f (α) Extracting the logarithm of Eq. (17) and introducing the notation ak := ln ek , yields ln ek+1 ³ C + ln ek + ln ek−1 and ak+1 ³ C + ak + ak−1 , (18) ³ 00 ´ f (α) k where C := ln 2f 0 (α) . That is, the sequence {a } (k = 1, 2, . . .) is a Fibonacci numerical sequence. Assume the following asymptotic behavior ak+1 ³ Φak + β

(19)

Then, imposing (18) for k and k + 1, the only possible (positive) values for Φ and β are found to be √ C 5+1 Φ= ≈ 1.618 and β = (20) 2 Φ Of course, Φ is the gold number related with the Fibonacci sequences. To verify that the assumption introduced in Eq. (19) is valid in the asymptotic range, we introduce the error associated with this hypothesis, that is ¡ ¢ εk+1 := ak+1 − Φak + β . (21)

We characterize the evolution of the error in the assumption of Eq. (19) replacing Eq. (21) in Eq. (18). That yields εk+1 = −

εk . Φ

(22)

That is, the error εk tends linearly to 0 because 1/Φ ≈ 0.618 < 1. Consequently, Eq. (19) tends to be verified asymptotically. Thus, in the asymptotic range, Eq. (19) holds and therefore ¡ ¢Φ ek+1 ³ exp (β) ek , 4

(23)

that is, the secant method exhibits superlinear convergence (convergence of ³ 00 ´1/Φ f (α) order p = Φ > 1) with AEC λ = exp β = 2f . The proof of this 0 (α) property is classical but not unique, see [2] for two different options and a discussion.

2.2

Behavior for multiple roots

Let α be a root with multiplicity m > 1. the expansion of Eq. (5) and the corresponding version for ¡ Introducing ¢ f xk−1 in Eq. (15) one gets ³¯ ¯m+2 ´ ¡ ¢m ¡ ¢m ek−1 ek − ek ek−1 + O ¯ek−1 ¯ ´ ³ , (24) ek+1 = m m m+1 (ek ) − (ek−1 ) + O |ek−1 | ¯r ¢ ¡¯ where, since convergence is assumed, the term O ¯ek−1 ¯ dominates the term ¯ ¡¯ r ¢ O ¯ek ¯ for r = m + 1 and r = m + 2. 2.2.1

Double roots (m = 2)

³¯ ³¯ ¯3 ´ ¯4 ´ For double roots, neglecting the terms O ¯ek−1 ¯ and O ¯ek−1 ¯ , Eq. (24) becomes ek−1 ek , (25) ek+1 ³ k e + ek−1 Eq. (25) may be rewritten as 1 1 1 ³ k + k−1 . ek+1 e e

(26)

Thus, the sequence of 1/ek , k = 0, 1, 2, . . . is a Fibonacci sequence (with C = 0). Then, according to the previous section, the following expressions hold in the asymptotic regime: 1 1 ³Φ k ek+1 e

and therefore ek+1 ³

1 k e . Φ

(27)

Recall that 1/Φ = 1 − Φ ≈ 0.618. Thus, for double roots, the secant method exhibits linear convergence with AEC equal to 1 − Φ. 2.2.2

Multiplicity larger than 2 (m > 2)

The superlinear converge is lost for double roots. Then, for m > 2, the secant method is also assumed to be linear with AEC equal to λ. Thus, in the asymptotic range, the following equality holds ek+1 ³ λek . 5

(28)

Multiplicity (m)

Order (p)

1 2 3 .. .

2 1 1 .. .

m

1

AEC (λ) ¯ 00 ¯ ¯ f (α) ¯ ¯ 2f 0 (α) ¯ 1/2 2/3 .. . 1 − 1/m

Table 1: Summary of the convergence behavior of Newton-Raphson method Multiplicity (m)

Order (p)

1 2 3 .. .

Φ ≈ 1.618 1 1 .. .

m

1

AEC (λ) ¯ 00 ¯1/Φ ¯ f (α) ¯ ¯ 2f 0 (α) ¯ λ = 1/Φ ≈ 0.618 λ ≈ 0.755 .. . 0 < λ < 1 such that λm + λm−1 − 1 = 0

Table 2: Summary of the convergence behavior of the secant method An equation for λ is recovered introducing Eq. (28) in Eq. (24) and neglecting higher order terms: λ2 =

λm − λ ⇒ λ(λ − 1)(λm + λm−1 − 1) = 0. λm − 1

(29)

This polynomial equation has a unique root such that 0 < λ < 1. This root ¡ ¢1/m is slightly lower than (and almost equal to for large m) 12 . Obviously for m = 2 the value of the previous section is recovered and for m = 3 an analytical expression is also available λ=

3

√ ´1/3 1³ 2 100 + 12 69 + ¡ √ ¢1/3 − 1/3 ≈ 0.7548776667 . 6 3 100 + 12 69

Conclusion

The analysis performed in this work proofs that the convergence behavior of the secant method is analog to behavior of the Newton-Raphson method. The convergence properties of both methods are summarized in tables 1 and 2. The analysis shows that the secant method looses the superlinear behavior when the root has multiplicity larger than one. Moreover, the AEC is characterized as one of the solutions of a polynomial equation of degree m.

6

References [1] C.T. Kelly, Iterative methods for linear and nonlinear equations, SIAM, Philadelphia, 1995. [2] G.W. Stewart, Afternotes on numerical analysis, SIAM, Philadelphia, 1996. [3] A. Ralston and P. Rabinowitz, A first course in numerical analysis, McGraw-Hill, New York, 1978.

7

A note on the convergence of the secant method for ...

Abstract. The secant method is one of the most popular methods for root finding. Standard text books in numerical analysis state that the secant method is super ...

96KB Sizes 0 Downloads 239 Views

Recommend Documents

A note on the H1-convergence of the overlapping Schwarz waveform ...
time interfaces need to be stored in the iterative process. We refer to [6,7,8,9] for the early development of the overlapping SWR method, [10,11,12,13] for convergence analyses of the method, [14,15,16] for extension to nonlinear prob- lems, [17,18,

Note on the Voice of the Customer
on the ambient room light and reflections, the colors that the software designer chooses, the ... 10. Classes have a balance between theory and real-world application. ... A wide selection of companies and industries that recruit at the business.

A NOTE ON THE NONEXISTENCE OF SUM OF ...
The coefficient of tk in pm is the trace of Sm,k(A, B) := the sum of all words of length m in A and B in which B appears exactly k times (and therefore A exactly m − k times). In his ingenious 2007 paper [Häg07], Hägele found a dimension-free alg

A NOTE ON THE TRACE THEOREM FOR DOMAINS ...
is to fill that gap and generalize that result for ω ⊂ Rn−1, n > 1, and arbitrary ..... the Texas Higher Education Coordinating Board, Advanced Research Program.

A note on moment convergence of bootstrap M-estimators
3.6.1), we view X1,X2,... as the coordinate projection on the first countably infi-. 109 ...... 360. Nishiyama, Y. (2010). Moment convergence of M-estimators. Statist.

A note on the identification of dynamic economic ...
DSGE models with generalized shock processes, such as shock processes which fol- low a VAR, have been an active area of research in recent years. Unfortunately, the structural parameters governing DSGE models are not identified when the driving pro-

A Note on the Power of Truthful Approximation ...
Aug 26, 2009 - Email: [email protected]. 1 ... The private information of each bidder is vi (notice that the ki's are private information too). We will assume that the ... Of course, we can also implement a weighted versions of VCG: Definition ..

Concept paper on the need for revision of the Note for Guidance on ...
Jul 21, 2016 - domains, e.g. cognition, are paramount across CNS disorders and warrant ... acceptance of the diagnostic and assessment tools available (7).

Concept paper on the need for revision of the Note for Guidance on ...
Jul 21, 2016 - 30 Churchill Place ○ Canary Wharf ○ London E14 5EU ○ United Kingdom. An agency of the European Union. Telephone +44 (0)20 3660 .... It is expected that the revised Guideline will be helpful in designing state of the art clinical

A NOTE ON STOCHASTIC ORDERING OF THE ... - Springer Link
Only the partial credit model (Masters, 1982) and special cases of this model (e.g., the rat- ing scale model, Andrich, 1978) imply SOL (Hemker et al., 1997, ...

On the Design Method of a Haptic Interface ... - Semantic Scholar
The Z-width which is the dynamic range of achievable impedance was introduced[2] and it represents the range of impedance when the passivity is guaranteed.

On the contact domain method: A comparison of ...
This work focuses on the assessment of the relative performance of the so-called contact domain method, using either the Lagrange multiplier or the penalty ...

On the Design Method of a Haptic Interface ... - Semantic Scholar
Abstract: A haptic interface can be a passive system with virtual coupling as a filter. Virtual coupling has been designed for satisfying passivity. However, it affects transparency of haptic interface as well as stability. This paper suggests new de

A numerical method for the computation of the ...
Considering the equation (1) in its integral form and using the condition (11) we obtain that sup ..... Stud, 17 Princeton University Press, Princeton, NJ, 1949. ... [6] T. Barker, R. Bowles and W. Williams, Development and application of a.

On the Convergence of Perturbed Non-Stationary ...
Communications Research in Signal Processing Group, School of Electrical and Computer Engineering. Cornell University, Ithaca ..... transmissions to achieve consensus, with the trade-off being that they ..... Science, M.I.T., Boston, MA, 1984.

A Note on the Inefficiency of Bidding over the Price of a ...
the class we consider, the price per unit share depends only on the first and ... (in the form of an independent audit) or endogenous (in the form of inter-agent.

On the uniform convergence of random series in ... - Project Euclid
obtain explicit representations of the jump process, and of related path func- ...... dr. Then, with probability 1 as u → ∞,. Y u. (t) → Y(t). (3.4) uniformly in t ∈ [0,1], ...

On the Convergence of Stochastic Gradient MCMC ... - Duke University
†Dept. of Electrical and Computer Engineering, Duke University, Durham, NC, USA ..... to select the best prefactors, which resulted in h=0.033×L−α for the ...

On the Convergence of Perturbed Non-Stationary ...
Cornell University, Ithaca, NY, 14853. ‡ Signal Processing and Communications Group, Electrical and Computer ...... Science, M.I.T., Boston, MA, 1984.

On the Convergence of Iterative Voting: How Restrictive ...
We study convergence properties of iterative voting pro- cedures. Such procedures are ... agent systems that involve entities with possibly diverse preferences.

On Stability and Convergence of the Population ...
In Artificial Intelligence (AI), an evolutionary algorithm (EA) is a subset of evolutionary .... online at http://www.icsi.berkeley.edu/~storn/code.html are listed below:.

On Stability and Convergence of the Population ...
1 Department of Electronics and Telecommunication Engineering. Jadavpur University, Kolkata, India. 2 Norwegian University of Science and Technology, Norway ... performance of EAs and to propose new algorithms [9] because the solution ...