Local Volatility Function Approximation Using Reconstructed Radial Basis Function Networks

Bo-Hyun Kim, Daewon Lee, and Jaewook Lee

Department of Industrial and Management Engineering, Pohang University of Science and Technology, Pohang, Kyungbuk 790-784, Korea
[email protected]

Abstract. Modelling the volatility smile is very important in financial practice for pricing and hedging derivatives. In this paper, a novel learning method is proposed to approximate a local volatility function from a finite set of market data. The proposed method trains an RBF network on a small set of volatility data and finds an optimized network by minimizing the option pricing error. Numerical experiments on S&P 500 call option market data illustrate the local volatility surface estimated by the method.

1 Introduction

Volatility is known as one of the most important market variables in financial practice as well as in financial theory, although it is not directly observable in the market. The celebrated Black-Scholes (BS) model, which assumes constant volatility, has been widely used to estimate volatility (often called implied volatility, since the volatility is calculated by inverting the Black-Scholes formula with option price data given by the market) [3, 14]. If the assumption of the BS model were reasonable, the implied volatility would be the same for all option market prices. In reality, however, it has been observed that the implied volatility shows strong dependence on strike price and time to maturity. This dependence, called the volatility smile, cannot be captured in the BS model and results in its failure to produce an appropriate volatility for the corresponding option [17, 18].

One practical solution to the volatility smile is the constant implied volatility approach, which simply uses different volatilities for options with different strikes and maturities. In other words, if we had options with the whole range of strikes and maturities, we could simply calculate the implied volatility for each pair of strike and maturity by inverting the BS formula for each option. Although this works well for pricing simple European options, it cannot provide appropriate implied volatilities for pricing more complicated options such as exotic or American options. Moreover, this approach can produce incorrect hedge factors such as Gamma, Vega, and Delta, even for simple options [8, 12]. During the last decade, a number of studies based on different types of models have been conducted to model the volatility smile [15, 8, 1, 2].

One successful type of model is the 1-factor continuous diffusion model. This model, which describes volatility as a function of strike (or stock) price and maturity (called a local volatility function), turns out to be a complete model, since it excludes non-traded sources of risk, and allows for arbitrage pricing and hedging [5]. When the underlying asset follows this model, one important task is to approximate the local volatility function accurately in order to price exotic options and compute correct hedge factors [4]. Estimating a local volatility function is, however, a nontrivial task, since it is generally an ill-posed problem due to insufficient market option price data.

In this paper, we propose a novel learning method to approximate a local volatility function from a finite market data set for pricing and hedging derivatives. The proposed method consists of two phases. In the first phase, we train a preparatory radial basis function (RBF) network on the available volatility data to estimate the local volatility surface and then obtain estimated volatilities for the whole range of strikes and maturities. In the second phase, we reconstruct an optimized RBF network by adjusting the volatility values of the preparatory RBF network to minimize the error between estimated option prices and real market option prices.

The organization of this paper is as follows. In Section 2, we propose a learning method to model a volatility smile. Computational simulations on S&P 500 index options are presented in Section 3. Section 4 concludes.

J. Wang et al. (Eds.): ISNN 2006, LNCS 3973, pp. 524–530, 2006. © Springer-Verlag Berlin Heidelberg 2006
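As a concrete illustration of the implied-volatility inversion discussed above, the sketch below computes a Black-Scholes call price (with a continuous dividend yield) and backs the volatility out again by bisection. The numerical inputs are illustrative assumptions, not values from the paper.

```python
import math

# Black-Scholes call price with continuous dividend yield q, and
# implied volatility recovered by bisection. Illustrative sketch only;
# in practice the input price would come from the market.

def norm_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bs_call(s, k, t, r, q, sigma):
    d1 = (math.log(s / k) + (r - q + 0.5 * sigma ** 2) * t) / (sigma * math.sqrt(t))
    d2 = d1 - sigma * math.sqrt(t)
    return s * math.exp(-q * t) * norm_cdf(d1) - k * math.exp(-r * t) * norm_cdf(d2)

def implied_vol(price, s, k, t, r, q, lo=1e-4, hi=5.0, tol=1e-8):
    # bisection works because the BS price is monotone in sigma
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if bs_call(s, k, t, r, q, mid) < price:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

price = bs_call(590.0, 590.0, 1.0, 0.06, 0.0262, 0.20)
vol = implied_vol(price, 590.0, 590.0, 1.0, 0.06, 0.0262)
```

Inverting the formula option by option in this way is exactly the constant implied volatility approach criticized above: each option gets its own number, with no single consistent volatility function behind them.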

2 The Proposed Method

The 1-factor continuous diffusion model assumes that the underlying asset follows the process below with initial value Sinit:

    dS_t / S_t = μ(S_t, t) dt + σ(S_t, t) dW_t,    t ∈ [0, τ], τ > 0        (1)

where τ is a fixed time horizon, W_t is a standard Brownian motion, and μ(s, t), σ(s, t) : R+ × [0, τ] → R are deterministic functions sufficiently well behaved to guarantee that (1) has a unique solution [9]. Here σ(s, t) is the local volatility function. When we estimate a local volatility function from a finite data set, we can avoid over-fitting by regularizing with some measure of smoothness of the local volatility function. An RBF network is a well-known method capable of solving ill-posed problems with regularization [7]. In this section, we propose a method to approximate the local volatility function using RBF networks when the underlying asset follows the 1-factor model (1). The local volatility function σ(s, t) will be explicitly represented by a reconstructed RBF network.
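For intuition, a sample path of model (1) can be simulated with a simple Euler-Maruyama discretization. The smile-shaped local volatility function below is a purely illustrative assumption, not the one estimated in this paper.

```python
import numpy as np

# Euler-Maruyama simulation of dS_t/S_t = mu dt + sigma(S_t, t) dW_t.
# The local volatility function here is an illustrative placeholder.

def local_vol(s, t, s0=590.0):
    # a simple smile: volatility rises away from the initial level
    return 0.15 + 0.5 * np.log(s / s0) ** 2

def simulate_path(s0=590.0, mu=0.06, tau=1.0, n_steps=252, seed=0):
    rng = np.random.default_rng(seed)
    dt = tau / n_steps
    s = np.empty(n_steps + 1)
    s[0] = s0
    for i in range(n_steps):
        dw = rng.normal(scale=np.sqrt(dt))
        s[i + 1] = s[i] * (1.0 + mu * dt + local_vol(s[i], i * dt) * dw)
    return s

path = simulate_path()
```

Pricing exotic options under an estimated σ(s, t) typically proceeds by averaging payoffs over many such simulated paths (or by solving the corresponding PDE).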

2.1 Phase I: Initial Local Volatility Function Approximation Using RBF Networks

In the first phase, we initially approximate the local volatility function using an RBF network constructed from a given training volatility data set. Each training input has two attributes, strike price and time to maturity, (K_j^tr, T_j^tr), j = 1, ..., l, and each training output is the corresponding local volatility value σ_j. An RBF network searches for a suboptimal solution in a lower-dimensional space that approximates the interpolation solution, where the approximated solution σ̂_RBF(w) can be expressed as

    σ̂_RBF(w; K, T) = Σ_{j=1}^{l} w_j φ( ‖(K, T) − (K_j^tr, T_j^tr)‖ / η )        (2)

where each φ = φ((K, T), (K_j^tr, T_j^tr)) is a radial basis function centered at (K_j^tr, T_j^tr) and η is a user-specified scale parameter. Training the RBF network amounts to estimating the weights that connect the hidden and output layers, and these weights are estimated directly by the least squares algorithm [7, 6, 11]. To determine the optimal network weights from a training data set {K_j^tr, T_j^tr, σ_j}_{j=1}^{l}, we fit an initial local volatility surface by minimizing the following criterion function with a weight-decay regularization term:

    J(w) = Σ_{j=1}^{l} (σ_j − σ̂_RBF(w; K_j^tr, T_j^tr))² + λ‖w‖²        (3)

where λ is a regularization parameter introduced to avoid over-fitting. The network weights are then given explicitly by

    w = (Φ + λI)^{−1} σ        (4)

where Φ = [φ((K_i^tr, T_i^tr), (K_j^tr, T_j^tr))]_{i,j=1,...,l} and σ = (σ_1, ..., σ_l)^T. Substituting Eq. (4) into Eq. (2) allows us to rewrite σ̂_RBF(w; K, T) as σ̂_RBF(σ; K, T). At this stage we do not use the training volatility information σ, so that the estimated option prices can later be matched to the market option prices as closely as possible. Instead, we randomly generate the initial volatility vector σ^(0) ∈ R^l in Eq. (4) with nonnegative values. For this reason, we call the RBF network obtained in the first phase a preparatory RBF network.
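The Phase I fit of Eqs. (2)-(4) can be sketched numerically as follows. A Gaussian radial basis function is assumed here (the paper leaves the choice of φ open), and the training grid and volatility values are made up for illustration.

```python
import numpy as np

# Phase I sketch: build the matrix Phi of Eq. (4) with a Gaussian
# basis phi(d) = exp(-d^2) (an assumption; the paper does not fix phi),
# solve for the weights, and evaluate Eq. (2).

def phi(d):
    return np.exp(-d ** 2)

def design_matrix(centers, eta):
    # Phi[i, j] = phi(||c_i - c_j|| / eta)
    d = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=-1)
    return phi(d / eta)

def fit_weights(centers, sigma, eta, lam):
    # w = (Phi + lambda * I)^{-1} sigma          -- Eq. (4)
    Phi = design_matrix(centers, eta)
    return np.linalg.solve(Phi + lam * np.eye(len(sigma)), sigma)

def sigma_hat(x, centers, w, eta):
    # sigma_hat(w; K, T) = sum_j w_j phi(...)    -- Eq. (2)
    d = np.linalg.norm(x - centers, axis=-1)
    return phi(d / eta) @ w

# made-up (strike, maturity) training grid and volatility outputs
centers = np.array([[0.9, 0.5], [1.0, 0.5], [1.1, 0.5], [1.0, 1.0]])
vols = np.array([0.25, 0.20, 0.22, 0.21])
w = fit_weights(centers, vols, eta=0.2, lam=1e-3)
fitted = np.array([sigma_hat(c, centers, w, eta=0.2) for c in centers])
```

With a small λ the fitted surface nearly interpolates the training volatilities; larger λ trades fit for smoothness, which is what keeps the ill-posed estimation problem stable.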

2.2 Reconstructing RBF Networks Via Pricing Error Minimization

To obtain a better local volatility function that minimizes the option pricing error, in the second phase we reconstruct the RBF network by iteratively updating the current volatility vector σ^(0) to a better one. Let p_i be the i-th option market price with strike price and maturity (K_i, T_i), for i = 1, ..., m. Note that the set {(K_i, T_i)}_{i=1}^{m}, each with its corresponding option price p_i, is different from the training input data set {(K_j^tr, T_j^tr)}_{j=1}^{l}. Also let P_i be the i-th option pricing formula (a highly nonlinear function of its inputs) with strike price, maturity, and volatility given by K_i, T_i, and σ̂_RBF, respectively. Utilizing this information, Phase II finds an optimal volatility vector σ* = (σ_1*, ..., σ_l*) to reconstruct a final RBF network by solving the following nonlinear optimization problem:

    min_σ E(σ) = Σ_{i=1}^{m} {p_i − P_i[K_i, T_i, σ̂_RBF(σ; K_i, T_i)]}² + α Σ_{j=1}^{l} (σ_j^tr − σ̂_RBF(σ; K_j^tr, T_j^tr))²        (5)

where σ_j^tr is the volatility output at training input (K_j^tr, T_j^tr) and α is a user-controllable compensation parameter. In this paper, we call the RBF network corresponding to this optimal volatility σ* a reconstructed RBF network. To minimize Eq. (5) efficiently, we employ a trust region algorithm, described as follows. For the volatility vector σ(n) at iteration n, the quadratic approximation Ê is defined by the first two terms of the Taylor expansion of E at σ(n):

    Ê(s) = E(σ(n)) + g(n)^T s + (1/2) s^T H(n) s        (6)

where g(n) is the local gradient vector and H(n) is the local Hessian matrix. A trial step s(n) is then computed by minimizing (or approximately minimizing) the trust-region subproblem

    min_s Ê(s)  subject to  ‖s‖₂ ≤ Δ_n        (7)

where Δ_n > 0 is a trust-region parameter. According to the agreement between the predicted and actual reduction in the function E, as measured by the ratio

    ρ_n = [E(σ(n)) − E(σ(n) + s(n))] / [Ê(0) − Ê(s(n))],        (8)

Δ_n is adjusted between iterations as follows:

    Δ_{n+1} = ‖s(n)‖₂ / 4   if ρ_n < 0.25
              2Δ_n           if ρ_n > 0.75 and Δ_n = ‖s(n)‖₂        (9)
              Δ_n            otherwise

The decision to accept the step is then given by

    σ(n + 1) = σ(n) + s(n)   if ρ_n ≥ 0
               σ(n)           otherwise        (10)

which means that the current volatility vector is updated to σ(n) + s(n) if E(σ(n) + s(n)) < E(σ(n)); otherwise, it remains unchanged, the trust-region parameter Δ_n is shrunk, and the trial step computation is repeated. One nice property of the trust region method is that it retains the rapid convergence rate of Newton's method while being generally applicable and globally convergent [10, 13, 16].
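The update rules (8)-(10) can be sketched in code. The Cauchy-point step below is one simple way to approximately solve subproblem (7) (the paper does not specify its subproblem solver), and the Rosenbrock test function stands in for the pricing error E(σ).

```python
import numpy as np

# Sketch of the trust-region rules (6)-(10); the Cauchy-point step is
# an assumed subproblem solver, and f plays the role of E(sigma).

def f(x):
    return 100.0 * (x[1] - x[0] ** 2) ** 2 + (1.0 - x[0]) ** 2

def grad(x):
    return np.array([
        -400.0 * x[0] * (x[1] - x[0] ** 2) - 2.0 * (1.0 - x[0]),
        200.0 * (x[1] - x[0] ** 2),
    ])

def hess(x):
    return np.array([
        [1200.0 * x[0] ** 2 - 400.0 * x[1] + 2.0, -400.0 * x[0]],
        [-400.0 * x[0], 200.0],
    ])

def cauchy_step(g, H, delta):
    # minimize the quadratic model (6) along -g inside the ball (7)
    gnorm = np.linalg.norm(g)
    gHg = g @ H @ g
    tau = 1.0 if gHg <= 0 else min(1.0, gnorm ** 3 / (delta * gHg))
    return -(tau * delta / gnorm) * g

def trust_region(x, delta=1.0, n_iter=200):
    for _ in range(n_iter):
        g, H = grad(x), hess(x)
        if np.linalg.norm(g) < 1e-12:
            break
        s = cauchy_step(g, H, delta)
        pred = -(g @ s + 0.5 * s @ H @ s)       # Ehat(0) - Ehat(s)
        rho = (f(x) - f(x + s)) / pred          # ratio (8)
        if rho < 0.25:                          # radius update (9)
            delta = np.linalg.norm(s) / 4.0
        elif rho > 0.75 and np.isclose(np.linalg.norm(s), delta):
            delta = 2.0 * delta
        if rho >= 0:                            # acceptance rule (10)
            x = x + s
    return x

x_star = trust_region(np.array([-1.2, 1.0]))
```

Because steps are accepted only when ρ_n ≥ 0, the objective is monotonically non-increasing, which is the mechanism behind the global convergence property noted above.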

3 Computational Examples

We use the S&P 500 Index European call option data of October 1995, which is also used in [1, 4]. The market option price data are given in Table 1. Only options with no more than two years to maturity are used, for accuracy. The initial index, interest rate, and dividend rate are set as follows:

    Sinit = $590,  r = 0.06,  q = 0.0262

In Phase I, we used the following 16 training volatility input data to train a preparatory RBF network:

    (K_j^tr, T_j^tr) = (K(p), T(q)),  j = 4(p − 1) + q,  p, q = 1, ..., 4
    K = [0.8000 Sinit, 0.9320 Sinit, 1.0640 Sinit, 1.3940 Sinit]
    T = [0, 0.66, 1.32, 1.98]

In Phase II, we used the 70 S&P 500 Index option price data given in Table 1.

Table 1. S&P 500 Index call option prices (strike as % of spot price)

Maturity    85%     90%     95%     100%    105%     110%     115%     120%     130%     140%
.175        9.13e1  6.28e1  3.52e1  1.29e1  2.11     1.21e−1  3.73e−2  1.62e−2  1.65e−3  5.14e−4
.425        9.63e1  6.91e1  4.40e1  2.33e1  8.54     2.26     4.21e−1  1.94e−1  2.77e−2  8.72e−3
.695        1.02e2  7.61e1  5.26e1  3.26e1  1.64e1   5.95     1.90     6.04e−1  7.25e−2  2.66e−2
.94         1.07e2  8.22e1  5.99e1  3.99e1  2.38e1   1.13e1   4.71     1.78     1.82e−1  4.48e−2
1           1.08e2  8.36e1  6.16e1  4.16e1  2.54e1   1.28e1   5.50     2.13     2.27e−1  5.44e−2
1.5         1.17e2  9.44e1  7.31e1  5.40e1  3.73e1   2.37e1   1.43e1   7.65     1.85     3.10e−1
2           1.26e2  1.04e2  8.36e1  6.49e1  4.82e1   3.42e1   2.36e1   1.47e1   5.65     1.78

The proposed method is compared with the popularly used natural cubic spline method (cf. [4]). The spline approach normally uses the same amount of training data (with a different set of strike prices and maturities) for its knot points as there are option market data, in order to achieve reasonable performance. The spline function is then reconstructed from the 70 option market data by minimizing the option pricing error with respect to 70 volatility input variables. The criteria for comparison are the pricing error (MSE; mean squared error) and the hedging error between the predicted volatility and the true implied volatility. Table 2 demonstrates the main feature of our proposed method: it achieves better or comparable performance to the spline approach in pricing and hedging derivatives with far fewer input variables (16 in our method versus 70 in the spline approach). Fig. 1 shows the estimated volatility surfaces generated by the spline approach and the proposed method. The proposed method shows the more skewed volatility smile effects observed in practice.


Table 2. Accuracy of Pricing and Hedging

            Measure   Reconstructed Spline   Reconstructed RBF
Pricing     Price     0.0317                 0.0284
Hedging     Vega      7.1153                 5.8509
            Delta     1.1535e−5              1.0255e−5
            Gamma     1.3316e−8              1.0839e−8
            Rho       1.5961                 1.6449
            Theta     0.2935                 0.3129

Fig. 1. Estimated volatility surfaces: (a) spline approach and (b) proposed method.

4 Conclusion

In this paper, we have proposed a novel learning method to approximate the local volatility function. The proposed method first trains a preparatory RBF network with a training volatility data set to estimate the volatility surface, and then reconstructs an optimized RBF network by minimizing the errors between estimated prices and real market prices. A simulation has been conducted on an S&P 500 option data example. The experimental results demonstrate a performance improvement of the proposed method over the spline approach in terms of pricing and hedging error.

Acknowledgement. This work was supported by the Korea Research Foundation under grant number KRF-2004-041-D00785.

References

1. Andersen, L.B.G., Brotherton-Ratcliffe, R.: The Equity Option Volatility Smile: An Implicit Finite Difference Approach. Journal of Computational Finance 1(2) (1997)
2. Avellaneda, M., Friedman, M., Holmes, C., Samperi, D.: Calibrating Volatility Surfaces via Relative Entropy Minimization. Applied Mathematical Finance 4 (1997) 37-64


3. Black, F., Scholes, M.: The Pricing of Options and Corporate Liabilities. Journal of Political Economy 81 (1973) 637-659
4. Coleman, T.F., Li, Y., Verma, A.: Reconstructing the Unknown Local Volatility Function. The Journal of Computational Finance 2(3) (1999)
5. Dupire, B.: Pricing with a Smile. Risk 7 (1994) 18-20
6. Han, G.S., Lee, D., Lee, J.: Estimating the Yield Curve Using Calibrated Radial Basis Function Networks. Lecture Notes in Computer Science 3497 (2005) 885-890
7. Haykin, S.: Neural Networks: A Comprehensive Foundation. Prentice-Hall, New York (1999)
8. Hull, J., White, A.: The Pricing of Options on Assets with Stochastic Volatilities. Journal of Finance 3 (1987) 281-300
9. Lamberton, D., Lapeyre, B.: Introduction to Stochastic Calculus Applied to Finance. Chapman & Hall (1996)
10. Lee, J.: Attractor-Based Trust-Region Algorithm for Efficient Training of Multilayer Perceptrons. Electronics Letters 39 (2003) 71-72
11. Lee, D., Lee, J.: A Novel Three-Phase Algorithm for RBF Neural Network Center Selection. Lecture Notes in Computer Science 3173 (2005) 350-355
12. Lee, H.S., Lee, J., Yoon, Y.G., Kim, S.: Coherent Risk Measure Using Feedforward Neural Networks. Lecture Notes in Computer Science 3497 (2005) 904-909
13. Lee, J., Chiang, H.D.: A Dynamical Trajectory-Based Methodology for Systematically Computing Multiple Optimal Solutions of General Nonlinear Programming Problems. IEEE Trans. on Automatic Control 49(6) (2004) 888-899
14. Merton, R.: The Theory of Rational Option Pricing. Bell Journal of Economics and Management Science 4 (1973) 141-183
15. Merton, R.: Option Pricing when Underlying Stock Returns Are Discontinuous. Journal of Financial Economics 3 (1976) 124-144
16. Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (1999)
17. Rubinstein, M.: Implied Binomial Trees. The Journal of Finance 49 (1994) 771-818
18. Shimko, D.: Bounds of Probability. Risk (1993) 33-37
