# Vanna–Volga pricing

The Vanna–Volga method is a mathematical tool used in finance. It is a technique for pricing first-generation exotic options in foreign exchange market (FX) derivatives.

It consists of adjusting the Black–Scholes theoretical value (BSTV) by the cost of a portfolio which hedges three main risks associated to the volatility of the option: the Vega $\mathcal{V}$, the Vanna and the Volga. The Vanna is the sensitivity of the Vega with respect to a change in the spot FX rate:

$\textrm{Vanna} = \frac{\partial \mathcal{V}}{\partial S}$.

Similarly, the Volga is the sensitivity of the Vega with respect to a change of the implied volatility $\sigma$:

$\textrm{Volga}= \frac{\partial \mathcal{V}}{\partial \sigma}$.

If we consider a smile volatility term structure $\sigma(K)$ with ATM strike $K_0$, ATM volatility $\sigma_0$, and where $K_{c/p}$ are the 25-Delta call/put strikes (obtained by solving the equations $\Delta_{ call}(K_c,\sigma_0)=1/4$ and $\Delta_{ put}(K_p,\sigma_0)=-1/4$ where $\Delta_{call/put}(K,\sigma)$ denotes the Black–Scholes Delta sensitivity) then the hedging portfolio will be composed of the at-the-money (ATM), risk-reversal (RR) and butterfly (BF) strategies:

\begin{align} \textrm{ATM}(K_0) &= \frac12 \left(\textrm{Call}(K_0,\sigma_0) + \textrm{Put}(K_0,\sigma_0)\right) \\ \textrm{RR}(K_c,K_p) &= \frac{}{}\textrm{Call}(K_c,\sigma(K_c))-\textrm{Put}(K_p,\sigma(K_p)) \\ \textrm{BF}(K_c,K_p) &= \frac12 \left(\textrm{Call}(K_c,\sigma(K_c)) + \textrm{Put}(K_p,\sigma(K_p))\right)- \textrm{ATM}(K_0) \end{align}

with $\textrm{Call}(K,\sigma)$ the Black–Scholes price of a call option (similarly for the put).

The simplest formulation of the Vanna–Volga method suggests that the Vanna–Volga price $X^{VV}$ of an exotic instrument $X$ is given by

$X^{\rm VV} = X ^{BS} + \underbrace{\frac{\textrm{X}_{vanna}}{\textrm{RR}_{vanna}}}_{w_{RR}} {RR}_{cost} + \underbrace{\frac{\textrm{X}_{volga}}{\textrm{BF}_{volga}}}_{w_{ BF}} {BF}_{cost}$

where by $X^{ BS}$ denotes the Black–Scholes price of the exotic and the Greeks are calculated with ATM volatility and

\begin{align} RR_{cost} &=\left[ \textrm{Call}(K_c,\sigma(K_c))-\textrm{Put}(K_p,\sigma(K_p)) \right] - \left[ \textrm{Call}(K_c,\sigma_0)-\textrm{Put}(K_p,\sigma_0) \right] \\ BF_{cost} &= \frac12 \left[ \textrm{Call}(K_c,\sigma(K_c))+\textrm{Put}(K_p,\sigma(K_p)) \right] - \frac12 \left[ \textrm{Call}(K_c,\sigma_0)+\textrm{Put}(K_p,\sigma_0) \right] \end{align}

These quantities represent a smile cost, namely the difference between the price computed with/without including the smile effect.

The rationale behind the above formulation of the Vanna-Volga price is that one can extract the smile cost of an exotic option by measuring the smile cost of a portfolio designed to hedge its Vanna and Volga risks. The reason why one chooses the strategies BF and RR to do this is because they are liquid FX instruments and they carry mainly Volga, and respectively Vanna risks. The weighting factors $w_{RR}$ and $w_{BF}$ represent respectively the amount of RR needed to replicate the option's Vanna, and the amount of BF needed to replicate the option's Volga. The above approach ignores the small (but non-zero) fraction of Volga carried by the RR and the small fraction of Vanna carried by the BF. It further neglects the cost of hedging the Vega risk. This has led to a more general formulation of the Vanna-Volga method in which one considers that within the Black–Scholes assumptions the exotic option's Vega, Vanna and Volga can be replicated by the weighted sum of three instruments:

$X_i = w_{ ATM}\, { ATM_i} + w_{ RR}\, {RR_i} + w_{BF}\, {BF_i} \, \, \, \, \, i\text{=vega, vanna, volga}$

where the weightings are obtained by solving the system: $\vec{x} = \mathbb{A} \vec{w}$

with

$\mathbb{A} = \begin{pmatrix} ATM_{vega} & RR_{vega} & BF_{vega} \\ ATM_{vanna} & RR_{vanna} & BF_{vanna} \\ ATM_{volga} & RR_{volga} & BF_{volga} \end{pmatrix}$, $\vec{w}= \begin{pmatrix} w_{ATM} \\ w_{RR} \\ w_{BF} \end{pmatrix}$, $\vec{x}= \begin{pmatrix} X_{vega} \\ X_{vanna} \\ X_{volga} \end{pmatrix}$

Given this replication, the Vanna–Volga method adjusts the BS price of an exotic option by the smile cost of the above weighted sum (note that the ATM smile cost is zero by construction):

\begin{align} X^{\rm VV} &= X^{BS} + w_{RR} ({RR}^{mkt}-{RR}^{BS}) + w_{BF} ({BF}^{mkt}-{BF}^{BS}) \\ &= X ^{BS} + \vec{x}^T(\mathbb{A}^T)^{-1}\vec{I} \\ & = X ^{BS} + X_{vega} \, \Omega_{vega}+ X_{vanna} \, \Omega_{vanna} + X_{volga} \, \Omega_{volga} \\ \end{align}

where

$\vec{I} = \begin{pmatrix} 0 \\ {RR}^{mkt} - {RR}^{BS}\\ {BF}^{mkt} - {BF}^{BS} \end{pmatrix}$

and

$\begin{pmatrix} \Omega_{vega} \\ \Omega_{vanna} \\ \Omega_{volga} \end{pmatrix} = (\mathbb{A}^T)^{-1}\vec{I}$

The quantities $\Omega_i$ can be interpreted as the market prices attached to a unit amount of Vega, Vanna and Volga, respectively. The resulting correction, however, typically turns out to be too large. Market practitioners thus modify $X^{VV}$ to

\begin{align} X^{\rm VV} &= X ^{BS} + p_{vanna} X_{vanna} \Omega_{vanna} + p_{volga} X_{volga} \Omega_{volga} \end{align}

The Vega contribution turns out to be several orders of magnitude smaller than the Vanna and Volga terms in all practical situations, hence one neglects it.

The terms $p_{vanna}$ and $p_{ volga}$ are put in by-hand and represent factors that ensure the correct behaviour of the price of an exotic option near a barrier: as the knock-out barrier level $B$ of an option is gradually moved toward the spot level $S_0$, the BSTV price of a knock-out option must be a monotonically decreasing function, converging to zero exactly at $B=S_0$. Since the Vanna-Volga method is a simple rule-of-thumb and not a rigorous model, there is no guarantee that this will be a priori the case. The attenuation factors are of a different from for the Vanna or the Volga of an instrument. This is because for barrier values close to the spot they behave differently: the Vanna becomes large while, on the contrary, the Volga becomes small. Hence the attenuation factors take the form:

\begin{align} p_{\rm vanna} &= a \, \gamma \\ p_{\rm volga} &= b + c \gamma \end{align}

where $\gamma\in[0,1]$ represents some measure of the barrier(s) vicinity to the spot with the features

\begin{align} \gamma=0 \ \ &{for}\ \ S_0\to B \\ \gamma=1 \ \ &{for}\ \ |S_0-B|\gg 0 \end{align}

The coefficients $a,b,c$ are found through calibration of the model to ensure that it reproduces the vanilla smile. Good candidates for $\gamma$ that ensure the appropriate behaviour close to the barriers are the survival probability and the expected first exit time. Both of these quantities offer the desirable property that they vanish close to a barrier.

## Survival probability

The survival probability $p_{surv}\in[0,1]$ refers to the probability that the spot does not touch one or more barrier levels $\{B_i\}$. For example, for a single barrier option we have

$p_{surv} = \mathbb{E}[ 1_{S_t

where $\mathrm{NT}(B)$ is the value of a no-touch option and $\mathrm{DF}(t_{\textrm{tod}},t_{\textrm{mat}})$ the discount factor between today and maturity. Similarly, for options with two barriers the survival probability is given through the undiscounted value of a double-no-touch option.

## First-exit time

The first exit time (FET) is the minimum between: (i) the time in the future when the spot is expected to exit a barrier zone before maturity, and (ii) maturity, if the spot has not hit any of the barrier levels up to maturity. That is, if we denote the FET by $u(S_t,t)$ then $u(S_t,t)=$min$\{\phi,T\}$ where $\phi=\textrm{inf}\{\ell\in [0,T)\}$ such that $S_{t+\ell} > H$ or $S_{t+\ell} where $L,H$ are the 'low' vs 'high' barrier levels and $S_t$ the spot of today.

The first-exit time is the solution of the following PDE

$\frac{\partial u(S,t) }{\partial t} + \frac12\sigma^2 S^2 \frac{\partial^2 u(S,t) }{\partial S^2} + \mu S \frac{\partial u(S,t) }{\partial S} =0$

This equation is solved backwards in time starting from the terminal condition $u(S,T) = T$ where $T$ is the time to maturity and boundary conditions $u(L,t')=u(H,t')=t'$. In case of a single barrier option we use the same PDE with either $H\gg S_0$ or $L\ll S_0$. The parameter $\mu$ represents the risk-neutral drift of the underlying stochastic process.