# Busemann function

In geometric topology, Busemann functions are used to study the large-scale geometry of geodesics in Hadamard spaces and in particular Hadamard manifolds (simply connected complete Riemannian manifolds of nonpositive curvature). They are named after Herbert Busemann, who introduced them; he gave an extensive treatment of the topic in his 1955 book "The geometry of geodesics".

## Definition and elementary properties

Let ${\displaystyle (X,d)}$ be a metric space. A geodesic ray is a path ${\displaystyle \gamma :[0,\infty )\to X}$ which minimizes distance everywhere along its length. i.e., for all ${\displaystyle t,t'\in [0,\infty )}$,

${\displaystyle d{\big (}\gamma (t),\gamma (t'){\big )}={\big |}t-t'{\big |}}$.

Equivalently, a ray is an isometry from the "canonical ray" (the set ${\displaystyle [0,\infty )}$ equipped with the Euclidean metric) into the metric space X.

Given a ray γ, the Busemann function ${\displaystyle B_{\gamma }:X\to \mathbb {R} }$ is defined by

${\displaystyle B_{\gamma }(x)=\lim _{t\to \infty }{\big (}d{\big (}\gamma (t),x{\big )}-t{\big )}}$

Thus, when t is very large, the distance ${\displaystyle d{\big (}\gamma (t),x{\big )}}$ is approximately equal to ${\displaystyle t-B_{\gamma }(x)}$. Given a ray γ, its Busemann function is always well-defined: indeed the right hand side Ft(x) above tends pointwise to the left hand side on compacta, since ${\displaystyle t-d(\gamma (t),x)=d(\gamma (t),\gamma (0))-d(\gamma (t),x)}$ is bounded above by ${\displaystyle d(\gamma (0),x)}$ and increasing since, if ${\displaystyle s\leq t}$,

${\displaystyle t-s+d(x,\gamma (s))-d(x,\gamma (t))\geq t-s-d(\gamma (s),\gamma (t))=0.}$

It is immediate from the triangle inequality that

${\displaystyle |B_{\gamma }(x)-B_{\gamma }(y)|\leq d(x,y),}$

so that ${\displaystyle B_{\gamma }}$ is uniformly continuous. More specifically, the above estimate above shows that

By Dini's theorem, the functions ${\displaystyle F_{t}(x)=d(x,\gamma (t))-t}$ tend to ${\displaystyle B_{\gamma }(x)}$ uniformly on compacta as t tends to infinity.

## Example: Poincaré disk

Let D be the unit disk in the complex plane with the Poincaré metric

${\displaystyle ds^{2}={4\,|dz|^{2} \over (1-|z|^{2})^{2}}.}$

Then, for |z| < 1 and |ζ| = 1, the Busemann function is given by

${\displaystyle B_{\zeta }(z)=-\log \,\left({1-|z|^{2} \over |z-\zeta |^{2}}\right),}$

where the term in brackets on the right hand side is the Poisson kernel for the unit disk and ζ corresponds to the radial geodesic γ from the origin towards ζ, γ(t) = ζ tanh(t/2). In fact the computation of d(x,y) can be reduced to that of d(z,0) = d(|z|,0) = tanh−1 |z| = log (1+|z|)/(1-|z|), since the metric is invariant under Möbius transformations in SU(1,1); the geodesics through 0 have the form ζ gt(0) where gt is the 1-parameter subgroup of SU(1,1)

${\displaystyle g_{t}={\begin{pmatrix}\cosh t/2&\sinh t/2\\\sinh t/2&\cosh t/2\end{pmatrix}}.}$

The formula above also completely determines the Busemann function by Möbius invariance. Note that

${\displaystyle {1-|z|^{2} \over |z-\zeta |^{2}}\leq {1-|z|^{2} \over (1-|z|)^{2}}=1-\left({|z| \over 1-|z|}\right)^{2}\leq 1,}$

so that the Busemann function in this case is non-negative.[2]

## Busemann functions on a Hadamard space

In a Hadamard space, where any two points are joined by a unique geodesic segment, the function F = Ft is convex, i.e. convex on geodesic segments [x,y]. Explicitly this means that if z(s) is the point which divides [x,y] in the ratio s : (1 − s), then F(z(s)) ≤ s F(x) + (1 − s) F(y). In fact for fixed a the function d(x,a) is convex and hence so are its translates; in particular, if γ is a geodesic ray in X, then Ft is convex. Since the Busemann function Bγ is the pointwise limit of Ft,

• Busemann functions are convex on Hadamard spaces.[3]
• On a Hadamard space, the functions ${\displaystyle F_{t}(y)=d(y,\gamma (t))-t}$ converge uniformly to ${\displaystyle B_{\gamma }}$ uniformly on any bounded subset of X.[4][5]

In fact let h(t) = d(y,γ(t)) − t = Ft(y). Since γ(t) is parametrised by arclength, Alexandrov's first comparison theorem for Hadamard spaces implies that the function g(t) = d(y,γ(t))2t2 is convex. Hence for 0< s < t

${\displaystyle g(s)\leq (1-{s \over t})g(0)+{s \over t}g(t).}$

Thus

${\displaystyle 2sh(s)\leq (h(s)+s)^{2}-s^{2}=g(s)\leq (1-{s \over t})d(x,y)^{2}+{s \over t}(2th(t)+h(t)^{2})\leq d(x,y)^{2}+2sh(t)+{s \over t}d(x,y)^{2},}$

so that

${\displaystyle |F_{s}(y)-F_{t}(y)|=|h(s)-h(t)|\leq {1 \over 2}(s^{-1}+t^{-1})d(x,y)^{2}.}$

Letting t tend to ∞, it follows that

${\displaystyle |F_{s}(y)-B_{\gamma }(y)|\leq {d(x,y)^{2} \over 2s},}$

so convergence is uniform on bounded sets.

Note that the inequality above for ${\displaystyle |F_{s}(y)-F_{t}(y)|}$ (together with its proof) also holds for geodesic segments: if Γ(t) is a geodesic segment starting at x and parametrised by arclength then

${\displaystyle |d(y,\Gamma (s))-s-d(y,\Gamma (t))+t|\leq (s^{-1}+t^{-1})d(x,y)^{2}.}$

Next suppose that x, y are points in a Hadamard space, and let δ(s) be the geodesic through x with δ(0) = y and δ(t) = x, where t = d(x,y). This geodesic cuts the boundary of the closed ball B(y,r) at the point δ(r). Thus if d(x,y) > r, there is a point v with d(y,v) = r such that d(x,v) = d(x,y) − r.

This condition persists for Busemann functions. The statement and proof of the property for Busemann functions relies on a fundamental theorem on closed convex subsets of a Hadamard space, which generalises orthogonal projection in a Hilbert space: if C is a closed convex set in a Hadamard space X, then every point x in X has a unique closest point P(x) ≡ PC(x) in C and d(P(x),P(y)) ≤ d(x,y); moreover a = P(x) is uniquely determined by the property that, for y in C,

${\displaystyle d(x,y)^{2}\geq d(x,a)^{2}+d(a,y)^{2},}$

so that the angle at a in the Euclidean comparison triangle for a,x,y is greater than or equal to π/2.

• If h is a Busemann function on a Hadamard space, then, given y in X and r > 0, there is a unique point v with d(y,v) = r such that h(v) = h(y) − r. For fixed r > 0, the point v is the closest point of y to the closed convex C set of points u such that h(u) ≤ h(y) − r and therefore depends continuously on y.[6]

In fact let v be the closest point to y in C. Then h(v) = h(y) − r and so h is minimised by v in B(y,R) where R = d(y,v) and v is the unique point where h is minimised. By the Lipschitz condition r = |h(y) − h(v)| ≤ R. To prove the assertion, it suffices to show that R = r, i.e. d(y,v) = r. On the other hand, h is the uniform limit on any closed ball of functions hn. On B(y,r), these are minimised by points vn with hn(vn) = hn(y) − r. Hence the infimum of h on B(y,r) is h(y) − r and h(vn) tends to h(y) − r. Thus h(vn) = h(y) − rn with rnr and rn tending towards r. Let un be the closest point to y with h(un) ≤ h(y) − rn. Let Rn = d(y,un) ≤ r. Then h(un) = h(y) − rn, and, by the Lipschitz condition on h, Rnrn. In particular Rn tends to r. Passing to a subsequence if necessary it can be assumed that rn and Rn are both increasing (to r). The inequality for convex optimisation implies that for n > m.

${\displaystyle d(u_{n},u_{m})^{2}\leq R_{n}^{2}-R_{m}^{2}\leq 2r|R_{n}-R_{m}|,}$

so that un is a Cauchy sequence. If u is its limit, then d(y,u) = r and h(u) = h(y) − r. By uniqueness it follows that u = v and hence d(y,v) = r, as required.

Uniform limits. The above argument proves more generally that if d(xn,x0) tends to infinity and the functions hn(x) = d(x,xn) – d(xn,x0) tend uniformly on bounded sets to h(x), then h is convex, Lipschitz with Lipschitz constant 1 and, given y in X and r > 0, there is a unique point v with d(y,v) = r such that h(v) = h(y) − r. If on the other hand the sequence (xn) is bounded, then the terms all lie in some closed ball and uniform convergence there implies that (xn) is a Cauchy sequence so converges to some x in X. So hn tends uniformly to h(x) = d(x,x) – d(x,x0), a function of the same form. The same argument also shows that the class of functions which satisfy the same three conditions (being convex, Lipschitz and having minima on closed balls) is closed under taking uniform limits on bounded sets.

Comment. Note that, since any closed convex subset of a Hadamard subset of a Hadamard space is also a Hadamard space, any closed ball in a Hadamard space is a Hadamard space. In particular it need not be the case that every geodesic segment is contained in a geodesic defined on the whole of R or even a semi-infinite interval [0,∞). The closed unit ball of a Hilbert space gives an explicit example which is not a proper metric space.

• If h is a convex function, Lipschitz with constant 1 and h assumes its minimum on any closed ball centred on y with radius r at a unique point v on the boundary with h(v) = h(y) − r, then for each y in X there is a unique geodesic ray δ such that δ(0) = y and δ cuts each closed convex set h ≤ h(y) – r with r > 0 at δ(r), so that h(δ(t)) = h(y) – t. In particular this holds for each Busemann function.[7]

In fact the third condition implies that v is the closest point to y in the closed convex set Cr of points u such that h(u) ≤ h(y) – r. Let δ(t) for 0 ≤ tr be the geodesic joining y to v. Then k(t) = h(δ(t)) - h(y) is a convex Lipschitz function on [0,r] with Lipschitz constant 1 satisfying k(t) ≤ – t and k(0) = 0 and k(r) = –r. So k vanishes everywhere, since if 0 < s < r, k(s) ≤ –s and |k(s)| ≤ s. Hence h(δ(t)) = h(y) – t. By uniqueness it follows that δ(t) is the closest point to y in Ct and that it is the unique point minimising h in B(y,t). Uniqueness implies that these geodesics segments coincide for arbitrary r and therefore that δ extends to a geodesic ray with the stated property.

• If h = hγ , then the geodesic ray δ starting at y satisfies ${\displaystyle \sup d(\gamma (t),\delta (t))<\infty }$. If δ1 is another ray starting at y with ${\displaystyle \sup d(\delta (t),\delta _{1}(t))<\infty }$ then δ1 = δ.

To prove the first assertion, it is enough to check this for t sufficiently large. In that case γ(t) and δ(th(y)) are the projections of x and y onto the closed convex set h ≤ −t. Therefore, d(γ(t),δ(th(y))) ≤ d(x,y). Hence d(γ(t),δ(t)) ≤ d(γ(t),δ(th(y))) + d(δ(th(y)),δ(t)) ≤ d(x,y) + |h(y)|. The second assertion follows because d1(t),δ(t)) is convex and bounded on [0,∞), so, if it vanishes at t = 0, must vanish everywhere.

• Suppose that h is a continuous convex function and for each y in X there is a unique geodesic ray δ such that δ(0) = y and δ cuts each closed convex set h ≤ h(y) – r with r > 0 at δ(r), so that h(δ(t)) = h(y) – t; then h is a Busemann function. In fact h − hδ is a constant function.[7]

Let Cr be the closed convex set of points z with h(z) ≤ −r. Since X is a Hadamard space for every point y in X there is a unique closest point Pr(y) to y in Cr. It depends continuously on y and if y lies outside Cr, then Pr(y) lies on the hypersurface h(z) = − r—the boundary ∂Cr of Cr—and Pr(y) satisfies the inequality of convex optimisation. Let δ(s) be the geodesic ray starting at y.

Fix x in X. Let γ(s) be the geodesic ray starting at x. Let g(z) = hγ(z), the Busemann function for γ with base point x. In particular g(x) = 0. It suffices to show that g = hh(y)1. Now take y with h(x) = h(y) and let δ(t) be the geodesic ray starting at y corresponding to h. Then

${\displaystyle d(x,y)\geq d(\gamma (t),\delta (t)),\,\,\,d(x,\delta (t))^{2}\geq d(x,\gamma (t))^{2}+d(\gamma (t),\delta (t))^{2},\,\,\,d(y,\gamma (t))^{2}\geq d(y,\delta (t))^{2}+d(\gamma (t),\delta (t))^{2}.}$

On the other hand, for any four points a, b, c, d in a Hadamard space, the following quadrilateral inequality of Reshetnyak holds:

${\displaystyle |d(a,c)^{2}+d(b,d)^{2}-d(a,d)^{2}-d(b,d)^{2}|\leq 2d(a,b)d(c,d).}$

Setting a = x, b = y, c = γ(t), d = δ(t), it follows that

${\displaystyle |d(y,\gamma (t))^{2}-d(x,\delta (t))^{2}|\leq 2d(x,y)^{2},}$

so that

${\displaystyle |d(y,\gamma (t))-d(x,\gamma (t))|\leq 2{d(x,y)^{2} \over d(y,\gamma (t))+d(x,\gamma (t))}\leq {d(x,y)^{2} \over t}.}$

Hence hγ(y) = 0. Similarly hδ(x) = 0. Hence hγ(y) = 0 on the level surface of h containing x. Now for t ≥ 0 and z in X, let αt(z) = γ1(t) the geodesic ray starting at z. Then αs + t = αs ∘ αt and h ∘ αt = ht. Moreover, by boundedness, dt(u),αt(v)) ≤ d(u,v). The flow αs can be used to transport this result to all the level surfaces of h. In fact for general y1, if h(y1) < h(x), take s > 0 such that hs(x)) = h(y1) and set x1 = αs(x). Then hγ1(y1) = 0, where γ1(t) = αt(x1) = γ(s + t). But then hγ1 = hγs, so that hγ(y1) = s. Hence g(y1) = s = h((αs(x)) – h(x) = h(y1) – h(x), as required. Similarly if h(y1) > h(x), take s > 0 such that hs(y1)) = h(x). Let y = αs(y1). Then hγ(y) = 0, so hγ(y1) = –s. Hence g(y1) = –s = h(y1) – h(x), as required.

Finally there are necessary and sufficient conditions for two geodesics to define the same Busemann function up to constant:

• On a Hadamard space, the Busemann functions of two geodesic rays ${\displaystyle \gamma _{1}}$ and ${\displaystyle \gamma _{2}}$ differ by a constant if and only if ${\displaystyle \sup _{t\geq 0}d(\gamma _{1}(t),\gamma _{2}(t))<\infty }$.[8]

Suppose firstly that γ and δ are two geodesic rays with Busemann functions differing by a constant. Shifting the argument of one of the geodesics by a constant, it may be assumed that Bγ = Bδ = B, say. Let C be the closed convex set on which B(x) ≤ −r. Then B(γ(t)) = Bγ(γ(t)) = −t and similarly B(δ(t)) = − t. Then for sr, the points γ(s) and δ(s) have closest points γ(r) and δ(r) in C, so that d(γ(r), δ(r)) ≤ d(γ(s), δ(s)). Hence supt ≥ 0 d(γ(t), δ(t)) < ∞.

Now suppose that supt ≥ 0 d1(t), γ2(t)) < ∞. Let δi(t) be the geodesic ray starting at y associated with hγi. Then supt ≥ 0 di(t), δi(t)) < ∞. Hence supt ≥ 0 d1(t), δ2(t)) < ∞. Since δ1 and δ2 both start at y, it follows that δ1(t) ≡ δ2(t). By the previous result hγi and hδi differ by a constant; so hγ1 and hγ2 differ by a constant.

To summarise, the above results give the following characterisation of Busemann functions on a Hadamard space:[7]

THEOREM. On a Hadamard space, the following conditions on a function f are equivalent:

• h is a Busemann function.
• h is a convex function, Lipschitz with constant 1 and h assumes its minimum on any closed ball centred on y with radius r at a unique point v on the boundary with h(v) = h(y) − r.
• h is a continuous convex function and for each y in X there is a unique geodesic ray δ such that δ(0) = y and, for any r >0, the ray δ cuts each closed convex set h ≤ h(y) – r at δ(r).

## Bordification of a Hadamard space

In the previous section it was shown that if X is a Hadamard space and x0 is a fixed point in X then the union of the space of Busemann functions vanishing at x0 and the space of functions hy(x) = d(x,y) − d(x0,y) is closed under taking uniform limits on bounded sets. This result can be formalised in the notion of bordification of X.[9] In this topology, the points xn tend to a geodesic ray γ starting at x0 if and only if d(x0,xn) tends to ∞ and for t > 0 arbitrarily large the sequence obtained by taking the point on each segment [x0,xn] at a distance t from x0 tends to γ(t).

If X is a metric space, Gromov's bordification can be defined as follows. Fix a point x0 in X and let XN = B(x0,N). Let Y = C(X) be the space of Lipschitz continuous functions on X, .e. those for which |f(x) – f(y)| ≤ A d(x,y) for some constant A > 0. The space Y can be topologised by the seminorms ||f||N = supXN |f|, the topology of uniform convergence on bounded sets. The seminorms are finite by the Lipschitz conditions. This is the topology induced by the natural map of C(X) into the direct product of the Banach spaces Cb(XN) of continuous bounded functions on XN. It is give by the metric D(f,g) = ∑ 2N ||fg||N(1 +||fg||N)−1.

The space X is embedded into Y by sending x to the function fx(y) = d(y,x) – d(x0,x). Let X be the closure of X in Y. Then X is metrisable, since Y is, and contains X as an open subset; moreover bordifications arising from different choices of basepoint are naturally homeomorphic. Let h(x) = (d(x,x0) + 1)−1. Then h lies in C0(X). It is non-zero on X and vanishes only at ∞. Hence it extends to a continuous function on X with zero set X \ X. It follows that X \ X is closed in X, as required. To check that X = X(x0) is independent of the basepoint, it suffices to show that k(x) = d(x,x0) − d(x,x1) extends to a continuous function on X. But k(x) = fx(x1), so, for g in X, k(g) = g(x1). Hence the correspondence between the compactifications for x0 and x1 is given by sending g in X(x0) to g + g(x1)1 in X(x1).

When X is a Hadamard space, Gromov's ideal boundary ∂X = X \ X can be realised explicitly as "asymptotic limits" of geodesic rays using Busemann functions. In fact if xn is an unbounded sequence in X with hn(x) = d(x,xn) − d(xn,x0) tending to h in Y, then h vanishes at x0, is convex, Lipschitz with Lipschitz constant 1 and has minimum h(y) − r on any closed ball B(y,r). Hence h is a Busemann function Bγ corresponding to a unique geodesic ray γ starting at x0.

On the other hand, hn tends to Bγ uniformly on bounded sets if and only if d(x0,xn) tends to ∞ and for t > 0 arbitrarily large the sequence obtained by taking the point on each segment [x0,xn] at a distance t from x0 tends to γ(t). In fact for d(x0,xn) ≥ t, let xn(t) be the point in [x0,xn] with d(x0,xn(t)) = t. Suppose first that hn tends to Bγ uniformly on B(x0,R). Then for tR, |hn(γ(t)) – Bγ(γ(t))|=d(γ(t),xn) – d(xn,x0) + t. This is a convex function. It vanishes as t = 0 and hence is increasing. So it is maximised at t = R. So for each t, |d(γ(t),xn) – d(xn,x0) – t| tends towards 0. Let a = X0, b = γ(t) and c = xn. Then d(c,a) – d(c,b) is close to d(a,b) with d(c,a) large. Hence in the Euclidean comparison triangle CA - CB is close to AB with CA large. So the angle at A is small. So the point D on AC at the same distance as AB lies close to B. Hence, by the first comparison theorem for geodesic triangles, d(xn(t),γ(t)) is small. Conversely suppose that for fixed t and n sufficiently large d(xn(t),γ(t)) tends to 0. Then from the above Fs(y) = d(y,γ(s)) – s satisfies

${\displaystyle |F_{s}(y)-B_{\gamma }(y)|\leq {d(x_{0},y)^{2} \over 2s},}$

so it suffices show that on any bounded set hn(y) = d(y,xn) – d(x0,xn) is uniformly close to Fs(y) for n sufficiently large.[10]

For a fixed ball B(x0,R), fix s so that R2/s ≤ ε. The claim is then an immediate consequence of the inequality for geodesic segments in a Hadamard space, since

${\displaystyle |d(y,x_{n})-d(y,x_{0})-d(y,x_{n}(s))+s|\leq {d(x_{0},y)^{2} \over s}\leq \varepsilon .}$

Hence, if y in B(x0,R) and n is sufficiently large that d(xn(s),γ(s)) ≤ ε, then

${\displaystyle |h_{n}(y)-B_{\gamma }(y)|=|d(y,x_{n})-d(y,x_{0})-B_{\gamma }(y)|\leq |d(y,x_{n})-d(y,x_{0})-d(y,x_{n}(s))+s|+d(x_{n}(s),\gamma (s))+|F_{s}(y)-B_{\gamma }(y)|\leq 3\varepsilon .}$

## Busemann functions on a Hadamard manifold

Suppose that x, y are points in a Hadamard manifold and let γ(s) be the geodesic through x with γ(0) = y. This geodesic cuts the boundary of the closed ball B(y,r) at the two points γ(±r). Thus if d(x,y) > r, there are points u, v with d(y,u) = d(y,v) = r such that |d(x,u) − d(x,v)| = 2r. By continuity this condition persists for Busemann functions:

• If h is a Busemann function on a Hadamard manifold, then, given y in X and r > 0, there are unique points u, v with d(y,u) = d(y,v) = r such that h(u) = h(y) + r and h(v) = h(y) − r. For fixed r > 0, the points u and v depend continuously on y.[3]

In fact taking a sequence tn tending to ∞ and hn = Ftn, there are points un and vn which satisfy these conditions for hn for n sufficiently large. Passing to a subsequence if necessary, it can be assumed that un and vn tend to u and v. By continuity these points satisfy the conditions for h. To prove uniqueness, note that by compactness h assumes its maximum and minimum on B(y,r). The Lipschitz condition shows that the values of h there differ by at most 2r. Hence h is minimized at v and maximized at u. On the other hand, d(u,v) = 2r and for u and v the points v and u are the unique points in B(y,r) maximizing this distance. The Lipschitz condition on h then immediately implies u and v must be the unique points in B(y,r) maximizing and minimizing h. Now suppose that yn tends to y. Then the corresponding points un and vn lie in a closed ball so admit convergent subsequences. But by uniqueness of u and v any such subsequences must tend to u and v, so that un and vn must tend to u and v, establishing continuity.

The above result holds more generally in a Hadamard space.[11]

• If h is a Busemann function on a Hadamard manifold, then h is continuously differentiable with ||dh(y)|| = 1 for all y.[3]

From the previous properties of h, for each y there is a unique geodesic γ(t) parametrised by arclength with γ(0) = y such that h ∘ γ(t) = h(y) + t. It has the property that it cuts ∂B(y,r) at t = ±r: in the previous notation γ(r) = u and γ(–r) = v. The vector field Vh defined by the unit vector ${\displaystyle {\dot {\gamma }}(0)}$ at y is continuous, because u is a continuous function of y and the map sending (x,v) to (x,expx v) is a diffeomorphism from TX onto X × X by the Cartan-Hadamard theorem. Let δ(s) be another geodesic parametrised by arclength through y with δ(0) = y. Then dh ∘ δ (0)/ ds = ${\displaystyle ({\dot {\delta }}(0),{\dot {\gamma }}(0))}$. Indeed, let H(x) = h(x) − h(y), so that H(y) = 0. Then

${\displaystyle |H(\delta (s))-H(x)|\leq d(\delta (s),x).}$

Applying this with x = u and v, it follows that for s > 0

${\displaystyle (r-d(\delta (s),u))/s\leq (h(\delta (s))-h(y))/s\leq (d(\delta (s),v)-r)/s.}$

The outer terms tend to ${\displaystyle ({\dot {\delta }}(0),{\dot {\gamma }}(0))}$ as s tends to 0, so the middle term has the same limit, as claimed. A similar argument applies for s < 0.

The assertion on the outer terms follows from the first variation formula for arclength, but can be deduced directly as follows. Let ${\displaystyle a={\dot {\delta }}(0)}$ and ${\displaystyle b={\dot {\gamma }}(0)}$, both unit vectors. Then for tangent vectors p and q at y in the unit ball[12]

${\displaystyle d(\exp _{y}p,\exp _{y}q)=\|p-q\|+\varepsilon \max \|p\|^{2},\|q\|^{2}}$

with ε uniformly bounded. Let s = t3 and r = t2. Then

${\displaystyle (d(\delta (s),v)-r)/s=(d(\exp _{y}(t^{3}a),\exp _{y}(-t^{2}b))-t^{2})/t^{3}=(\|t^{3}a+t^{2}b\|-t^{2})/t^{3}+\varepsilon |t|=(\|ta+b\|-1)/t+\varepsilon |t|.}$

The right hand side here tends to (a,b) as t tends to 0 since

${\displaystyle {d \over dt}\|b+ta\||_{t=0}={1 \over 2}\,{d \over dt}\|b+ta\|^{2}|_{t=0}=(a,b).}$

The same method works for the other terms.

Hence it follows that h is a C1 function with dh dual to the vector field Vh, so that ||dh(y)|| = 1. The vector field Vh is thus the gradient vector field for h. The geodesics through any point are the flow lines for the flow αt for Vh, so that αt is the gradient flow for h.

THEOREM. On a Hadamard manifold X the following conditions on a continuous function h are equivalent:[3]

1. h is a Busemann function.
2. h is a convex, Lipschitz function with constant 1, and for each y in X there are points u± at a distance r from y such that h(u±) = h(y) ± r.
3. h is a convex C1 function with ||dh(x)|| ≡ 1.

It has already been proved that (1) implies (2).

The arguments above show mutatis mutandi that (2) implies (3).

It therefore remains to show that (3) implies (1). Fix x in X. Let αt be the gradient flow for h. It follows that h ∘ αt (x) = h(x) + t and that γ(t) = αt(x) is a geodesic through x parametrised by arclength with γ(0) = x. Indeed, if s < t, then

${\displaystyle |s-t|=|h(\alpha _{s}(x))-h(\alpha _{t}(x))|\leq d(\alpha _{s}(x),\alpha _{t}(x))\leq \int _{s}^{t}\|d\alpha _{\tau }(x)/d\tau \|\,d\tau =\int _{s}^{t}\|dh(\alpha _{\tau }(x))\|\,d\tau =|s-t|,}$

so that d(γ(s),γ(t)) = |st|. Let g(y) = hγ(y), the Busemann function for γ with base point x. In particular g(x) = 0. To prove (1), it suffices to show that g = hh(x)1.

Let C(−r) be the closed convex set of points z with h(z) ≤ −r. Since X is a Hadamard space for every point y in X there is a unique closest point Pr(y) to y in C(-r). It depends continuously on y and if y lies outside C(-r), then Pr(y) lies on the hypersurface h(z) = − r—the boundary ∂C(–r) of C(–r)—and the geodesic from y to Pr(y) is orthogonal to ∂C(–r). In this case the geodesic is just αt(y). Indeed, the fact that αt is the gradient flow of h and the conditions ||dh(y)|| ≡ 1 imply that the flow lines αt(y) are geodesics parametrised by arclength and cut the level curves of h orthogonally. Taking y with h(y) = h(x) and t > 0,

${\displaystyle d(x,y)\geq d(\alpha _{t}(x),\alpha _{t}(y)),\,\,\,d(x,\alpha _{t}(y))^{2}\geq d(x,\alpha _{t}(x))^{2}+d(\alpha _{t}(x),\alpha _{t}(y))^{2},\,\,\,d(y,\alpha _{t}(x))^{2}\geq d(y,\alpha _{t}(y))^{2}+d(\alpha _{t}(x),\alpha _{t}(y))^{2}.}$

On the other hand, for any four points a, b, c, d in a Hadamard space, the following quadrilateral inequality of Reshetnyak holds:

${\displaystyle |d(a,c)^{2}+d(b,d)^{2}-d(a,d)^{2}-d(b,d)^{2}|\leq 2d(a,b)d(c,d).}$

Setting a = x, b = y, c = αt(x), d = αt(y), it follows that

${\displaystyle |d(y,\alpha _{t}(x))^{2}-d(x,\alpha _{t}(x))^{2}|\leq 2d(x,y)^{2},}$

so that

${\displaystyle |d(y,\alpha _{t}(x))-d(x,\alpha _{t}(x))|\leq 2{d(x,y)^{2} \over d(y,\alpha _{t}(x))+d(x,\alpha _{t}(x))}\leq {d(x,y)^{2} \over t}.}$

Hence hγ(y) = 0 on the level surface of h containing x. The flow αs can be used to transport this result to all the level surfaces of h. In fact for general y1 take s such that hs(x)) = h(y1) and set x1 = αs(x). Then hγ1(y1) = 0, where γ1(t) = αt(x1) = γ(s + t). But then hγ1 = hγs, so that hγ(y1) = s. Hence g(y1) = s = h((αs(x)) – h(x) = h(y1) – h(x), as required.

Note that this argument could be shortened using the fact that two Busemann functions hγ and hδ differ by a constant if and only if the corresponding geodesic rays satisfy supt ≥ 0 d(γ(t),δ(t)) < ∞. Indeed, all the geodesics defined by the flow αt satisfy the latter condition, so differ by constants. Since along any of these geodesics h is linear with derivative 1, h must differ from these Busemann functions by constants.

## Compactification of a proper Hadamard space

Eberlein & O'Neill (1973) defined a compactification of a Hadamard manifold X which uses Busemann functions. Their construction, which can be extended more generally to proper (i.e. locally compact) Hadamard spaces, gives an explicit geometric realisation of a compactification defined by Gromov—by adding an "ideal boundary"—for the more general class of proper metric spaces X, those for which every closed ball is compact. Note that, since any Cauchy sequence is contained in a closed ball, any proper metric space is automatically complete.[13] The ideal boundary is a special case of the ideal boundary for a metric space. In the case of Hadamard spaces, this agrees with the space of geodesic rays emanating from any fixed point described using Busemann functions in the bordification of the space.

If X is a proper metric space, Gromov's compactification can be defined as follows. Fix a point x0 in X and let XN = B(x0,N). Let Y = C(X) be the space of Lipschitz continuous functions on X, .e. those for which |f(x) – f(y)| ≤ A d(x,y) for some constant A > 0. The space Y can be topologised by the seminorms ||f||N = supXN |f|, the topology of uniform convergence on compacta. This is the topology induced by the natural map of C(X) into the direct product of the Banach spaces C(XN). It is give by the metric D(f,g) = ∑ 2N ||fg||N(1 +||fg||N)−1.

The space X is embedded into Y by sending x to the function fx(y) = d(y,x) – d(x0,x). Let X be the closure of X in Y. Then X is compact (metrisable) and contains X as an open subset; moreover compactifications arising from different choices of basepoint are naturally homeomorphic. Compactness follows from the Arzelà–Ascoli theorem since the image in C(XN) is equicontinuous and uniformly bounded in norm by N. Let xn be a sequence in XX tending to y in X \ X. Then all but finitely many terms must lie outside XN since XN is compact, so that any subsequence would converge to a point in XN; so the sequence xn must be unbounded in X. Let h(x) = (d(x,x0) + 1)−1. Then h lies in C0(X). It is non-zero on X and vanishes only at ∞. Hence it extends to a continuous function on X with zero set X \ X. It follows that X \ X is closed in X, as required. To check that the compactification X = X(x0) is independent of the basepoint, it suffices to show that k(x) = d(x,x0) − d(x,x1) extends to a continuous function on X. But k(x) = fx(x1), so, for g in X, k(g) = g(x1). Hence the correspondence between the compactifications for x0 and x1 is given by sending g in X(x0) to g + g(x1)1 in X(x1).

When X is a Hadamard manifold (or more generally a proper Hadamard space), Gromov's ideal boundary ∂X = X \ X can be realised explicitly as "asymptotic limits" of geodesics by using Busemann functions. Fixing a base point x0, there is a unique geodesic γ(t) parametrised by arclength such that γ(0) = x0 and ${\displaystyle {\dot {\gamma }}(0)}$ is a given unit vector. If Bγ is the corresponding Busemann function, then Bγ lies in ∂X(x0) and induces a homeomorphism of the unit (n − 1)-sphere onto ∂X(x0), sending ${\displaystyle {\dot {\gamma }}(0)}$ to Bγ.

## Quasigeodesics in the Poincaré disk, CAT(-1) and hyperbolic spaces

### Morse–Mostow lemma

In the case of spaces of negative curvature, such as the Poincaré disk, CAT(-1) and hyperbolic spaces, there is a metric structure on their Gromov boundary. This structure is preserved by the group of quasi-isometries which carry geodesics rays to quasigeodesic rays. Quasigeodesics were first studied for negatively curved surfaces—in particular the hyperbolic upper halfplane and unit disk—by Morse and generalised to negatively curved symmetric spaces by Mostow, for his work on the rigidity of discrete groups. The basic result is the Morse–Mostow lemma on the stability of geodesics.[14][15][16][17]

By definition a quasigeodesic Γ defined on an interval [a,b] with −∞ ≤ a < b ≤ ∞ is a map Γ(t) into a metric space, not necessarily continuous, for which there are constants λ ≥ 1 and ε > 0 such that for all s and t:

${\displaystyle \lambda ^{-1}|s-t|-\varepsilon \leq d(\Gamma (s),\Gamma (t))\leq \lambda |s-t|+\varepsilon .}$

The following result is essentially due to Marston Morse (1924).

Morse's lemma on stability of geodesics. In the hyperbolic disk there is a constant R depending on λ and ε such that such that any quasigeodesic segment Γ defined on a finite interval [a,b] is within a Hausdorff distance R of the geodesic segment [Γ(a),Γ(b)].[18][19]

### Classical proof for Poincaré disk

The classical proof of Morse's lemma for the Poincaré unit disk or upper halfplane proceeds more directly by using orthogonal projection onto the geodesic segment.[20][21][22]

• It can be assumed that Γ satisfies the stronger "pseudo-geodesic" condition:[23]
${\displaystyle \lambda ^{-1}|s-t|-\varepsilon \leq d(\Gamma (s),\Gamma (t))\leq \lambda |s-t|.}$
In fact Γ can be replaced by a continuous piecewise geodesic curve Δ with the same endpoints lying at a finite Hausdorff distance from Γ less than c = (2λ2 + 1)ε: in fact break up the interval on which Γ is defined into equal subintervals of length 2λε and take the geodesics between the images under Γ of the endpoints of the subintervals. Since Δ is piecewise geodesic, Δ is Lipschitz continuous with constant λ1, d(Δ(s),Δ(t)) ≤ λ1|st|, where λ1 ≤ λ + ε. The lower bound is automatic at the endpoints of intervals. By construction the other values differ from these by a uniformly bounded depending only on λ and ε; the lower bound inequality holds by increasing ε by adding on twice this uniform bound.
• If γ is a piecewise smooth curve segment lying outside an s-neighbourhood of a geodesic line and P is the orthogonal projection onto the geodesic line then:[24]
${\displaystyle \ell (P\circ \gamma )\leq {\ell (\gamma ) \over \cosh s}.}$
In fact applying an isometry in the upper half plane it may be assumed that the geodesic line is the positive imaginary axis in which case the orthogonal projection onto it is given by P(z) = i|z| and |z| / Im z = cosh d(z,Pz). Hence the hypothesis implies |γ(t)| ≥ cosh(s) Im γ(t), so that
${\displaystyle \ell (P\circ \gamma )=\int _{a}^{b}{|d\gamma | \over |\gamma |}\leq \int _{a}^{b}{|d\gamma | \over \cosh(s)\,\Im \gamma }={\ell (\gamma ) \over \cosh(s)}.}$
• There is a constant h > 0 depending only on λ and ε such that Γ[a,b] lies within an h-neighbourhood of the geodesic segment [Γ(a),Γ(b)].[25]
Let γ(t) be the geodesic line containing the geodesic segment [Γ(a),Γ(b)]. Then there is a constant h > 0 depending only on λ and ε such that h-neighbourhood Γ[a,b] lies within an h-neighbourhood of γ(R). Indeed for any s > 0, the subset of [a,b] for which Γ(t) lies outside the closure of the s-neighbourhood of γ(R) is open, so a countable union of open intervals (c,d). Then
${\displaystyle \ell (\Gamma |_{[c,d]})\leq s_{1}\equiv \lambda ^{2}(2s+\varepsilon )\left(1-{\lambda ^{2} \over \cosh(s)}\right),}$
since the left hand side is less than or equal to λ|cd| and
${\displaystyle {|c-d| \over \lambda }-\varepsilon \leq d(\Gamma (c),\Gamma (d))\leq 2s+d(P\circ \Gamma |_{[c,d]})\leq 2s+{\lambda |c-d| \over \cosh(s)}.}$
Hence every point lies at a distance less than or equal to s + s1 of γ(R). To deduce the assertion, note that the subset of [a,b] for which Γ(t) lies outside the closure of the s-neighbourhood of [Γ(a),Γ(b)] ⊂ γ(R) is open, so a union of intervals (c,d) with Γ(c) and Γ(d) both at a distance s + s1 from either Γ(a) or Γ(b). Then
${\displaystyle \ell (\Gamma |_{[c,d]})\leq s_{2}\equiv \lambda ^{2}(2(s+s_{1})+\varepsilon ),}$
since
${\displaystyle {|c-d| \over \lambda }-\varepsilon \leq d(\Gamma (c),\Gamma (d))\leq 2(s+s_{1}).}$
Hence the assertion follows taking any h greater than s +s1 + s2.
• There is a constant h > 0 depending only on λ and ε such that the geodesic segment [Γ(a),Γ(b)] lies within an h-neighbourhood of Γ[a,b].[26]
In fact, every point of Γ[a,b] lies within a distance h of [Γ(a),Γ(b)]. Thus orthogonal projection P carries each point of Γ[a,b] onto a point in the closed convex set [Γ(a),Γ(b)] at a distance less than h. Since P is continuous and Γ[a,b] connected, the map P must be onto since the image contains the endpoints of [Γ(a),Γ(b)]. But then every point of [Γ(a),Γ(b)] is within a distance h of a point of Γ[a,b].

### Gromov's proof for Poincaré disk

The generalisation of Morse's lemma to CAT(-1) spaces is often referred to as the Morse–Mostow lemma and can be proved by a straightforward generalisation of the classical proof. There is also a generalisation for the more general class of hyperbolic metric spaces due to Gromov. Gromov's proof is given below for the Poincaré unit disk; the properties of hyperbolic metric spaces are developed in the course of the proof, so that it applies mutatis mutandi to CAT(-1) or hyperbolic metric spaces.[14][15]

In fact since this is a large scale phenomenon, it is enough to check that any maps Δ from {0, 1, 2, ..., N} for any N > 0 to the disk satisfying the inequalities is within a Hausdorff distance R1 of the geodesic segment [Δ(0),Δ(N)]. For then translating it may be assumed without loss of generality Γ is defined on [0,r] with r > 1 and then, taking N = [r] (the integer part of r), the result can be applied to Δ defined by Δ(i) = Γ(i). The Hausdorff distance between the images of Γ and Δ is evidently bounded by a constant R2 depending only on λ and ε.
Now the incircle of a geodesic triangle has diameter less than δ where δ = 2 log 3; indeed it is strictly maximised by that of an ideal triangle where it equals 2 log 3. In particular, since the incircle breaks the triangle breaks the triangle into three isosceles triangles with the third side opposite the vertex of the original triangle having length less than δ, it follows that every side of a geodesic triangle is contained in a δ-neighbourhood of the other two sides. A simple induction argument shows that a geodesic polygon with 2k + 2 vertices for k ≥ 0 has each side within a (k + 1)δ neighbourhood of the other sides (such a polygon is made by combining two geodesic polygons with 2k−1 + 1 sides along a common side). Hence if M ≤ 2k + 2, the same estimate holds for a polygon with M sides.
For yi = Δ(i) let f(x) = min d(x,yi), the largest radius for a closed ball centred on x which contains no yi in its interior. This is a continuous function non-zero on [Δ(0),Δ(N)] so attains its maximum h at some point x in this segment. Then [Δ(0),Δ(N)] lies within an h1-neighbourhood of the image of Δ for any h1 > h. It therefore suffices to find an upper bound for h independent of N.
Choose y and z in the segment [Δ(0),Δ(N)] before and after x with d(x,y) = 2h and d(x,z) = 2h (or an endpoint if it within a distance of less than 2h from x). Then there are i, j with d(y,Δ(i)), d(z,Δ(j)) ≤ h. Hence d(Δ(i),Δ(j)) ≤ 6h, so that |ij| ≤ 6λh + λε. By the triangle inequality all points on the segments [y,Δ(i)] and [z,Δ(j)] are at a distance ≥ h from x. Thus there is a finite sequence of points starting at y and ending at z, lying first on the segment [y,Δ(i)], then proceeding through the points Δ(i), Δ(i+1), ..., Δ(j), before taking the segment [Δ(j),z]. The successive points Δ(i), Δ(i+1), ..., Δ(j) are separated by a distance no greater than λ + ε and successive points on the geodesic segments can also be chosen to satisfy this condition. The minimum number K of points in such a sequence satisfies K ≤ |i - j| + 3 + 2(λ + ε)–1h. These points form a geodesic polygon, with [y,z] as one of the sides. Take L = [h/δ], so that the (L − 1)δ-neighbourhood of [y,z] does not contain all the other sides of the polygon. Hence, from the result above, it follows that K > 2L − 1 + 2. Hence
${\displaystyle 3+2(\lambda +\varepsilon )^{-1}h+6\lambda h+\varepsilon >2^{h/\delta }+2.}$
This inequality implies that h is uniformly bounded, independently of N, as claimed.
If all points Δ(i) lie within h1 of the [Δ(0),Δ(N)], the result follows. Otherwise the points which do not fall into maximal subsets S = {r, ..., s} with r < s. Thus points in [Δ(0),Δ(N)] have a point Δ(i) with i in the complement of S within a distance of h1. But the complement of S = S1S2, a disjoint union with S1 = {0, ..., r − 1} and S2 = {s + 1, ..., N}. Connectivity of [Δ(0),Δ(N)] implies there is a point x in the segment which is within a distance h1 of points Δ(i) and Δ(j) with i < r and j > s. But then d(Δ(i),Δ(j)) < 2 h1, so |ij| ≤ 2λh1 + λε. Hence the points Δ(k) for k in S lie within a distance from [Δ(0),Δ(N)] of less than h1 + λ|ij| + ε ≤ h1 + λ (2λh1 + λε) + ε ≡ h2.

### Extension to quasigeodesic rays and lines

Recall that in a Hadamard space if [a1,b1] and [a2,b2] are two geodesic segments and the intermediate points c1(t) and c2(t) divide them in the ratio t:(1 – t), then d(c1(t),c2(t)) is a convex function of t. In particular if Γ1(t) and Γ2(t) are geodesic segments of unit speed defined on [0,R] starting at the same point then

${\displaystyle d(\Gamma _{1}(t),\Gamma _{2}(t))\leq {t \over R}d(\Gamma _{1}(R),\Gamma _{2}(R)).}$

In particular this implies the following:

• In a CAT(–1) space X, there is a constant h > 0 depending only on λ and ε such that any quasi-geodesic ray is within a bounded Hausdorff distance h of a geodesic ray. A similar result holds for quasigeodesic and geodesic lines.
In fact if Γ(t) is a geodesic say with constant λ and ε, let ΓN(t) be the unit speed geodesic for the segment [Γ(0),Γ(N)]. The estimate above shows that for fixed R > 0 and N sufficiently large, (ΓN) is a Cauchy sequence in C([0,R],X) with the uniform metric. Thus ΓN tends to a geodesic ray γ uniformly on compacta the bound on the Hausdorff distances between Γ and the segments ΓN applies also to the limiting geodesic γ. The assertion for quasigeodesic lines follows by taking ΓN corresponding to the geodesic segment [Γ(–N),Γ(N)].

## Efremovich–Tikhomirova theorem

Before discussing CAT(-1) spaces, this section will describe the Efremovich–Tikhomirova theorem for the unit disk D with the Poincaré metric. It asserts that quasi-isometries of D extend to quasi-Möbius homeomorphisms of the unit disk with the Euclidean metric. The theorem forms the prototype for the more general theory of CAT(-1) spaces. Their original theorem was proved in a slightly less general and less precise form in Efremovich & Tikhomirova (1964) and applied to bi-Lipschitz homeomorphisms of the unit disk for the Poincaré metric;[27] earlier, in the posthumous paper Mori (1957), the Japanese mathematician Akira Mori had proved a related result within Teichmüller theory assuring that every quasiconformal homeomorphism of the disk is Hölder continuous and therefore extends continuously to a homeomorphism of the unit circle (it is known that this extension is quasi-Möbius).[28]

### Extension of quasi-isometries to boundary

If X is the Poincaré unit disk, or more generally a CAT(-1) space, the Morse lemma on stability of quasigeodesics implies that every quasi-isometry of X extends uniquely to the boundary. By definition two self-mappings f, g of X are quasi-equivalent if supX d(f(x),g(x)) < ∞, so that corresponding points are at a uniformly bounded distance of each other. A quasi-isometry f1 of X is a self-mapping of X, not necessarily continuous, which has a quasi-inverse f2 such that f1f2 and f2f1 are quasi-equivalent to he identity and such that there are constants λ ≥ 1 and ε > 0 such that for all x, y in X and both mappings

${\displaystyle \lambda ^{-1}d(x,y)-\varepsilon \leq d(f_{k}(x),f_{k}(y))\leq \lambda d(x,y)+\varepsilon .}$

Note that quasi-inverses are unique up to quasi-equivalence; that equivalent definition could be given using possibly different right and left-quasi inverses, but they would necessarily be quasi-equivalent; that quasi-isometries are closed under composition which up to quasi-equivalence depends only the quasi-equivalence classes; and that, modulo quasi-equivalence, the quasi-isometries form a group.[29]

Fixing a point x in X, given a geodesic ray γ starting at x, the image f ∘ γ under a quasi-isometry f is a quasi-geodesic ray. By the Morse-Mostow lemma it is within a bounded distance of a unique geodesic ray δ starting at x. This defines a mapping ∂f on the boundary ∂X of X, independent of the quasi-equivalence class of f, such that ∂(fg) = ∂f ∘ ∂g. Thus there is a homomorphism of the group of quasi-isometries into the group of self-mappings of ∂X.

To check that ∂f is continuous, note that if γ1 and γ2 are geodesic rays that are uniformly close on [0,R], within a distance η, then f ∘ γ1 and f ∘ γ2 lie within a distance λη + ε on [0,R], so that δ1 and δ2 lie within a distance λη + ε + 2h(λ,ε); hence on a smaller interval [0,r], δ1 and δ2 lie within a distance (r/R)⋅[λη + ε + 2h(λ,ε)] by convexity.[30]

On CAT(-1) spaces, a finer version of continuity asserts that ∂f is a quasi-Möbius mapping with respect to a natural class of metric on ∂X, the "visual metrics" generalising the Euclidean metric on the unit circle and its transforms under the Möbius group. These visual metrics can be defined in terms of Busemann functions.[31]

In the case of the unit disk, Teichmüller theory implies that the homomorphism carries quasiconformal homeomorphisms of the disk onto the group of quasi-Möbius homeomorphisms of the circle (using for example the Ahlfors–Beurling or Douady–Earle extension): it follows that the homomorphism from the quasi-isometry group into the quasi-Möbius group is surjective.

In the other direction, it is straightforward to prove that the homomorphism is injective.[32] In fact suppose that f is a quasi-isometry of the unit disk such that ∂f is the identity. The assumption and the Morse lemma implies that if γ(R) is a geodesic line, then f(γ(R)) lies in an h-neighbourhood of γ(R). Now take a second geodesic line δ such that δ and γ intersect orthogonally at a given point in a. Then f(a) lies in the intersection of h-neighbourhoods of δ and γ. Applying a Möbius transformation, it can be assumed that a is at the origin of the unit disk and the geodesics are the real and imaginary axes. By convexity, the h-neighbourhoods of these axes intersect in a 3h-neighbourhood of the origin: in fact if z lies in both neighbourhoods, let x and y be the orthogonal projections of z onto the x- and y-axes; then d(z,x) ≤ h so taking projections onto the y-axis, d(0,y) ≤ h; hence d(z,0) ≤ d(z,y) + d(y,0) ≤ 2h. Hence d(a,f(a)) ≤ 2h, so that f is quasi-equivalent to the identity, as claimed.

### Cross ratio and distance between non-intersecting geodesic lines

Given two distinct points z, w on the unit circle or real axis there is a unique hyperbolic geodesic [z,w] joining them. It is given by the circle (or straight line) which cuts the unit circl unit circle or real axis orthogonally at those two points. Given four distinct points a, b, c, d in the extended complex plane their cross ratio is defined by

${\displaystyle (a,b;c,d)={(a-c)(b-d) \over (a-d)(b-c)}.}$

If g is a complex Möbius transformation then it leaves the cross ratio invariant: (g(a),g(b);g(c),g(d)) = (a,b:c,d). Since the Möbius group acts simply transitively on triples of points, the cross ratio can alternatively be described as the complex number z in C\{0,1} such that g(a) = 0, g(b) = 1, g(c) = λ, g(d) = ∞ for a Möbius transformation g.

Since a, b, c and d all appear in the numerator defining the cross ratio, to understand the behaviour of the cross ratio under permutations of a, b, c and d, it suffices to consider permutations that fix d, so only permute a, b and c. The cross ratio transforms according to the anharmonic group of order 6 generated by the Möbius transformations sending λ to 1 – λ and λ−1. The other three transformations send λ to 1 – λ−1, to λ(λ – 1)−1 and to (1 – λ)−1.[33]

Now let a, b, c, d be points on the unit circle or real axis in that order. Then the geodesics [a,b] and [c,d] do not intersect and the distance between these geodesics is well defined: there is a unique geodesic line cutting these two geodesics orthogonally and the distance is given by the length of the geodesic segment between them. It is evidently invariant under real Möbius transformations. To compare the cross ratio and the distance between geodesics, Möbius invariance allows the calculation to be reduced to a symmetric configuration. In fact for 0 < r < R, take a = –R, b = −r, c = r, d = R. Then λ = (a,b;c,d) = (R + r)2/4rR = (t + 1)2/4t where t = R/r > 1. On the other hand, the geodesics [a,d] and [b,c] are the semicircles in the upper half plane of radius r and R. The geodesic which cuts them orthogonally is the positive imaginary axis, so the distance between them is the hyperbolic distance between ir and iR, d(ir,iR) = log R/r = log t. Let s = log t, then λ = cosh2(s/2), so that there is a constant C > 0 such that, if (a,b;c,d) > 1, then

${\displaystyle d([a,d];[b,c])-C\leq \log(a,b;c,d)\leq d([a,d];[b,c])+C,}$

since log[cosh(x)/expx)] = log (1 + exp(–2x))/2 is bounded above and below in x ≥ 0. Note that a, b,c, d are in order around the unit circle if and only if (a,b;c,d) > 1.

A more general and precise geometric interpretation of the cross ratio can be given using projections of ideal points on to a geodesic line; it does not depend on the order of the points on the circle and therefore whether or not geodesic lines intersect.[34]

• If p and q are the feet of the perpendiculars from c and d to the geodesic line ab, then d(p,q) = |log |(a,b;c,d)||.

Since both sides are invariant under Möbius transformations, it suffices to check this in the case that a = 0, b = ∞, c = x and d = 1. In this case the geodesic line is the positive imaginary axis, right hand side equals |log |x||, p = |x|i and q = i. So the left hand side equals |log|x||. Note that p and q are also the points where the incircles of the ideal triangles abc and abd touch ab.

### Proof of theorem

A homeomorphism F of the circle is quasisymmetric if there are constants a, b > 0 such that

${\displaystyle \displaystyle {{|F(z_{1})-F(z_{2})| \over |F(z_{1})-F(z_{3})|}\leq a{|z_{1}-z_{2}|^{b} \over |z_{1}-z_{3}|^{b}}.}}$

It is quasi-Möbius is there are constants c, d > 0 such that

${\displaystyle \displaystyle {|(F(z_{1}),F(z_{2});F(z_{3}),F(z_{4}))|\leq c|(z_{1},z_{2};z_{3},z_{4})|^{d},}}$

where

${\displaystyle \displaystyle {(z_{1},z_{2};z_{3},z_{4})={(z_{1}-z_{3})(z_{2}-z_{4}) \over (z_{2}-z_{3})(z_{1}-z_{4})}}}$

denotes the cross-ratio.

It is immediate that quasisymmetric and quasi-Möbius homeomorphisms are closed under the operations of inversion and composition.

If F is quasisymmetric then it is also quasi-Möbius, with c = a2 and d = b: this follows by multiplying the first inequality for (z1,z3,z4) and (z2,z4,z3). Conversely any quasi-Möbius homeomorphism F is quasisymmetric. To see this, it can be first be checked that F (and hence F−1) is Hölder continuous. Let S be the set of cube roots of unity, so that if ab in S, then |ab| = 2 sin π/3 = 3. To prove a Hölder estimate, it can be assumed that xy is uniformly small. Then both x and y are greater than a fixed distance away from a, b in S with ab, so the estimate follows by applying the quasi-Möbius inequality to x, a, y, b. To verify that F is quasisymmetric, it suffices to find a uniform upper bound for |F(x) − F(y)| / |F(x) − F(z)| in the case of a triple with |xz| = |xy|, uniformly small. In this case there is a point w at a distance greater than 1 from x, y and z. Applying the quasi-Möbius inequality to x, w, y and z yields the required upper bound. To summarise:

• A homeomorphism of the circle is quasi-Möbius if and only if it is quasisymmetric. In this case it and its inverse are Hölder continuous. The quasi-Möbius homeomorphisms form a group under composition.[35]

To prove the theorem it suffices to prove that if F = ∂f then there are constants A, B > 0 such that for a, b, c, d distinct points on the unit circle[36]

${\displaystyle \displaystyle {|(F(a),F(b);F(c),F(d))|\leq A|(a,b;c,d)|^{B}.}}$

It has already been checked that F (and is inverse) are continuous. Composing f, and hence F, with complex conjugation if necessary, it can further be assumed that F preserves the orientation of the circle. In this case, if a,b, c,d are in order on the circle, so too are there images under F; hence both (a,b;c,d) and (F(a),F(b);F(c),F(d)) are real and greater than one. In this case

${\displaystyle (F(a),F(b);F(c),F(d))\leq A(a,b;c,d)^{B}.}$

To prove this, it suffices to show that log (F(a),F(b);F(c),F(d)) ≤ B log (a,b;c,d) + C. From the previous section it suffices show d([F(a),F(b)],[F(c),F(d)]) ≤ P d([a,b],[c,d]) + Q. This follows from the fact that the images under f of [a,b] and [c,d] lie within h-neighbourhoods of [F(a),F(b)] and [F(c),F(d)]; the minimal distance can be estimated using the quasi-isometry constants for f applied to the points on [a,b] and [c,d] realising d([a,b],[c,d]).

Adjusting A and B if necessary, the inequality above applies also to F−1. Replacing a, b, c and d by their images under F, it follows that

${\displaystyle A^{-1}|(a,b;c,d)|^{-B}\leq |(F(a),F(b);F(c),F(d))|\leq A|(a,b;c,d)|^{B}}$

if a, b, c and d are in order on the unit circle. Hence the same inequalities are valid for the three cyclic of the quadruple a, b, c, d. If a and b are switched then the cross ratios are sent to their inverses, so lie between 0 and 1; similarly if c and d are switched. If both pairs are switched, the cross ratio remains unaltered. Hence the inequalities are also valid in this case. Finally if b and c are interchanged, the croos ratio changes from λ to λ–1 (λ – 1) = 1 – λ–1, which lies between 0 and 1. Hence again the same inequalities are valid. It is easy to check that using these transformations the inequlaties are valid for all possible permutations of a, b, c and d, so that F and its inverse are quasi-Möbius homeomorphisms.

## Busemann functions and visual metrics for CAT(-1) spaces

Busemann functions can be used to determine special visual metrics on the class of CAT(-1) spaces. These are complete geodesic metric spaces in which the distances between points on the boundary of a geodesic triangle are less than or equal to the comparison triangle in the hyperbolic upper half plane or equivalently the unit disk with the Poincaré metric. In the case of the unit disk the chordal metric can be recovered directly using Busemann functions Bγ and the special theory for the disk generalises completely to any proper CAT(-1) space X. The hyperbolic upper half plane is a CAT(0) space, as lengths in a hyperbolic geodesic triangle are less than lengths in the Euclidean comparison triangle: in particular a CAT(-1) space is a CAT(0) space, so the theory of Busemann functions and the Gromov boundary applies. From the theory of the hyperbolic disk, it follows in particular that every geodesic ray in a CAT(-1) space extends to a geodesic line and given two points of the boundary there is a unique geodesic γ such that has these points as the limits γ(±∞). The theory applies equally well to any CAT(−κ) space with κ > 0 since these arise by scaling the metric on a CAT(-1) space by κ−1/2. On the hyperbolic unit disk D quasi-isometries of D induce quasi-Möbius homeomorphisms of the boundary in a functorial way. There is a more general theory of Gromov hyperbolic spaces, a similar statement holds, but with less precise control on the homeomorphisms of the boundary.[14][15]

## Applications in percolation theory

More recently Busemann functions have been used by probabilists to study asymptotic properties in models of first-passage percolation[37][38] and directed last-passage percolation.[39]

## Notes

1. ^ Busemann 1955, p. 131
2. ^ Bridson & Haefliger 1999, p. 273
3. ^ a b c d Ballmann, Gromov & Schroeder 1985
4. ^ Bridson & Haefliger 1999, pp. 268–269
5. ^ Lurie 2010, p. 13
6. ^ Bridson & Haefliger 1999, pp. 271–272
7. ^ a b c Bridson & Haefliger 1999, pp. 271–272
8. ^ Dal'bo, Peigné & Sambusetti 2012, pp. 94–96
9. ^ Bridson & Haefliger 1999, pp. 260–276
10. ^ Ballmann 1995, pp. 27–30
11. ^ Bridson & haefliger 1999, pp. 271–272
12. ^ In geodesic normal coordinates, the metric g(x) = I + ε ||x||. By geodesic convexity, a geodesic from p to q lies in the ball of radius r = max ||p||, ||q||. The straight line segment gives an upper estimate for d(p,q) of the stated form. To obtain a similar lower estimate, observe that if c(t) is a smooth path from p to q, then L(c) ≥ (1 − ε r) ⋅ ∫ || c' || dt ≥ (1 − ε r) ⋅ ||pq||. (Note that these inequalities can be improved using the sharper estimate g(x) = I + ε ||x||2.)
13. ^ Note that a metric space X which is complete and locally compact need not be proper, for example R with the metric d(x,y) = |xy|/(1 + |xy|). On the other hand, by the Hopf-Rinow theorem for metric spaces, if X is complete, locally compact and geodesic—every two points x and y are joined by a geodesic parametrised by arclength—then X is proper (see Bridson & Haefliger 1999, pp. 35–36). Indeed if not, there is a point x in X and a closed ball K = B(x,r) maximal subject to being compact; then, since by hypothesis B(x,R) is non-compact for each R > r, a diagonal argument shows that there is a sequence (xn) with d(x,xn) decreasing to r but with no convergent subsequence; on the other hand taking yn on a geodesic joining x and xn, with d(x,yn) = r, compactness of K implies (yn), and hence (xn), has a convergent subsequence, a contradiction.
14. ^ a b c Bourdon 1995
15. ^ a b c Buyalo & Schroeder 2007
16. ^ Mostow 1073
17. ^ Roe 2003
18. ^ Buyalo & Schroeder 2007, pp. 1–6
19. ^ Bridson & Haefliger 1999, pp. 399–405
20. ^ Kapovich 2001, pp. 51–52
21. ^ Morse 1924
22. ^ Ratcliffe 2006, pp. 580–599
23. ^ Kapovich 2001, p. 51
24. ^ Ratcliffe 2006, p. 583, Lemma 4
25. ^ Ratcliffe 2006, pp. 584–586, Lemmas 5–6
26. ^ Kapovich 2001, p. 52
27. ^ Bi-Lipschitz homeomorphisms are those for which they and their inverses are Lipschitz continuous
28. ^ See:
29. ^ See:
30. ^ Bridson & Haefliger 1999, pp. 430–431
31. ^ See:
32. ^ Roe 2003, p. 113
33. ^ Beardon 1983, pp. 75–78 Note that there is a natural homomorphism of S4 onto S3, acting by conjugation on (a,b)(c,d), (a,c)(b,d) and (a,d)(b,c). Indeed these permutations together with the identty form a normal Abelian subgroup equal to its own centraliser: the action of S4 by conjugation on the non-trivial elements yields the homomorphism onto S3.
34. ^ See:
35. ^ Väisälä 1984
36. ^ Bourdon 2009
37. ^ Hoffman 2005
38. ^ Damron & Hanson 2014
39. ^ Georgiou, Rassoul-Agha & Seppäläinen 2016