A second-order process is a curve in the Hilbert space of square-integrable
random variables, and its calculus is the geometry of that space. We treat the
process through the L-squared inner product, prove mean-square continuity is
equivalent to continuity of the covariance function, construct the mean-square
integral and compute its mean and covariance, and identify the covariance
operator of a process on a compact interval as the Mercer operator of its
covariance kernel. The uncorrelatedness of the Karhunen-Loeve coefficients then
falls out of the eigenstructure of that operator.
A process whose values are square-integrable is a curve in the
Hilbert space L2 of random variables, and the analytic operations on it
(continuity, integration, expansion) are the geometric operations of that space read along the curve. This
mean-square calculus is what makes the covariance function an
operator and the Karhunen-Loeve expansion a spectral decomposition.
This post develops it on the probability space of the track, treating the
inner product ⟨X,Y⟩=E[XY] as the geometry [1]. Write L2=L2(Ω,F,P).
A second-order process(Xt)t∈T is a family of random variables with E[Xt2]<∞ for
every t, so each Xt∈L2. Its mean function is m(t)=E[Xt] and its covariance function is
K(s,t)=Cov(Xs,Xt)=⟨Xs−m(s),Xt−m(t)⟩.
The Cauchy-Schwarz inequality bounds ∣K(s,t)∣≤K(s,s)K(t,t), and the
length of an increment is
read off by expanding the square. Continuity, integrability, and differentiability of the process are all
statements about this increment, hence about the covariance.
A process converges in mean square to X, written Xt→X, when E[(Xt−X)2]→0, that is when
Xt→X in the norm of L2. It is mean-square continuous at t0 when Xt→Xt0 as t→t0.
Theorem2
A second-order process with continuous mean function is mean-square continuous at every point if and only
if its covariance function K is continuous on T×T.
Proof
Suppose K is continuous and m is continuous. By Equation (1), as t→t0 the right side
tends to K(t0,t0)+K(t0,t0)−2K(t0,t0)+0=0, so E[(Xt−Xt0)2]→0 and the process is
mean-square continuous. Conversely, suppose the process is mean-square continuous everywhere. Then the
centred variables Yt=Xt−m(t) form a mean-square continuous curve, since m is continuous, and the
inner product is continuous on L2, so (s,t)↦⟨Ys,Yt⟩=K(s,t) is continuous as a composition of the continuous map (s,t)↦(Ys,Yt) with the continuous
inner product. Thus mean-square continuity forces K continuous.
So a process on an interval has a continuous covariance exactly when it is mean-square continuous, the
hypothesis under which the covariance becomes a Mercer kernel.
Integrating a process over time is taking the limit of Riemann sums in L2. Completeness of L2 makes
the limit exist whenever the covariance is integrable.
Theorem3
Let (Xt)t∈[a,b] be mean-square continuous. The Riemann sums ∑iXti(ti+1−ti) converge
in mean square as the mesh tends to 0 to a limit ∫abXtdt∈L2, with
Mean-square continuity makes m continuous, since ∣m(t)−m(t0)∣=∣E[Xt−Xt0]∣≤∥Xt−Xt0∥L2→0 as the expectation is a bounded linear functional of norm one, and then K is continuous by the converse half of Theorem 2, so g(s,t)=⟨Xs,Xt⟩=K(s,t)+m(s)m(t) is continuous. For two partitions, the inner product of their Riemann sums is the double sum
∑i,j⟨Xsi,Xtj⟩ΔsiΔtj, a Riemann sum for ∫∫g(s,t)dsdt=∫∫(K(s,t)+m(s)m(t))dsdt=:I. Since g is continuous on the compact square [a,b]2 it is uniformly continuous there, so any tagged Riemann sums of g with mesh tending to 0, including those with the mixed tags (si,tj), converge to I. The polarisation
∥Sπ−Sπ′∥2=⟨Sπ,Sπ⟩−2⟨Sπ,Sπ′⟩+⟨Sπ′,Sπ′⟩→I−2I+I=0 then shows every two
sequences of Riemann sums with mesh tending to 0 are Cauchy and share a limit, by
completeness of L2. The mean identity is continuity of the
expectation, a bounded linear functional on L2. The inner product of the limit with itself is the second moment ⟨I,I⟩=∫∫(K+mm); subtracting the squared mean (∫m)2=∫∫m(s)m(t)dsdt leaves the variance identity Var(∫X)=∫∫K(s,t)dsdt.
On a compact interval the covariance function acts as an integral operator, and that operator is exactly
the Mercer operator of the previous track.
Theorem4
Let (Xt)t∈[a,b] be mean-square continuous with continuous covariance K. The covariance operator
(TKf)(s)=∫abK(s,t)f(t)dt,f∈L2([a,b]),(3)
is compact, self-adjoint, and positive, with ⟨TKf,f⟩=Var(∫abf(t)Ytdt)≥0 where
Yt=Xt−m(t).
Proof
The kernel K is continuous and symmetric on the compact square, so by the
Mercer setupTK is compact and self-adjoint. For positivity, take
first a continuous f. Then t↦f(t)Yt is mean-square continuous, so the integral Zf=∫abf(t)Ytdt exists by Theorem 3, and its variance is, by Equation (2) applied
to that process with covariance f(s)K(s,t)f(t),
Var(Zf)=∫ab∫abf(s)K(s,t)f(t)dsdt=⟨TKf,f⟩.(4)
A variance is nonnegative, so ⟨TKf,f⟩≥0 for every continuous f. The quadratic form f↦⟨TKf,f⟩ is continuous on L2, since ∣⟨TKf,f⟩−⟨TKg,g⟩∣≤∥TK∥(∥f∥+∥g∥)∥f−g∥ by boundedness of TK, and the continuous functions are
dense in L2([a,b]), so nonnegativity passes to the closure and
⟨TKf,f⟩≥0 for all f∈L2, making TK positive.
The positivity that was an abstract hypothesis in Mercer's theorem is here a tautology, since the
quadratic form of the covariance operator is the variance of a linear functional of the process. The
spectral theorem therefore gives eigenvalues λn≥0
tending to 0 and continuous orthonormal eigenfunctions φn with TKφn=λnφn.
The eigenfunctions of the covariance operator project the process onto uncorrelated coordinates, a one-line
consequence of the eigenrelation and the content of the Karhunen-Loeve expansion.
Proposition5
For λn>0 define the coefficient ξn=λn1∫abYtφn(t)dt. Then E[ξn]=0 and E[ξnξm]=δnm.
Proof
Each ξn is a mean-square integral of the centred process, so E[ξn]=λn1∫E[Yt]φn(t)dt=0. For the cross moment, write Zn=∫abYsφn(s)ds as the L2 limit of Riemann sums; joint continuity of the inner product gives the polarisation of Equation (2),
E[ZnZm]=⟨Zn,Zm⟩=lim∑i,jφn(si)φm(tj)⟨Ysi,Ytj⟩ΔsiΔtj=∫∫φn(s)K(s,t)φm(t)dsdt, the integrand being continuous. Hence
Since TKφm=λmφm and the eigenfunctions are orthonormal, the right side is
λnλmλm∫φnφm=λnλmλmδnm=δnm.
The expansion Xt=m(t)+∑nλnξnφn(t) now assembles from these pieces. By the uncorrelatedness of Proposition 5 a partial-sum increment ∑N<k≤Mλkξkφk(t) has variance ∑N<k≤Mλkφk(t)2, so its
partial sums are L2-Cauchy and converge in mean square uniformly in t, the truncation error having variance
∑k>Nλkφk(t)2, which is the Mercer tail of K(t,t) and vanishes uniformly by
Mercer's theorem. The result is the
Karhunen-Loeve expansion, the representation of a second-order
process as a series of deterministic modes φn with uncorrelated random amplitudes, the eigenvalues
their variances. When the process is Gaussian the uncorrelated coefficients are independent by the
Gaussian equivalence, so the expansion resolves a
Gaussian process into independent one-dimensional Gaussians, the form in which
Brownian motion is constructed in the next post.
[1]
O. Kallenberg, Foundations of Modern Probability, 3rd ed. Springer, 2021.