The Gaussian distribution is the fixed point of linear operations. A linear image of a Gaussian is Gaussian, a sum of independent Gaussians is Gaussian, and the limit of normalised sums is Gaussian, which is why it is the universal law of aggregated randomness. Measure-theoretically a Gaussian is determined by two numbers, its mean and its variance, and a Gaussian vector by a mean vector and a covariance matrix, the minimal second-order data. This post builds the Gaussian vector and the Gaussian process from the characteristic function of the previous post [1], [2].
#The Gaussian characteristic function
A random variable is standard normal, written , when it has density . A variable is Gaussian , including the degenerate constant when .
The standard normal has characteristic function , and has .
Differentiating under the integral, licensed because , gives . Integrating by parts with moves the onto the exponential. The boundary term vanishes because and at , leaving . With this linear equation has the unique solution . For , .
The quadratic exponent is the signature of the Gaussian. The transform of is an exponential of a quadratic in , and reading off the coefficients recovers the mean and variance.
#Gaussian vectors
The right definition of a Gaussian vector asks that the distribution survive every projection to a line.
A random vector is Gaussian when every linear combination is a univariate Gaussian. Its mean is and its covariance is the matrix .
The covariance is symmetric and positive semidefinite, since . Mean and covariance are all the data there is.
A Gaussian vector has characteristic function , so its law is determined by and alone.
For fixed the scalar is Gaussian by definition, with mean and variance . Evaluating its scalar characteristic function at argument , by Proposition 2. The characteristic function depends only on and . If two laws on share a characteristic function, then for every the projections have equal scalar characteristic functions along the ray, so by the uniqueness theorem all one-dimensional projections coincide, and by the Cramer-Wold device the projections determine the joint law. Hence the law depends only on and .
In the Gaussian world the second moment is the whole story, and that collapses the distinction between independence and zero correlation.
Independence always implies zero covariance. Conversely, if the cross-covariance block is zero, then is block diagonal, so the quadratic form splits, , and the characteristic function factors, . Each factor is a genuine marginal characteristic function, because a subvector is itself Gaussian (any is a linear combination of all coordinates of , hence univariate Gaussian by Definition 3) with covariance the leading block , so by Theorem 4 its characteristic function is , the first factor, and likewise for . A joint characteristic function that factors into the marginals is the product law, so the subvectors are independent.
In general zero correlation is weaker than independence; the Gaussian collapses the two, which is what makes it the tractable model of dependence.
Conversely, any mean and any positive semidefinite covariance are realised by some Gaussian vector.
For every and symmetric positive semidefinite , there is a Gaussian vector with mean and covariance .
Being symmetric positive semidefinite, factors as , for instance through its spectral decomposition with . Let have independent standard normal coordinates and set . Then and , and every is a linear combination of independent Gaussians, hence Gaussian, so is a Gaussian vector with the required mean and covariance.
#Gaussian processes
A process is a family of random variables indexed by a parameter, usually time. The Gaussian property is imposed on every finite subfamily at once.
A Gaussian process is a family of random variables for which every finite vector is a Gaussian vector. It is described by its mean function and its covariance function .
The covariance function is symmetric and positive semidefinite, meaning every matrix is positive semidefinite, being the covariance of . These two functions determine the process, and conversely any such pair is realised.
For every function on and every symmetric positive semidefinite kernel on , there is a Gaussian process with mean function and covariance function , unique in law.
For each finite set define and , which is symmetric positive semidefinite by hypothesis, and let be the Gaussian law of mean and covariance from Proposition 6. This family is consistent, since a marginal of a Gaussian over a subset of coordinates is the Gaussian with the corresponding subvector mean and submatrix covariance, which is exactly on the smaller index set, and permuting indices permutes the law correspondingly. The Kolmogorov extension theorem, which builds a measure on the product space from a consistent family of finite-dimensional laws by Caratheodory extension from the algebra of cylinder sets, then produces a process with these finite-dimensional distributions, Gaussian by construction. Uniqueness in law holds because the finite-dimensional distributions determine the law on the cylinder sigma-algebra, again an intersection-closed generating system.
The covariance function carries the entire second-order structure, and when the index set is an interval and the kernel is continuous, it is exactly the Mercer kernel of the previous track. The Karhunen-Loeve expansion applies the spectral decomposition of that kernel to the process itself, writing with independent standard Gaussian coefficients , where the eigenvalues are the variances of the modes and the eigenfunctions their shapes. Brownian motion is the Gaussian process with and , and its construction, the foundation of stochastic calculus, is the next step.