The characteristic function is the Fourier transform of a probability law, and it converts the two hardest operations on random variables into easy ones. Adding independent variables becomes multiplying their transforms, and convergence in distribution becomes pointwise convergence of the transforms, so the central limit theorem reduces to a Taylor expansion. This post develops the characteristic function from its definition through the inversion formula and the Levy continuity theorem, on the probability space and using the independence of the previous posts [1], [2].
#Definition and elementary properties
The characteristic function of a random variable is , defined for every real .
The expectation exists because is bounded, , so the characteristic function is defined for every law without integrability assumptions. This is its structural advantage over the moment generating function, which requires to be finite near .
The characteristic function satisfies and , is uniformly continuous on , and for independent and obeys .
At , , and . For uniform continuity, , a bound independent of , and with pointwise as , so the dominated convergence theorem sends , which is uniform continuity. For the product rule, independence makes and independent, so the factorisation of expectations applied to real and imaginary parts gives .
The product rule is the reason characteristic functions suit sums. The distribution of a sum of independent variables, a convolution that is awkward to compute directly, becomes a pointwise product of transforms.
#Moments
Differentiating under the expectation reads the moments of off the derivatives of at the origin.
If , then is times continuously differentiable with , so , and
The difference quotient converges to as and is bounded in modulus by , since . When the dominated convergence theorem lets the derivative pass inside, . Iterating times, with dominator integrable by hypothesis, gives , continuous in by dominated convergence, and at equal to . The expansion Equation (1) is Taylor's theorem applied to the times differentiable at the origin with these derivatives.
#The inversion formula and uniqueness
The characteristic function determines the law. The proof is an explicit formula recovering the probability of an interval from the transform.
For with ,
Write and insert it into the integral. The integrand is bounded by on , since , so the finite-measure Fubini theorem permits exchanging the integrals,
The inner expression collapses to a sine integral because, after division by , the cosine parts of carry a factor that is odd in and integrate to zero over the symmetric range, while the sine parts are even and survive. The Dirichlet integral as , with the partial integrals bounded uniformly in and . So the inner expression is bounded and converges to , which equals on , at , and outside . The bounded convergence theorem sends the right side of Equation (3) to , which under the continuity hypothesis is .
Two random variables with the same characteristic function have the same law.
The formula Equation (2) determines from for every that are not atoms. The non-atoms are all but countably many points, hence dense, so the distribution function is determined at a dense set of points and, being right-continuous, everywhere. The law is determined by its distribution function.
#Convergence in distribution and tightness
A sequence of laws converges in distribution, written , when at every continuity point of . The characteristic functions control this convergence, but only once tightness prevents mass from escaping to infinity, which the next lemma controls by bounding the tail mass with the characteristic function near the origin.
For every , .
By change of variables and Fubini,
the inner integral being . The integrand is nonnegative, and where it satisfies . Restricting the integral to and using this bound gives the right side at least .
The estimate says a characteristic function close to near the origin forces the law to concentrate, because small on bounds the tail mass beyond .
#The Levy continuity theorem
Let have characteristic functions . If for every and is continuous at , then is the characteristic function of a law and .
First, tightness. Fix . Since and is continuous at , choose with . The integrands are bounded by and converge pointwise to , so by bounded convergence the integrals converge and for all large . By Lemma 6, for those , and enlarging to cover the finitely many remaining makes the family tight.
Now take any subsequence. By Helly's selection theorem, every sequence of distribution functions has a further subsequence converging at continuity points to a nondecreasing right-continuous limit , obtained by a diagonal extraction of convergent values on the rationals followed by right-continuous interpolation. Tightness forces to be a genuine distribution function, with no mass lost to . Along that subsequence for the law of , and convergence in distribution with the uniformly bounded continuous integrands gives . But by hypothesis, so , and by Corollary 5 every subsequential limit is the same law with characteristic function . A sequence all of whose subsequences have a further subsequence converging to the same limit converges to that limit, so .
The Levy continuity theorem is the analytic engine of the central limit theorem. To prove a sum of independent variables converges in distribution to a Gaussian, one shows its characteristic function, a product of the individual transforms by Proposition 2, converges pointwise to , the transform of the standard normal, and the continuity theorem converts that pointwise convergence into the distributional limit. The expansion Equation (1) supplies the pointwise limit through a second-order Taylor approximation of each factor; that computation is carried out in the central limit theorem post. The characteristic function thereby reduces a statement about the shape of a distribution to a calculation with an ordinary function of a real variable.