Bessel's inequality showed that the sum of squared coordinates of a vector against an orthonormal family never exceeds its squared length. This post asks when they spend all of it. An orthonormal family large enough that no length is wasted is a basis, and against it every vector becomes the sum of its coordinates, a single structure that contains Fourier series, eigenfunction expansions, and the Karhunen-Loeve expansion as special cases. The setting is a Hilbert space , where completeness is what lets an infinite orthogonal series converge to an actual vector [1], [2]. We assume is separable, so that a countable orthonormal family suffices, and we take the scalars to be real, so that every coordinate appears squared rather than modulus-squared; over each becomes and the proofs go through verbatim.
#Convergence of orthogonal series
The first question is which infinite combinations of an orthonormal family make sense. Pythagoras answers it exactly.
Let be orthonormal in and real scalars. The series converges in if and only if , and then .
Let . For the increment has, by Pythagoras, squared norm . Hence is Cauchy in if and only if the partial sums of are Cauchy in , that is if and only if . Completeness of turns the Cauchy condition into convergence, and the norm identity follows from by letting , the norm being continuous by Cauchy-Schwarz.
For any , Bessel's inequality gives , so by Proposition 1 the series always converges, whether or not is a basis. Write and for its limit. The residual is orthogonal to every .
For every , the residual satisfies for all .
The map is continuous by Cauchy-Schwarz (the inner-product-spaces post), so it passes through the series, giving by orthonormality. Then .
#The basis conditions
An orthonormal family is a basis when it leaves no length unaccounted for. Several precise statements of that idea coincide.
For a countable orthonormal family in a Hilbert space , the following are equivalent. First, the closed linear span of is all of . Second, for every . Third, the Parseval identity holds for every . Fourth, the only vector orthogonal to every is . A family with these properties is an orthonormal basis.
For a finite family this is the elementary finite-dimensional statement, with the orthogonal projection onto the closed subspace and Parseval the finite Pythagoras, so the same cycle holds with finite sums; assume henceforth the family is infinite. Write for the closed span and recall , the limit of partial sums lying in .
First implies second. If , then . By Lemma 2 the residual is orthogonal to every , hence to their span, hence to its closure by continuity. A vector of orthogonal to is orthogonal to itself, so and .
Second implies third. If , then Proposition 1 gives .
Third implies fourth. If for all , the Parseval identity gives , so .
Fourth implies first. For any the residual is orthogonal to every by Lemma 2, so by the fourth condition , putting . Thus .
Since is orthogonal to every , it is orthogonal to their span and, by continuity of the inner product, to its closure ; as , the vector is the orthogonal projection of onto .
The fourth condition is maximality. An orthonormal family is a basis exactly when it cannot be extended, since any unit vector orthogonal to all of it could be appended. The Parseval identity is the infinite-dimensional Pythagoras, splitting a vector's squared length into the squares of its coordinates with nothing left over, and the second condition is the expansion that makes the coordinates a faithful description of the vector.
#Existence by Gram-Schmidt
A basis is only useful if one exists. In a separable space it always does, manufactured from a dense sequence by orthonormalising it.
Every separable Hilbert space has an orthonormal basis, finite or countably infinite.
Separability gives a countable dense sequence . Discard each that lies in the span of its predecessors, leaving a subsequence that is linearly independent with the same span. Apply the Gram-Schmidt process, setting and inductively
The vector is nonzero because is independent of , whose span equals that of , and is orthogonal to each with by construction, so is orthonormal. At every stage the span of equals that of , so the closed span of contains the closure of the span of , which is by density. By Theorem 3 the family is a basis.
#The isometry with l-squared
A basis identifies the abstract space with the concrete sequence space. Sending a vector to its coordinates is an isometry, and completeness makes it onto.
Let be an orthonormal basis of an infinite-dimensional separable Hilbert space . The coordinate map is an isometric isomorphism of onto .
Linearity of is linearity of the inner product in . The Parseval identity of Theorem 3 reads , so preserves the norm, which makes it injective. For surjectivity take any . Since , Proposition 1 makes converge in , and by the computation in Lemma 2, so . Thus is a norm-preserving linear bijection, an isometric isomorphism.
Every separable infinite-dimensional Hilbert space is therefore a copy of . A finite-dimensional space is the elementary case, a copy of Euclidean under the coordinate map on a finite basis, where surjectivity is immediate and no convergence argument is needed. The choice of basis is the only freedom, and the right basis makes a problem transparent. The Fourier basis diagonalises translation and differentiation, and the eigenvectors of a compact self-adjoint operator, constructed in the spectral theorem to come, form the basis in which that operator becomes a diagonal multiplication. The Karhunen-Loeve expansion is exactly this last construction applied to the covariance operator of a process, producing the orthonormal basis in which the process has uncorrelated coordinates. A basis is the coordinate system in which the structure of the problem is laid bare.