Skip to content
homeaboutworkprojectsthesiswritingresume
Loading
~/blog/projection-and-riesz0%dark
  1. home/
  2. writing/
  3. Projection and Riesz Representation

28 May 2026 · 5 min read · updated 09 June 2026

Projection and Riesz Representation

A Hilbert space earns its power from two theorems. The projection theorem says a nonempty closed convex set contains a unique point nearest any given point, and for a closed subspace the nearest point is characterised by an orthogonal residual, giving the decomposition of the space into a subspace and its orthogonal complement. The Riesz representation theorem then identifies every bounded linear functional with an inner product against a fixed vector. Both rest on completeness and the parallelogram law, and together they underlie conditional expectation, the Radon-Nikodym theorem, and the spectral theory of operators.

  • 1 equation
  • 8 results
  • 8 connections
  • functional-analysis
  • hilbert-space
  • projection
On this page▾
  • The projection theorem
  • Orthogonal complement and decomposition
  • Riesz representation

5 min left

  • The projection theorem2m
  • Orthogonal complement and decomposition1m
  • Riesz representation2m

The geometry of an inner product and the completeness of L-squared combine into two theorems that nothing in Hilbert space theory does without. The first says one can always project onto a closed convex set, landing on a unique nearest point; for a subspace this is the orthogonal projection. The second says every continuous linear functional on the space is an inner product against a fixed vector, which is the engine behind the Radon-Nikodym theorem and the existence of adjoints. Both turn on completeness [1]. Throughout, HHH is a real Hilbert space.

#The projection theorem

A set CCC is convex when it contains the segment between any two of its points, that is tx+(1−t)y∈Ctx+(1-t)y\in Ctx+(1−t)y∈C for all t∈[0,1]t\in[0,1]t∈[0,1] whenever x,y∈Cx,y\in Cx,y∈C. The proof below uses only the midpoint 12(x+y)∈C\tfrac12(x+y)\in C21​(x+y)∈C. Closedness and convexity together force a unique nearest point.

Theorem1

Let C⊆HC\subseteq HC⊆H be nonempty, closed, and convex. For every x∈Hx\in Hx∈H there is a unique p∈Cp\in Cp∈C minimising ∥x−c∥\norm{x-c}∥x−c∥ over c∈Cc\in Cc∈C.

Proof

Let d=inf⁡c∈C∥x−c∥d=\inf_{c\in C}\norm{x-c}d=infc∈C​∥x−c∥ and take a sequence cn∈Cc_n\in Ccn​∈C with ∥x−cn∥→d\norm{x-c_n}\to d∥x−cn​∥→d. Apply the parallelogram law to x−cnx-c_nx−cn​ and x−cmx-c_mx−cm​,

∥cn−cm∥2=2∥x−cn∥2+2∥x−cm∥2−4∥x−cn+cm2∥2.(1)\norm{c_n-c_m}^2=2\norm{x-c_n}^2+2\norm{x-c_m}^2-4\Big\|x-\tfrac{c_n+c_m}{2}\Big\|^2. \tag{1}∥cn​−cm​∥2=2∥x−cn​∥2+2∥x−cm​∥2−4​x−2cn​+cm​​​2.(1)

The midpoint 12(cn+cm)\tfrac12(c_n+c_m)21​(cn​+cm​) lies in CCC by convexity, so its distance from xxx is at least ddd, and the last term is at most −4d2-4d^2−4d2, giving 0≤∥cn−cm∥2≤2∥x−cn∥2+2∥x−cm∥2−4d20\le\norm{c_n-c_m}^2\le 2\norm{x-c_n}^2+2\norm{x-c_m}^2-4d^20≤∥cn​−cm​∥2≤2∥x−cn​∥2+2∥x−cm​∥2−4d2. As n,m→∞n,m\to\inftyn,m→∞ the upper bound tends to 2d2+2d2−4d2=02d^2+2d^2-4d^2=02d2+2d2−4d2=0, so lim sup⁡n,m∥cn−cm∥2≤0\limsup_{n,m}\norm{c_n-c_m}^2\le 0limsupn,m​∥cn​−cm​∥2≤0, which with ∥cn−cm∥2≥0\norm{c_n-c_m}^2\ge 0∥cn​−cm​∥2≥0 forces ∥cn−cm∥2→0\norm{c_n-c_m}^2\to 0∥cn​−cm​∥2→0. Hence (cn)(c_n)(cn​) is Cauchy and, by completeness, converges to some ppp, which lies in CCC because CCC is closed, with ∥x−p∥=lim⁡∥x−cn∥=d\norm{x-p}=\lim\norm{x-c_n}=d∥x−p∥=lim∥x−cn​∥=d. If p′p'p′ is another minimiser, the same identity applied to x−px-px−p and x−p′x-p'x−p′ gives ∥p−p′∥2≤2d2+2d2−4d2=0\norm{p-p'}^2\le 2d^2+2d^2-4d^2=0∥p−p′∥2≤2d2+2d2−4d2=0, so p′=pp'=pp′=p.

For a subspace the nearest point has a clean characterisation. The residual x−px-px−p is orthogonal to the subspace.

Theorem2

Let M⊆HM\subseteq HM⊆H be a closed subspace. The nearest point p=PMxp=P_M xp=PM​x is the unique element of MMM with x−p⊥Mx-p\perp Mx−p⊥M, and the map PMP_MPM​ is linear, idempotent, and satisfies ∥PMx∥≤∥x∥\norm{P_M x}\le\norm{x}∥PM​x∥≤∥x∥.

Proof

A subspace is convex, so Theorem 1 gives a unique nearest ppp. For any m∈Mm\in Mm∈M and t∈Rt\in\Rt∈R the point p+tm∈Mp+tm\in Mp+tm∈M, so ∥x−p−tm∥2≥∥x−p∥2\norm{x-p-tm}^2\ge\norm{x-p}^2∥x−p−tm∥2≥∥x−p∥2, which expands to −2t⟨x−p,m⟩+t2∥m∥2≥0-2t\ip{x-p}{m}+t^2\norm{m}^2\ge 0−2t⟨x−p,m⟩+t2∥m∥2≥0 for all ttt, forcing ⟨x−p,m⟩=0\ip{x-p}{m}=0⟨x−p,m⟩=0. Thus x−p⊥Mx-p\perp Mx−p⊥M. Conversely if q∈Mq\in Mq∈M with x−q⊥Mx-q\perp Mx−q⊥M, then for any m∈Mm\in Mm∈M, Pythagoras gives ∥x−m∥2=∥x−q∥2+∥q−m∥2≥∥x−q∥2\norm{x-m}^2=\norm{x-q}^2+\norm{q-m}^2\ge\norm{x-q}^2∥x−m∥2=∥x−q∥2+∥q−m∥2≥∥x−q∥2, so qqq is the nearest point and equals ppp. Linearity follows because x−PMx⊥Mx-P_M x\perp Mx−PM​x⊥M and y−PMy⊥My-P_M y\perp My−PM​y⊥M give (αx+βy)−(αPMx+βPMy)⊥M(\alpha x+\beta y)-(\alpha P_M x +\beta P_M y)\perp M(αx+βy)−(αPM​x+βPM​y)⊥M with αPMx+βPMy∈M\alpha P_M x+\beta P_M y\in MαPM​x+βPM​y∈M, so by uniqueness it is PM(αx+βy)P_M(\alpha x+\beta y)PM​(αx+βy). Idempotence is PMm=mP_M m=mPM​m=m for m∈Mm\in Mm∈M, and Pythagoras on x=PMx+(x−PMx)x=P_M x+(x-P_M x)x=PM​x+(x−PM​x) gives ∥x∥2=∥PMx∥2+∥x−PMx∥2≥∥PMx∥2\norm{x}^2=\norm{P_M x}^2+\norm{x-P_M x}^2\ge\norm{P_M x}^2∥x∥2=∥PM​x∥2+∥x−PM​x∥2≥∥PM​x∥2.

#Orthogonal complement and decomposition

The orthogonal complement of a set SSS is S⊥={y∈H:⟨y,s⟩=0 for all s∈S}S^\perp=\{y\in H:\ip{y}{s}=0\text{ for all }s\in S\}S⊥={y∈H:⟨y,s⟩=0 for all s∈S}, always a closed subspace because it is an intersection of kernels of the continuous functionals ⟨⋅,s⟩\ip{\cdot}{s}⟨⋅,s⟩. Projection splits the space along it.

Corollary3

For a closed subspace MMM, every x∈Hx\in Hx∈H has a unique decomposition x=u+vx=u+vx=u+v with u∈Mu\in Mu∈M and v∈M⊥v\in M^\perpv∈M⊥, namely u=PMxu=P_M xu=PM​x and v=x−PMxv=x-P_M xv=x−PM​x. Thus H=M⊕M⊥H=M\oplus M^\perpH=M⊕M⊥, and (M⊥)⊥=M(M^\perp)^\perp=M(M⊥)⊥=M.

Proof

The decomposition x=PMx+(x−PMx)x=P_M x+(x-P_M x)x=PM​x+(x−PM​x) has PMx∈MP_M x\in MPM​x∈M and x−PMx∈M⊥x-P_M x\in M^\perpx−PM​x∈M⊥ by Theorem 2. If x=u+v=u′+v′x=u+v=u'+v'x=u+v=u′+v′ are two such, then u−u′=v′−vu-u'=v'-vu−u′=v′−v lies in M∩M⊥M\cap M^\perpM∩M⊥, where a vector is orthogonal to itself and hence zero, so the decomposition is unique. The inclusion M⊆(M⊥)⊥M\subseteq(M^\perp)^\perpM⊆(M⊥)⊥ is immediate, and if x∈(M⊥)⊥x\in(M^\perp)^\perpx∈(M⊥)⊥ then x−PMx∈M⊥x-P_M x\in M^\perpx−PM​x∈M⊥ by Theorem 2, while x−PMxx-P_M xx−PM​x also lies in (M⊥)⊥(M^\perp)^\perp(M⊥)⊥ because both xxx and PMx∈M⊆(M⊥)⊥P_M x\in M\subseteq(M^\perp)^\perpPM​x∈M⊆(M⊥)⊥ do, so it is orthogonal to itself and vanishes. Hence x=PMx∈Mx=P_M x\in Mx=PM​x∈M, giving the reverse inclusion.

#Riesz representation

A linear functional φ:H→R\varphi:H\to\Rφ:H→R is bounded when ∣φ(x)∣≤∥φ∥ ∥x∥\abs{\varphi(x)}\le\norm{\varphi}\,\norm{x}∣φ(x)∣≤∥φ∥∥x∥ for a finite constant ∥φ∥\norm{\varphi}∥φ∥, equivalently continuous. Every such functional is an inner product in disguise.

Theorem4

For every bounded linear functional φ\varphiφ on HHH there is a unique y∈Hy\in Hy∈H with φ(x)=⟨x,y⟩\varphi(x)=\ip{x}{y}φ(x)=⟨x,y⟩ for all xxx, and ∥φ∥=∥y∥\norm{\varphi}=\norm{y}∥φ∥=∥y∥.

Proof

If φ=0\varphi=0φ=0 take y=0y=0y=0. Otherwise the kernel N={x:φ(x)=0}N=\{x:\varphi(x)=0\}N={x:φ(x)=0} is a closed proper subspace, so by Corollary 3 the complement N⊥N^\perpN⊥ contains a nonzero vector, which we scale to a unit vector zzz. For any xxx, the vector x−φ(x)φ(z) zx-\dfrac{\varphi(x)}{\varphi(z)}\,zx−φ(z)φ(x)​z lies in NNN, because φ\varphiφ sends it to zero, so it is orthogonal to zzz, giving ⟨x,z⟩=φ(x)φ(z)⟨z,z⟩=φ(x)φ(z)\ip{x}{z}=\dfrac{\varphi(x)}{\varphi(z)}\ip{z}{z} =\dfrac{\varphi(x)}{\varphi(z)}⟨x,z⟩=φ(z)φ(x)​⟨z,z⟩=φ(z)φ(x)​. Hence φ(x)=φ(z)⟨x,z⟩=⟨x,φ(z)z⟩\varphi(x)=\varphi(z)\ip{x}{z}=\ip{x}{\varphi(z)z}φ(x)=φ(z)⟨x,z⟩=⟨x,φ(z)z⟩, so y=φ(z)zy=\varphi(z)zy=φ(z)z represents φ\varphiφ. Uniqueness follows because ⟨x,y−y′⟩=0\ip{x}{y-y'}=0⟨x,y−y′⟩=0 for all xxx forces y=y′y=y'y=y′ on taking x=y−y′x=y-y'x=y−y′. For the norm, Cauchy-Schwarz gives ∣φ(x)∣=∣⟨x,y⟩∣≤∥y∥∥x∥\abs{\varphi(x)}=\abs{\ip{x}{y}}\le \norm{y}\norm{x}∣φ(x)∣=∣⟨x,y⟩∣≤∥y∥∥x∥ so ∥φ∥≤∥y∥\norm{\varphi}\le\norm{y}∥φ∥≤∥y∥, while φ(y)=∥y∥2\varphi(y)=\norm{y}^2φ(y)=∥y∥2 gives the reverse, so ∥φ∥=∥y∥\norm{\varphi}=\norm{y}∥φ∥=∥y∥.

These two theorems generate conditional expectation, the Radon-Nikodym density, and operator adjoints. The orthogonal projection is exactly conditional expectation, the projection of a random variable onto the closed subspace of variables measurable with respect to the conditioning information, and it is the least-squares solution of an overdetermined system. The Riesz representation theorem is the existence half of the Radon-Nikodym theorem, where a density is produced as the vector representing an absolutely continuous functional. It also gives every bounded operator an adjoint, the construction the spectral theorem needs. Completeness was the one nonformal ingredient in both, the property that let the nearest point exist, which is why the Hilbert space, and not merely the inner product space, is the right setting.

[1]
W. Rudin, Functional Analysis, 2nd ed. McGraw-Hill, 1991.

Part 3 of 7 in Hilbert Spaces and Operators

← previousL-squared and Completenessnext →Orthonormal Bases

Explore connections

see in the atlas →

related

  • Conditional Expectation
  • The Radon-Nikodym Theorem
  • Inner Product Spaces

referenced by (2)

  • Bounded Operators and the Adjoint
  • Convex Sets and Functions
cite
@misc{projection-and-riesz,
  author = {Zac Kienzle},
  title  = {Projection and Riesz Representation},
  year   = {2026},
  month  = {05},
  url    = {https://zackienzle.com/blog/projection-and-riesz}
}