Skip to content
homeaboutworkprojectsthesiswritingresume
Loading
~/blog/statistical-arbitrage0%dark
  1. home/
  2. writing/
  3. Statistical Arbitrage

02 June 2026 · 7 min read · updated 09 June 2026

Statistical Arbitrage

Pure arbitrage is a property of an equivalence class of measures and is annihilated by any equivalent martingale measure. Statistical arbitrage is instead a statement about the physical measure, constraining only the long-run mean, loss probability, and time-averaged variance of cumulative profit. We give full proofs of the finite fundamental theorem, the sufficiency of i.i.d. positive-mean increments, and the strict separation between the two notions.

  • 4 equations
  • 11 results
  • 7 connections
  • statistical-arbitrage
  • stochastic-finance
  • probability
On this page▾
  • Primitives
  • Arbitrage belongs to the equivalence class
  • The fundamental theorem, proved
  • The definition
  • Sufficiency
  • The hierarchy is strict
  • The word statistical and the role of the physical measure
  • Numerical illustration

7 min left

  • Primitives1m
  • Arbitrage belongs to the equivalence class1m
  • The fundamental theorem, proved2m
  • The definition1m
  • Sufficiency1m
  • The hierarchy is strict1m
  • The word statistical and the role of the physical measure1m
  • Numerical illustration1m

Pure arbitrage cannot encode a drift. It is invariant across an equivalence class of measures, so any equivalent martingale measure annihilates it. Real strategies lose on many paths yet cannot lose on average, a statement about the physical measure P\PP rather than the pricing measure Q\QQ. This post makes that statement precise.

#Primitives

Fix (Ω,F,(Ft)t≥0,P)(\Omega,\Filt,(\Filt_t)_{t\ge 0},\P)(Ω,F,(Ft​)t≥0​,P) under the usual conditions. Prices carry a positive numeraire BtB_tBt​, the money market Bt=ertB_t=e^{rt}Bt​=ert, and a semimartingale StS_tSt​, with discounted price S~t=St/Bt\tilde S_t = S_t/B_tS~t​=St​/Bt​. A strategy is a predictable, S~\tilde SS~-integrable process HHH; its discounted gain is the stochastic integral v(t)=∫0tHu dS~uv(t)=\int_0^t H_u\,d\tilde S_uv(t)=∫0t​Hu​dS~u​, so v(0)=0v(0)=0v(0)=0. Every expectation, variance, and probability is taken under P\PP unless a measure is named.

#Arbitrage belongs to the equivalence class

Definition1

An arbitrage is a strategy with v(0)=0v(0)=0v(0)=0, v(T)≥0v(T)\ge 0v(T)≥0 almost surely, and P(v(T)>0)>0\P(v(T)>0)>0P(v(T)>0)>0.

Lemma2

If Q∼P\Q\sim\PQ∼P, then vvv is an arbitrage under P\PP if and only if it is an arbitrage under Q\QQ.

Proof

Equivalence preserves the semimartingale property and the stochastic integral up to indistinguishability, so v(T)v(T)v(T) is the same random variable under P\PP and Q\QQ. Equivalence also gives identical null sets. The event {v(T)<0}\{v(T)<0\}{v(T)<0} is P\PP-null exactly when it is Q\QQ-null, so v(T)≥0v(T)\ge 0v(T)≥0 holds almost surely under either measure simultaneously, and {v(T)>0}\{v(T)>0\}{v(T)>0} carries positive mass under one measure exactly when it does under the other. Both clauses of Definition 1 are preserved, and the drift of SSS never appears.

By Lemma 2 arbitrage lives in the equivalence class [P][\P][P] and is blind to expected return. The construction below exploits this blindness.

#The fundamental theorem, proved

Let Ω={ω1,…,ωK}\Omega=\{\omega_1,\dots,\omega_K\}Ω={ω1​,…,ωK​} be finite with full support P(ωk)>0\P(\omega_k)>0P(ωk​)>0 for every kkk, and a single trading period spanning the dates 000 and TTT; the multiperiod case iterates over the filtration. Collect discounted price increments in G∈RK×dG\in\R^{K\times d}G∈RK×d, where Gkj=S~T j(ωk)−S~0 jG_{kj}=\tilde S^{\,j}_T(\omega_k)-\tilde S^{\,j}_0Gkj​=S~Tj​(ωk​)−S~0j​. A portfolio h∈Rdh\in\R^dh∈Rd has discounted payoff Gh∈RKGh\in\R^KGh∈RK, and an arbitrage is an hhh with Gh≥0Gh\ge 0Gh≥0 componentwise and Gh≠0Gh\ne 0Gh=0. No arbitrage is therefore exactly

range⁡(G)∩R≥0K={0}.(1)\range(G)\cap\R^K_{\ge 0}=\{0\}. \tag{1}range(G)∩R≥0K​={0}.(1)
Theorem3

(First Fundamental Theorem, finite Ω\OmegaΩ.) No arbitrage holds if and only if there exists q∈RKq\in\R^Kq∈RK with qk>0q_k>0qk​>0, ∑kqk=1\sum_k q_k=1∑k​qk​=1, and q⊤G=0q^\top G=0q⊤G=0. The measure Q=(qk)\Q=(q_k)Q=(qk​) is then equivalent to P\PP and renders discounted prices martingales, EQ[S~T−S~0]=0\E_\Q[\tilde S_T-\tilde S_0]=0EQ​[S~T​−S~0​]=0.

Proof

(⇐)(\Leftarrow)(⇐) Given such q>0q>0q>0, suppose hhh is an arbitrage. Then Gh≥0Gh\ge 0Gh≥0 with one strict component, so q⊤(Gh)>0q^\top(Gh)>0q⊤(Gh)>0, while q⊤(Gh)=(q⊤G)h=0q^\top(Gh)=(q^\top G)h=0q⊤(Gh)=(q⊤G)h=0, a contradiction.

(⇒)(\Rightarrow)(⇒) Assume no arbitrage. The range C:=range⁡(G)C:=\range(G)C:=range(G) is a linear subspace, hence closed and convex, and the unit simplex Δ={p∈R≥0K:∑kpk=1}\Delta=\{p\in\R^K_{\ge0}:\sum_k p_k=1\}Δ={p∈R≥0K​:∑k​pk​=1} is compact and convex. By Equation (1) the two are disjoint, since C∩R≥0K={0}C\cap\R^K_{\ge0}=\{0\}C∩R≥0K​={0} and 0∉Δ0\notin\Delta0∈/Δ. Strict separation yields ψ≠0\psi\neq 0ψ=0 and α\alphaα with

ψ⊤c  ≤  α  <  ψ⊤pfor all c∈C, p∈Δ.(2)\psi^\top c\;\le\;\alpha\;<\;\psi^\top p \qquad\text{for all } c\in C,\ p\in\Delta. \tag{2}ψ⊤c≤α<ψ⊤pfor all c∈C, p∈Δ.(2)

Since CCC is a subspace, ψ⊤c\psi^\top cψ⊤c is linear and bounded above on CCC, which forces ψ⊤c=0\psi^\top c=0ψ⊤c=0 for every c∈Cc\in Cc∈C and hence α≥0\alpha\ge 0α≥0. Evaluating the right inequality of Equation (2) at each vertex p=ekp=e_kp=ek​ gives ψk>0\psi_k>0ψk​>0. Put q=ψ/∑kψkq=\psi/\sum_k\psi_kq=ψ/∑k​ψk​, so q>0q>0q>0 and ∑kqk=1\sum_k q_k=1∑k​qk​=1. Since P\PP has full support, qk>0q_k>0qk​>0 for every kkk makes the null sets of Q\QQ and P\PP coincide, so Q∼P\Q\sim\PQ∼P. Orthogonality ψ⊥C\psi\perp Cψ⊥C reads ψ⊤(Gh)=0\psi^\top(Gh)=0ψ⊤(Gh)=0 for all hhh, that is ψ⊤G=0\psi^\top G=0ψ⊤G=0, whence q⊤G=0q^\top G=0q⊤G=0 and EQ[S~T−S~0]=0\E_\Q[\tilde S_T-\tilde S_0]=0EQ​[S~T​−S~0​]=0 [1], [2].

#The definition

Definition4

A statistical arbitrage is a strategy with v(0)=0v(0)=0v(0)=0 whose discounted cumulative profit, indexed by the completed trade count nnn, satisfies [3]

(i)lim inf⁡n→∞E[v(n)]>0,(ii)lim⁡n→∞P ⁣(v(n)<0)=0,(iii)lim⁡n→∞Var⁡[v(n)]n2=0.(3)\begin{aligned} &\text{(i)} &&\liminf_{n\to\infty}\E[v(n)]>0, \\[2pt] &\text{(ii)} &&\lim_{n\to\infty}\P\!\big(v(n)<0\big)=0, \\[2pt] &\text{(iii)} &&\lim_{n\to\infty}\frac{\Var[v(n)]}{n^{2}}=0. \end{aligned} \tag{3}​(i)(ii)(iii)​​n→∞liminf​E[v(n)]>0,n→∞lim​P(v(n)<0)=0,n→∞lim​n2Var[v(n)]​=0.​(3)

The index nnn runs over the disjoint trade-completion times, so the conditions are limits along that discrete sequence.

Condition (iii) says the variance of the time-averaged profit v(n)/nv(n)/nv(n)/n vanishes. Condition (i) is an expectation under P\PP, a real-world drift rather than a Q\QQ-mispricing, and that is the source of the word statistical.

#Sufficiency

Proposition5

Let disjoint trades produce discounted increments Y1,Y2,…Y_1,Y_2,\dotsY1​,Y2​,… i.i.d. with E[Yi]=μ>0\E[Y_i]=\mu>0E[Yi​]=μ>0 and Var⁡(Yi)=σ2∈(0,∞)\Var(Y_i)=\sigma^2\in(0,\infty)Var(Yi​)=σ2∈(0,∞). Then v(n)=∑i=1nYiv(n)=\sum_{i=1}^n Y_iv(n)=∑i=1n​Yi​ is a statistical arbitrage.

Proof

Write Yˉn=v(n)/n\bar Y_n=v(n)/nYˉn​=v(n)/n, with mean μ\muμ and variance σ2/n\sigma^2/nσ2/n. Condition (i) holds since E[v(n)]=nμ→∞\E[v(n)]=n\mu\to\inftyE[v(n)]=nμ→∞, so lim inf⁡nE[v(n)]=∞>0\liminf_n\E[v(n)]=\infty>0liminfn​E[v(n)]=∞>0. For condition (ii), Chebyshev gives

P ⁣(v(n)<0)=P ⁣(Yˉn−μ<−μ)≤P ⁣(∣Yˉn−μ∣≥μ)≤σ2nμ2,(4)\P\!\big(v(n)<0\big)=\P\!\big(\bar Y_n-\mu<-\mu\big)\le\P\!\big(\lvert\bar Y_n-\mu\rvert\ge\mu\big)\le\frac{\sigma^2}{n\mu^2}, \tag{4}P(v(n)<0)=P(Yˉn​−μ<−μ)≤P(∣Yˉn​−μ∣≥μ)≤nμ2σ2​,(4)

whose right side vanishes. Condition (iii) holds since Var⁡[v(n)]/n2=σ2/n\Var[v(n)]/n^2=\sigma^2/nVar[v(n)]/n2=σ2/n vanishes. Every clause of Definition 4 is met.

The bound Equation (4) is the entire engine. Any increments with positive mean and finite variance inherit the conclusion of Proposition 5.

#The hierarchy is strict

Proposition6

A repeatable, square-integrable arbitrage run on disjoint windows is a statistical arbitrage, and the converse fails.

Proof

A repeatable, square-integrable arbitrage furnishes i.i.d. increments with Yi≥0Y_i\ge 0Yi​≥0, E[Yi]>0\E[Y_i]>0E[Yi​]>0, and Var⁡(Yi)<∞\Var(Y_i)<\inftyVar(Yi​)<∞. Since v(n)≥0v(n)\ge 0v(n)≥0 surely, condition (ii) holds at once; condition (i) holds because E[v(n)]=n E[Y1]>0\E[v(n)]=n\,\E[Y_1]>0E[v(n)]=nE[Y1​]>0; and condition (iii) holds because Var⁡[v(n)]/n2=Var⁡(Y1)/n\Var[v(n)]/n^2=\Var(Y_1)/nVar[v(n)]/n2=Var(Y1​)/n vanishes. For the converse take Yi∼N(μ,σ2)Y_i\sim\mathcal N(\mu,\sigma^2)Yi​∼N(μ,σ2) with μ>0\mu>0μ>0. Here P(Yi<0)>0\P(Y_i<0)>0P(Yi​<0)>0, so vvv fails the almost-sure nonnegativity of Definition 1 and is no arbitrage, yet Proposition 5 certifies it as a statistical arbitrage.

#The word statistical and the role of the physical measure

Remark

Statistical arbitrage, unlike Lemma 2, is not invariant under an equivalent change of measure. Condition (i) demands μ=EP[Yi]>0\mu=\E_\P[Y_i]>0μ=EP​[Yi​]>0, and passing to the martingale measure Q\QQ of Theorem 3 alters the drift. Assume HHH is Q\QQ-admissible, so that vvv is a true Q\QQ-martingale rather than a strict local one; then under Q\QQ the discounted profit satisfies EQ[v(t)]=EQ[v(0)]=0\E_\Q[v(t)]=\E_\Q[v(0)]=0EQ​[v(t)]=EQ​[v(0)]=0, and condition (i) fails. An arbitrage-free market, one carrying an equivalent martingale measure, may therefore still admit statistical arbitrage under P\PP. The drift the pricing measure discards is precisely what separates the two notions.

#Numerical illustration

Simulating i.i.d. Gaussian increments with positive mean makes Equation (3) visible. The mean grows linearly, the loss probability collapses inside the Equation (4) envelope, and the averaged variance vanishes.

import numpy as np
from numpy.random import Generator


def loss_probability_envelope(mu: float, sigma: float, n: int) -> np.ndarray:
    """Chebyshev upper bound on the loss probability of cumulative profit.

    Args:
        mu: Per-trade expected discounted profit, strictly positive.
        sigma: Per-trade profit standard deviation.
        n: Horizon in number of trades.

    Returns:
        The bound sigma**2 / (k * mu**2) for k = 1, ..., n.
    """
    horizon = np.arange(1, n + 1)
    return sigma**2 / (horizon * mu**2)


def simulate_diagnostics(
    mu: float, sigma: float, n: int, paths: int, rng: Generator
) -> dict[str, np.ndarray]:
    """Monte Carlo diagnostics for the three limiting conditions.

    Args:
        mu: Per-trade expected discounted profit, strictly positive.
        sigma: Per-trade profit standard deviation.
        n: Horizon in number of trades.
        paths: Number of independent profit trajectories.
        rng: Seeded generator for reproducibility.

    Returns:
        Mapping of the cumulative-profit mean, empirical loss probability, and
        time-averaged variance against the horizon.
    """
    increments = rng.normal(mu, sigma, size=(paths, n))
    v = np.cumsum(increments, axis=1)
    horizon = np.arange(1, n + 1)
    return {
        "mean": v.mean(axis=0),
        "loss_probability": (v < 0).mean(axis=0),
        "averaged_variance": v.var(axis=0) / horizon**2,
    }


rng = np.random.default_rng(0)
diagnostics = simulate_diagnostics(mu=0.05, sigma=1.0, n=2_000, paths=20_000, rng=rng)
envelope = loss_probability_envelope(mu=0.05, sigma=1.0, n=2_000)

The construction is complete. A market may satisfy the fundamental theorem yet still admit a strategy meeting Definition 4, separated from arbitrage by a drift invisible to every equivalent martingale measure.

[1]
J. M. Harrison and S. R. Pliska, “Martingales and stochastic integrals in the theory of continuous trading,” Stochastic Processes and their Applications, vol. 11, no. 3, pp. 215–260, 1981.
[2]
H. Föllmer and A. Schied, Stochastic Finance: An Introduction in Discrete Time, 4th ed. De Gruyter, 2016.
[3]
S. Hogan, R. Jarrow, M. Teo, and M. Warachka, “Testing market efficiency using statistical arbitrage with applications to momentum and value strategies,” Journal of Financial Economics, vol. 73, no. 3, pp. 525–565, 2004.

Explore connections

see in the atlas →

related

  • The Ornstein-Uhlenbeck Process
  • Conditional Expectation
  • Price Formation in the Order Book

referenced by (1)

  • The Mean-Variance Portfolio
cite
@misc{statistical-arbitrage,
  author = {Zac Kienzle},
  title  = {Statistical Arbitrage},
  year   = {2026},
  month  = {06},
  url    = {https://zackienzle.com/blog/statistical-arbitrage}
}