Price Formation in the Order Book

The companion chapter on the limit order book describes the mechanism, the bids and asks, the price-time priority, the matching of incoming orders against resting depth, and the reading of the book as a supply and demand schedule. This chapter models the prices that mechanism produces. We keep the same notation for the best bid $B_t$ , the best ask $A_t$ , the mid $M_t=\half(A_t+B_t)$ , the spread $s_t=A_t-B_t$ , and the imbalance $I_t$ , and we add the probabilistic structure the models need.

#Primitives

Fix a filtered probability space $(\Omega,\Filt,(\Filt_t)_{t\ge 0},\P)$ satisfying the usual conditions of right-continuity and completeness, where $\Filt_t$ is the information carried by the order flow to time $t$ . Completeness lets us treat almost-surely defined conditional expectations as genuine adapted processes, and right-continuity secures càdlàg modifications so that first-passage and stopping-time arguments are well posed. A market buy prints at $A_t$ , a market sell at $B_t$ , so every print carries a side $q_t\in\{-1,+1\}$ . Every expectation, variance, and covariance is taken under the physical measure $\P$ .

#The efficient price is a martingale

Behind the quotes sits the value the market estimates. A process $(X_t,\Filt_t)$ is a martingale when it is integrable, adapted, and satisfies $\E[X_t\mid\Filt_s]=X_s$ almost surely for $s\le t$ (see the companion martingales chapter), the three clauses the proof below verifies in turn.

Definition1

The efficient price is the conditional expectation of the terminal value $V\in L^1$ given the order flow, $S^\ast_t=\E[V\mid\Filt_t]$ .

Theorem2

The efficient price is an $(\Filt_t)$ -martingale.

Proof

We verify the three clauses. For integrability, conditional Jensen applied to the convex map $x\mapsto\lvert x\rvert$ gives $\lvert S^\ast_t\rvert\le\E[\lvert V\rvert\mid\Filt_t]$ almost surely, and taking expectations with the tower property gives $\E\lvert S^\ast_t\rvert\le\E\lvert V\rvert<\infty$ for every $t$ , so $S^\ast_t\in L^1$ . For adaptedness, $S^\ast_t$ is $\Filt_t$ -measurable by the definition of conditional expectation, and we fix these canonical versions throughout, which is legitimate under completeness. For the martingale identity, the nesting $\Filt_s\subseteq\Filt_t$ and the tower property give

\E[S^\ast_t\mid\Filt_s]=\E\big[\E[V\mid\Filt_t]\mid\Filt_s\big]=\E[V\mid\Filt_s]=S^\ast_s \qquad (s\le t). \tag{1}

All three clauses hold, proving the claim.

Subtracting the $\Filt_s$ -measurable $S^\ast_s$ from Equation (1) gives the increment form

\E[S^\ast_t-S^\ast_s\mid\Filt_s]=0\qquad (s\le t), \tag{2}

so efficient-price increments carry no $\Filt_s$ -forecastable component. This is a theorem, not a hypothesis, because the efficient price is defined as a projection. Among all $\Filt_t$ -measurable square-integrable predictors $Y$ of $V$ , the conditional expectation uniquely minimises $\E[(V-Y)^2]$ ; it is the $L^2$ projection of $V$ onto $L^2(\Filt_t)$ , with orthogonality $\E[(V-S^\ast_t)Z]=0$ for every bounded $\Filt_t$ -measurable $Z$ . Defining the efficient price as this projection is the optimal use of order-flow information, and the martingale property is the microstructure shadow of the no-arbitrage statement that discounted prices are martingales under a pricing measure, here proved under the physical measure from the definition alone.

The observed mid is a noisy reading of $S^\ast$ . Write $M_t=S^\ast_t+\eta_t$ with $\eta_t:=M_t-S^\ast_t$ . Both $M$ and $S^\ast$ are $\Filt_t$ -adapted, so $\eta_t$ is too and $\E[\eta_t\mid\Filt_t]=\eta_t$ , leaving it uninformative against $\Filt_t$ itself. Centering requires a coarser value filtration $\mathcal G_t\subseteq\Filt_t$ that excludes the tick, spread, and queue state, in the sense $\E[\eta_t\mid\mathcal G_t]=0$ , or unconditionally $\E[\eta_t]=0$ . Under either centering the forecastable structure of the mid is carried entirely by the noise, because $\E[M_t-M_s\mid\Filt_s]=\E[\eta_t-\eta_s\mid\Filt_s]$ by Equation (2). The bid-ask bounce and short-horizon mean reversion of the mid are properties of $\eta$ , not of $S^\ast$ , and the empirical content of the decomposition is the centering of $\eta$ , which is falsifiable, not the martingale property of $S^\ast$ , which is a theorem.

#The bid-ask bounce and the spread

A market order buys at the ask and sells at the bid, so consecutive prints bounce across the spread even when the efficient price stands still. Roll's observation is that the bounce leaves a negative signature in the serial covariance of price changes [1].

Theorem3

Let the efficient price be a random walk $m_t=m_{t-1}+u_t$ with $u_t$ i.i.d., mean zero, variance $\sigma^2$ , and let trades print at $p_t=m_t+\half s\,q_t$ with side $q_t\in\{-1,+1\}$ i.i.d., symmetric, and independent of $(u_t)$ . Then

\Cov(\Delta p_t,\Delta p_{t-1})=-\frac{s^2}{4}\le 0, \tag{3}

so the effective spread is $s=2\sqrt{-\Cov(\Delta p_t,\Delta p_{t-1})}$ .

Proof

Differencing the print equation and using $m_t-m_{t-1}=u_t$ ,

\Delta p_t=u_t+\half s\,(q_t-q_{t-1}). \tag{4}

A symmetric $\pm 1$ side has $\E[q_t]=0$ and $\E[q_t^2]=1$ , so $\Var(q_t)=1$ , and by the i.i.d. assumption $\Cov(q_i,q_j)=\one\{i=j\}$ . Expanding the covariance of Equation (4) at consecutive dates by bilinearity gives four blocks. The block $\Cov(u_t,u_{t-1})$ vanishes because $u$ has zero autocovariance. The two cross blocks $\Cov(u_t,q_{t-1}-q_{t-2})$ and $\Cov(q_t-q_{t-1},u_{t-1})$ vanish because $u$ is independent of the sides, which factors each expectation and cancels it. Only the side block survives,

\Cov(\Delta p_t,\Delta p_{t-1})=\frac{s^2}{4}\,\Cov\big(q_t-q_{t-1},\,q_{t-1}-q_{t-2}\big). \tag{5}

Expanding the bilinear form into its four terms,

\Cov(q_t,q_{t-1})-\Cov(q_t,q_{t-2})-\Cov(q_{t-1},q_{t-1})+\Cov(q_{t-1},q_{t-2})=0-0-1+0=-1, \tag{6}

so the side block equals $-s^2/4$ , which is Equation (3). The inversion takes the positive root because $s\ge 0$ , and the population covariance is non-positive, so the root is real.

The drift of $m$ never appears, so the spread is read off price data with no model of returns, and three corollaries sharpen the picture.

Proposition4

Under the assumptions of Theorem 3, the autocovariance of price changes vanishes at every lag $k\ge 2$ , the efficient-price variance is $\sigma^2=\Var(\Delta p_t)-s^2/2$ , and the per-period variance of the $k$ -step return is $\Var(p_t-p_{t-k})/k=\sigma^2+s^2/(2k)$ .

Proof

For $k\ge 2$ the side and innovation indices in $\Delta p_t$ and $\Delta p_{t-k}$ are disjoint, so every block vanishes by independence and the autocovariance is zero. Price changes are therefore an MA(1) process with a single negative autocovariance. For the variance, $\Var(\Delta p_t)=\Var(u_t)+\tfrac{s^2}{4}\Var(q_t-q_{t-1})=\sigma^2+\tfrac{s^2}{4}\cdot 2 =\sigma^2+s^2/2$ , and rearranging recovers $\sigma^2$ . For the $k$ -step return, $p_t-p_{t-k}=(m_t-m_{t-k})+\half s(q_t-q_{t-k})$ splits into independent pieces of variance $k\sigma^2$ and $\tfrac{s^2}{4}\cdot 2=s^2/2$ , so $\Var(p_t-p_{t-k})=k\sigma^2+s^2/2$ and dividing by $k$ gives the stated ratio.

The per-period variance $\Var(p_t-p_{t-k})/k=\sigma^2+s^2/(2k)$ is largest at $k=1$ , where it equals $\sigma^2+s^2/2$ , and decreases monotonically toward the fundamental variance $\sigma^2$ as $k\to\infty$ ; equivalently the variance ratio $\mathrm{VR}(k)=V(k)/V(1)$ starts at one and falls toward $\sigma^2/(\sigma^2+s^2/2)<1$ . Negatively autocorrelated bid-ask bounce gives a variance ratio below one that decays with horizon and plateaus at the long-run level rather than a ratio that rises to one, the volatility signature plot that diagnoses microstructure noise, and $\sigma^2=\Var(\Delta p_t)-2\lvert\gamma_1\rvert$ separates fundamental volatility from the noise variance using only the lag-zero and lag-one autocovariances $\gamma_0,\gamma_1$ . The idealisation that prints land exactly at $m\pm s/2$ makes the recovered $s$ the effective round-trip cost, which equals the quoted width only without price improvement; real fills inside the quotes shrink the bounce and the estimator returns the smaller effective spread. The sharper caveat is order-flow autocorrelation. If sides persist with $\Cov(q_t,q_{t-k})=\rho_k$ , the four-term expansion Equation (6) becomes $2\rho_1-\rho_2-1$ , so $\Cov(\Delta p_t,\Delta p_{t-1})=\tfrac{s^2}{4}(2\rho_1-\rho_2-1)$ and positive persistence $\rho_1>0$ pushes the covariance toward zero, biasing the naive estimator downward and voiding it once $2\rho_1-\rho_2\ge 1$ . Roll's formula assumes the bounce is the only source of serial correlation in prints.

As a worked instance take an efficient-price volatility $\sigma=0.01$ and a spread $s=0.05$ . The lag-one autocovariance is $-s^2/4=-6.25\times 10^{-4}$ , inverting to $s=2\sqrt{6.25\times 10^{-4}}=0.05$ , and the per-period return variance falls from $\sigma^2+s^2/2=1.35\times 10^{-3}$ sampled at unit lag toward $\sigma^2=10^{-4}$ at long lag, the signature plot that distinguishes the spread term from the fundamental variance.

#Adverse selection sets a spread

Roll's bounce produces a spread with symmetric information. Glosten and Milgrom show a spread arises from information alone, with zero processing cost, because a resting quote is picked off by traders who know more [2].

Theorem5

Let the value be binary $V\in\{V_L,V_H\}$ with $\P(V=V_H)=\half$ . A fraction $\pi$ of arriving traders are informed and trade in the direction of $V$ ; the remaining $1-\pi$ are noise traders who buy or sell with probability $\half$ each, independent of $V$ . A competitive risk-neutral market maker quotes the conditional expectations

a=\E[V\mid\text{buy}],\qquad b=\E[V\mid\text{sell}]. \tag{7}

Then the spread is

a-b=\pi\,(V_H-V_L), \tag{8}

strictly positive whenever $\pi>0$ , and the quote mid equals the prior mean $\half(V_H+V_L)$ .

Proof

Take "competitive" to mean free entry, at least two Bertrand-competing makers, so equilibrium drives expected profit per quote to zero. Competition and risk-neutrality then force the quotes in Equation (7), since any ask above $\E[V\mid\text{buy}]$ is undercut by a rival and any ask below it loses money on average against the informed flow, and symmetrically for the bid. An informed trader buys exactly when $V=V_H$ , so $\P(\text{buy}\mid V_H)=\pi+(1-\pi)\half=\half(1+\pi)$ and $\P(\text{buy}\mid V_L)=(1-\pi)\half=\half(1-\pi)$ . With the uniform prior the buy probability is $\half$ , so Bayes gives $\P(V_H\mid\text{buy})=\half(1+\pi)$ and $\P(V_L\mid\text{buy})=\half(1-\pi)$ , whence $a=V_H\tfrac{1+\pi}{2}+V_L\tfrac{1-\pi}{2}$ . By the symmetry of a sell, $b=V_H\tfrac{1-\pi}{2}+V_L\tfrac{1+\pi}{2}$ . Subtracting gives Equation (8), and $a+b=V_H+V_L$ gives the mid.

With $V_H=101$ , $V_L=99$ , and an informed fraction $\pi=0.2$ , the ask is $101\cdot 0.6+99\cdot 0.4=100.2$ , the bid is $101\cdot 0.4+99\cdot 0.6=99.8$ , the spread is $\pi(V_H-V_L)=0.4$ , and the mid is the prior mean $100$ . Doubling the informed fraction to $\pi=0.4$ doubles the spread to $0.8$ , the linear adverse-selection cost of trading against better information.

The static spread is one frame of a dynamic process. With a general prior the quotes still bracket the value, and the belief they encode walks toward the truth as trades accumulate.

Proposition6

For a general prior $\P(V=V_H)=\theta$ , the quotes $a=\E[V\mid\text{buy}]$ and $b=\E[V\mid\text{sell}]$ keep a strictly positive spread for every $\pi>0$ , largest at $\theta=\half$ and shrinking to zero as $\theta\to 0$ or $\theta\to 1$ . The posterior value $m_t=\E[V\mid\Filt_t]$ is a martingale, and as trades accumulate it converges almost surely to $V$ , so the spread vanishes.

Proof

The trade likelihoods are $\P(\text{buy}\mid V_H)=\half(1+\pi)$ and $\P(\text{buy}\mid V_L)=\half(1-\pi)$ , with the sell case reflected, so Bayes maps each trade to a posterior update $\theta\mapsto\theta'$ that raises $\theta$ on a buy and lowers it on a sell. Writing $m_t=V_L+(V_H-V_L)\theta_t$ with $\theta_t=\P(V=V_H\mid\Filt_t)$ , the tower property gives $\E[\theta_{t+1}\mid\Filt_t]=\theta_t$ , so $m_t$ is a bounded martingale and converges almost surely by the martingale convergence theorem. Each trade carries strictly positive information for $\pi>0$ , so the limiting $\sigma$ -field distinguishes $V_H$ from $V_L$ and the limit is $V$ . The spread is the dispersion of $V$ under the trade-conditional posteriors, which is greatest when the prior is most uncertain and contracts as the belief concentrates.

Starting from $\theta=\half$ with $\pi=0.2$ , a buy raises the posterior to $\theta=0.6$ , moves the posterior value from $100$ to $99+2(0.6)=100.20$ , and narrows the spread from $0.40$ to $0.385$ , the quotes tightening as the belief walks toward the truth.

The Roll and Glosten-Milgrom spreads are additive components of the same observed width, one from processing the bounce and one from adverse selection. After a trade the market maker's posterior moves to the executed-side conditional expectation, so the transaction-revised price is a martingale by the tower property, and price impact is revealed as Bayesian updating rather than a separate force. The sequential unit-trade structure here is the simple limit; the batched divisible-quantity limit is Kyle's model below, and both price the same informational wedge.

#Fill probability from queue position

A passive order rests in a first-in, first-out queue at the best bid. Let its queue position $q$ be the number of units that must execute before and including the order, so $q-1$ rest ahead. Assume constant intensities, execution as the only advancement mechanism, and an order that rests indefinitely without own cancellation. Market sell orders arrive as a Poisson process of rate $\mu$ and remove the unit at the front; an adverse move of the best bid arrives as an independent Poisson process of rate $\kappa$ and abandons the level, leaving the order unfilled [3].

Lemma7

For independent $T_\mu\sim\mathrm{Exp}(\mu)$ and $T_\kappa\sim\mathrm{Exp}(\kappa)$ , $\P(T_\mu<T_\kappa)=\mu/(\mu+\kappa)$ , the minimum is $\mathrm{Exp}(\mu+\kappa)$ , and the identity of the winner is independent of the minimum.

Proof

Conditioning on $T_\mu=t$ with density $\mu e^{-\mu t}$ and using $\P(T_\kappa>t)=e^{-\kappa t}$ ,

\P(T_\mu<T_\kappa)=\int_0^\infty\mu e^{-\mu t}e^{-\kappa t}\,dt =\mu\int_0^\infty e^{-(\mu+\kappa)t}\,dt=\frac{\mu}{\mu+\kappa}. \tag{9}

The minimum of independent exponentials is exponential with the summed rate, and the joint density factors into a function of the minimum and the winner label, giving independence.

Proposition8

Under the stated assumptions the probability the order is filled before the level moves is

\P(\text{fill})=\Big(\frac{\mu}{\mu+\kappa}\Big)^{q}. \tag{10}

Proof

Superpose the two Poisson processes into one of rate $\mu+\kappa$ and label each event by its source. By Lemma 7 each event is an execution with probability $\mu/(\mu+\kappa)$ and an adverse move with probability $\kappa/(\mu+\kappa)$ . Independence and stationarity of Poisson increments, together with rates that do not depend on the position, make the labels of successive events independent and identically distributed Bernoulli trials. Pass to the embedded jump chain; by the strong Markov property at each event time the future is independent of the past given the state, and since the split probability is the same in every position the trials are i.i.d. A fill is the event that the first $q$ labels are all executions, which has probability $(\mu/(\mu+\kappa))^q$ .

The fill probability decays geometrically in depth and Equation (10) prices queueing behind size. Two extensions matter. First, the result is a lower bound on real fill probability because it excludes cancellations of the orders ahead, which also advance the position. If each of the $q-1$ orders ahead cancels independently at rate $\nu$ , then in position $j$ the advancement rate is $\mu+(j-1)\nu$ against abandonment $\kappa$ , the races stay independent but cease to be identical, and the product becomes

\P(\text{fill})=\prod_{j=1}^{q}\frac{\mu+(j-1)\nu}{\mu+(j-1)\nu+\kappa}, \tag{11}

which exceeds Equation (10) for $\nu>0$ and collapses to it at $\nu=0$ . Second, conditioning on a fill leaves each sojourn exponential with rate $\mu+\kappa$ , so the expected time to fill given a fill is $\E[T\mid\text{fill}]=q/(\mu+\kappa)$ , distinct from the unconditional expected time to leave the level, $(1-(\mu/(\mu+\kappa))^q)/\kappa$ . A trader weighs the discount of resting, the saved half-spread, against the non-fill risk and the adverse selection of fills that cluster exactly when the price is about to move against the resting side.

For a numerical case take execution rate $\mu=8$ and adverse-move rate $\kappa=2$ per second. A position $q=4$ fills with probability $(8/10)^4=0.4096$ , in expected time $4/10=0.4$ seconds given a fill. Letting the three orders ahead cancel at rate $\nu=1$ raises the fill probability to the product $\tfrac{8}{10}\cdot\tfrac{9}{11}\cdot\tfrac{10}{12}\cdot\tfrac{11}{13}=0.462$ by Equation (11), the gain from queue attrition ahead of the order.

#Which way the mid moves

The same Poisson primitives that fill an order also move the price. Reduce each best queue to a birth-death walk with arrival rate $\lambda$ adding depth and depletion rate $\mu$ removing it, and let the mid tick up when the ask queue empties before the bid queue. The single-queue first-passage probability is the gambler's ruin [3].

Theorem9

A birth-death queue of size $n$ with up-rate $\lambda$ and down-rate $\mu$ , started at $0<n<N$ , empties before reaching $N$ with probability

h(n)=\frac{\rho^{n}-\rho^{N}}{1-\rho^{N}},\qquad \rho=\frac{\mu}{\lambda}, \tag{12}

and when the queue drifts up, $\rho<1$ , the probability it ever empties is $\lim_{N\to\infty}h(n)=\rho^{\,n}=(\mu/\lambda)^{n}$ .

Proof

Only the embedded jump chain matters for which boundary is hit, since the holding times do not affect the order of jumps. In the jump chain the size rises by one with probability $p=\lambda/(\lambda+\mu)$ and falls by one with probability $1-p=\mu/(\lambda+\mu)$ . The absorption probability $h(n)=\P_n(\text{hit }0\text{ before }N)$ satisfies the harmonic equation $h(n)=p\,h(n+1)+(1-p)\,h(n-1)$ with $h(0)=1$ , $h(N)=0$ . The general solution is $h(n)=C_1+C_2\rho^n$ with $\rho=(1-p)/p=\mu/\lambda$ , since $\rho$ solves the characteristic equation $p\rho^2-\rho+(1-p)=0$ . Imposing the boundary values gives $C_2=1/(1-\rho^N)$ and $C_1=-\rho^N/(1-\rho^N)$ , which is Equation (12). Letting $N\to\infty$ with $\rho<1$ sends $\rho^N\to 0$ and leaves $\rho^n$ .

Racing the two best queues, the mid ticks up when the ask empties first, an event whose probability increases in the imbalance $I=Q_b/(Q_b+Q_a)$ , the bid share of touch depth. The monotonicity is a coupling argument. Raising $Q_b$ at fixed $Q_a$ lengthens the bid's first-passage time to zero pathwise while leaving the ask's unchanged, so the ask wins more often, and a heavier bid queue predicts an up move. Imbalance is therefore the natural state variable for the fair price, which the next section makes precise.

#The microprice

The mid is not the fair value. A heavy bid queue tilts the next move up, so the value conditional on the book sits above the mid, and the correction is a function of imbalance [4].

Definition10

The microprice is the expected mid at the next price move conditional on the current state, $P^{\mathrm{micro}}_t=\E[M_{\tau}\mid M_t,I_t]$ , where $\tau$ is the time of the next mid move.

Theorem11

The microprice is the one-step minimum-mean-squared-error predictor of the next observed mid and decomposes as $P^{\mathrm{micro}}_t=M_t+g(I_t)$ with $g$ odd about $I=\half$ , $g(\half)=0$ , and $g$ increasing in $I$ . To first order in the tick it is the imbalance-weighted touch

P^{\mathrm{micro}}_t=I_t\,A_t+(1-I_t)\,B_t. \tag{13}

Proof

By construction the microprice is the conditional expectation $\E[M_\tau\mid M_t,I_t]$ , which Proposition 12 shows is the unique state-measurable predictor of $M_\tau$ minimising mean-squared error; the mid is the suboptimal special case, since $\E[M_\tau\mid M_t,I_t]-M_t=g(I_t)\neq 0$ off balance. The up-move probability of the previous section is increasing in $I$ and symmetric under the swap $I\mapsto 1-I$ with $A\leftrightarrow B$ , so the expected signed mid move inherits oddness about $I=\half$ , vanishing at $I=\half$ and increasing in $I$ . Write the next mid move as $\pm s/2$ to the opposite touch, up with probability $p_{\mathrm{up}}$ and down otherwise, so $P^{\mathrm{micro}}_t-M_t=\tfrac{s}{2}(2 p_{\mathrm{up}}-1)$ . Linearising the gambler's-ruin up probability of the previous section as $p_{\mathrm{up}}=I_t+O(\text{tick})$ , that is taking imbalance to equal the up probability to first order, matches $I_t A_t+(1-I_t)B_t=M_t+s(I_t-\half)$ , which is Equation (13).

At $I=\half$ the microprice is the mid; a full bid queue, $I\to 1$ , pulls the fair value to the ask. The mid is a biased predictor of the next price and the microprice removes the bias, and the sense in which it is the right predictor is exact.

Proposition12

Among all predictors of the next mid measurable with respect to the current state, the microprice minimises mean-squared error.

Proof

For any state-measurable predictor $g$ ,

\E[(M_\tau-g)^2]=\E[(M_\tau-P^{\mathrm{micro}})^2]+\E[(P^{\mathrm{micro}}-g)^2], \tag{14}

since the cross term $\E[(M_\tau-P^{\mathrm{micro}})(P^{\mathrm{micro}}-g)]$ vanishes when both $P^{\mathrm{micro}}$ and $g$ are state-measurable, by the orthogonality of the conditional expectation. The second term of Equation (14) is nonnegative and is zero exactly when $g=P^{\mathrm{micro}}=\E[M_\tau\mid M_t,I_t]$ , so the microprice is the unique minimiser. The mid is the special case $g=M_t$ , which is suboptimal off balance by the squared bias $g(I_t)^2$ .

This is why execution and signal models condition on imbalance rather than on the mid alone.

#Information becomes impact

The spread and the queue defend against one hazard. A resting quote can be hit by a trader who knows more, and Kyle's model makes the resulting price impact exact in a batched auction [5].

Theorem13

A risk-neutral informed trader observes the terminal value $V\sim\mathcal N(p_0,\Sigma_0)$ and submits demand $x$ ; noise traders submit $u\sim\mathcal N(0,\sigma_u^2)$ independent of $V$ ; a competitive market maker observes only the aggregate flow $y=x+u$ and sets $p=\E[V\mid y]$ . The unique linear equilibrium is

x=\beta\,(V-p_0),\qquad p=p_0+\lambda\,y,\qquad \beta=\frac{\sigma_u}{\sqrt{\Sigma_0}},\qquad \lambda=\frac{\sqrt{\Sigma_0}}{2\,\sigma_u}, \tag{15}

and exactly half the prior variance is resolved, $\Var(V\mid y)=\Sigma_0/2$ .

Proof

Conjecture the linear forms and solve for a consistent pair $(\beta,\lambda)$ . Given $p=p_0+\lambda y$ , and since $x$ is a function of the observed $V$ while $u$ is independent of $V$ with mean zero, $\E[ux\mid V]=x\,\E[u]=0$ , so the informed trader's expected profit is

\E\big[(V-p)\,x\mid V\big]=\E\big[(V-p_0-\lambda(x+u))\,x\mid V\big]=(V-p_0)\,x-\lambda x^2. \tag{16}

The objective is bounded above only if $\lambda>0$ , for if $\lambda\le 0$ it is convex or linear in $x$ and unbounded, so no optimal demand exists and there is no equilibrium. Under $\lambda>0$ the first-order condition $(V-p_0)-2\lambda x=0$ gives $x=(V-p_0)/(2\lambda)$ , the second derivative $-2\lambda<0$ confirms the maximum, and $\beta=1/(2\lambda)$ . Competition forces the maker to break even, $p=\E[V\mid y]$ , which Bertrand undercutting between two or more makers selects as the unique no-undercutting schedule. The pair $(V,y)$ is jointly Gaussian with $y=\beta(V-p_0)+u$ of mean zero, so the projection theorem gives

\E[V\mid y]=p_0+\frac{\Cov(V,y)}{\Var(y)}\,y=p_0+\frac{\beta\Sigma_0}{\beta^2\Sigma_0+\sigma_u^2}\,y, \tag{17}

using $\Cov(V,y)=\beta\Sigma_0$ and $\Var(y)=\beta^2\Sigma_0+\sigma_u^2$ , so $\lambda=\beta\Sigma_0/(\beta^2\Sigma_0+\sigma_u^2)$ . Substituting $\beta=1/(2\lambda)$ and clearing denominators leaves $\Sigma_0+4\lambda^2\sigma_u^2=2\Sigma_0$ , hence $\lambda^2=\Sigma_0/(4\sigma_u^2)$ ; the positive root is forced by $\lambda>0$ and by $\lambda=\Cov(V,y)/\Var(y)$ sharing the sign of $\beta$ , giving Equation (15). The posterior variance is $\Var(V\mid y)=\Sigma_0-\Cov(V,y)^2/\Var(y)=\Sigma_0-\Sigma_0/2=\Sigma_0/2$ on substituting the equilibrium $\beta$ . Within the linear class the system has this single positive solution, so the linear equilibrium is unique.

The impact slope $\lambda$ rises with the value uncertainty $\Sigma_0$ the maker faces and falls with the noise volume $\sigma_u$ that camouflages the informed order. Exactly half the private signal is impounded, since $\lambda\beta=\half$ gives $\E[\,p-p_0\mid V\,]=\half(V-p_0)$ , the price companion of the variance result $\Var(V\mid y)=\Sigma_0/2$ . The informed trader earns $\E[(V-p)x]=\lambda\sigma_u^2=\half\sqrt{\Sigma_0}\,\sigma_u$ , paid by the noise traders against the break-even maker, increasing in both prior uncertainty and the noise volume to hide behind. Glosten-Milgrom is the sequential unit-trade dual in which the same wedge appears as a bid-ask spread rather than a slope, and the informed trader here is a strategic monopolist who shades demand, the factor $\half$ being exactly that restraint. In the continuous-time limit of $N$ auctions the slope is constant in time and equal to $\sqrt{\Sigma_0}/\sigma_u$ without the factor two, the posterior variance falls linearly to zero, $\Var(V\mid\Filt_t)=\Sigma_0(1-t)$ , and the price runs as a Brownian bridge from $p_0$ to $V$ with full revelation at the close.

Take prior variance $\Sigma_0=4$ , so $\sqrt{\Sigma_0}=2$ , and noise volatility $\sigma_u=1$ . Then $\lambda=2/(2\cdot 1)=1$ , $\beta=1/2$ , the product $\lambda\beta=\half$ , the posterior variance is $\Sigma_0/2=2$ , and the informed trader's expected profit is $\half\sqrt{\Sigma_0}\,\sigma_u=1$ . The continuous-time slope on the same primitives is $\sqrt{\Sigma_0}/\sigma_u=2$ , twice the single-period value, the factor the two settings must not share. Glosten-Milgrom and Kyle price the same informational wedge through different mechanisms.

Feature	Glosten-Milgrom	Kyle
Trade structure	Sequential unit trades	One batched auction
Informed trader	Price taker	Strategic monopolist
Price object	Bid and ask quotes	One linear price
Wedge	Spread $\pi(V_H-V_L)$	Slope $\lambda=\sqrt{\Sigma_0}/(2\sigma_u)$
Information revealed	Toward full over many trades	Half the variance in one shot

#Inventory and the maker's reservation price

Adverse selection is one cost the maker bears; inventory risk is the other. A maker filled on one side accumulates a position in an asset whose value moves, and a risk-averse maker prices that exposure into its quotes, skewing them to mean-revert inventory toward zero [6].

Theorem14

A maker with constant absolute risk aversion $\gamma$ holding inventory $q$ in an asset whose mid $s$ follows arithmetic Brownian motion with volatility $\sigma$ over a remaining horizon $T-t$ values the inventory at the certainty equivalent $s\,q-\half\gamma\sigma^2 q^2(T-t)$ , and its reservation price, the per-unit indifference value at which it will add one more unit, is

r(s,q,t)=s-q\,\gamma\sigma^2(T-t). \tag{18}

Proof

Under exponential utility and a Gaussian mid the certainty equivalent of holding $q$ units to the horizon is the mean value minus a risk penalty equal to half the risk aversion times the variance of the terminal position, $\E[s_T]q-\half\gamma\Var(s_T)q^2=s\,q-\half\gamma\sigma^2(T-t)q^2$ , since the arithmetic Brownian mid has $\E[s_T]=s$ and $\Var(s_T)=\sigma^2(T-t)$ . The reservation price is the marginal certainty equivalent, the derivative with respect to $q$ , $\partial_q\big(s\,q-\half\gamma\sigma^2 q^2(T-t)\big)=s-q\gamma\sigma^2(T-t)$ , which is Equation (18). A long maker, $q>0$ , prices below the mid to encourage selling the inventory off, and a short maker prices above it, with the skew vanishing at the horizon.

The reservation price is where the maker centres its quotes, and the optimal half-spread around it trades the profit of a wider quote against the lower fill rate it implies. Solving the maker's utility maximisation against an exponential fill intensity $\lambda(\delta)=A\,e^{-\kappa\delta}$ in the quote distance $\delta$ gives the Avellaneda-Stoikov optimal total spread

\delta_a+\delta_b=\gamma\sigma^2(T-t)+\frac{2}{\gamma}\ln\!\Big(1+\frac{\gamma}{\kappa}\Big), \tag{19}

with the bid and ask placed symmetrically about the reservation price rather than about the mid [7]. As a worked case take $s=100$ , risk aversion $\gamma=0.1$ , volatility $\sigma=0.5$ so $\sigma^2=0.25$ , horizon $T-t=1$ , and fill decay $\kappa=1.5$ . The total spread is $0.1\cdot 0.25\cdot 1+\tfrac{2}{0.1}\ln(1+0.1/1.5)=0.025+1.291=1.316$ , a half-spread of $0.658$ , and the reservation price and quotes skew with inventory.

Inventory $q$	Reservation $r$	Bid	Ask
$+5$ (long)	$99.875$	$99.217$	$100.533$
$0$ (flat)	$100.000$	$99.342$	$100.658$
$-5$ (short)	$100.125$	$99.467$	$100.783$

The long maker lowers both quotes to sell its position down, the short maker raises them to buy back, and the reservation skew $q\gamma\sigma^2(T-t)$ is the inventory leg of the spread that sits beside the adverse-selection leg of Glosten and Milgrom and the order-processing leg of Roll.

#Optimal execution

Price formation tells a trader what the book will do. Execution asks what the trader should do in return, liquidating a position without paying away the impact the previous sections priced [8].

Theorem15

Liquidate $X$ shares over $[0,T]$ with holdings $x(t)$ , $x(0)=X$ , $x(T)=0$ . Let temporary impact cost the rate $\eta\,\dot x(t)^2$ and let the unaffected price carry volatility $\sigma$ , with mean-variance risk aversion $\gamma$ . The trajectory minimising $\E[\text{cost}]+\gamma \Var[\text{cost}]$ solves $\eta\,\ddot x=\gamma\sigma^2 x$ , giving

x(t)=X\,\frac{\sinh\!\big(\theta(T-t)\big)}{\sinh(\theta T)},\qquad \theta=\sqrt{\frac{\gamma\sigma^2}{\eta}}, \tag{20}

which front-loads selling, and the risk-neutral limit $\gamma\to 0$ is the straight line $x(t)=X(1-t/T)$ of constant participation.

Proof

Optimise over deterministic, pre-committed trajectories $x(t)$ . Temporary impact accumulates expected cost $\eta\int_0^T\dot x^2\,dt$ , and the residual position held against a random walk of volatility $\sigma$ incurs price risk $\int_0^T x(t)\,dW_t$ of mean zero and, by the Ito isometry on the deterministic integrand, variance $\sigma^2\int_0^T x^2\,dt$ , the permanent-impact term being path-independent and dropped. The objective is the functional

J[x]=\int_0^T\big(\eta\,\dot x^2+\gamma\sigma^2 x^2\big)\,dt. \tag{21}

Its Euler-Lagrange equation $\frac{d}{dt}(2\eta\dot x)=2\gamma\sigma^2 x$ reduces to $\eta\ddot x=\gamma\sigma^2 x$ , a linear second-order equation with general solution $x(t)=C_1 e^{\theta t}+C_2 e^{-\theta t}$ and $\theta=\sqrt{\gamma\sigma^2/\eta}$ . The boundary conditions $x(0)=X$ , $x(T)=0$ select the hyperbolic-sine combination Equation (20), the optimal schedule within the deterministic class. As $\gamma\to 0$ the equation becomes $\ddot x=0$ , whose boundary solution is the straight line, and as $\gamma\to\infty$ the trajectory liquidates immediately.

The two limits are the entire trade-off. A patient risk-neutral trader spreads the order uniformly to minimise impact, and a risk-averse trader front-loads to cut exposure to price risk, the curvature $\theta$ setting the urgency. Kyle's $\lambda$ is the permanent-impact slope that the trajectory pays once on net, while $\eta$ is the temporary slope it pays on every instant of trading, the same linearity producing two distinct costs.

As a worked case set the urgency so that $\theta T=2$ . The fraction of the position still held at the midpoint of the schedule is $x(T/2)/X=\sinh(\theta T/2)/\sinh(\theta T)=\sinh 1/\sinh 2=0.324$ , so $67.6\%$ of the order is sold in the first half of the window, the front-loading a risk-averse trader chooses. Sending $\theta\to 0$ flattens the curve to the straight-line schedule $x(t)=X(1-t/T)$ , which holds exactly half the position at $t=T/2$ .

#Transient impact and resilience

Kyle's impact is permanent and the Almgren-Chriss temporary impact vanishes the instant trading stops. Real impact lies between. A market order consumes depth and pushes the price, then the book replenishes and the price relaxes part of the way back, so impact is transient and the speed of its decay is the resilience of the book [9].

Model the displacement of a signed trade of size $v$ at time zero as $\kappa v$ , relaxing at rate $\rho$ , so its contribution to the price at time $t$ is $\kappa v e^{-\rho t}$ . For a trading rate $\dot v_s$ the impact is the convolution of the flow against the resilience kernel,

I_t=\int_0^t G(t-s)\,\dot v_s\,ds,\qquad G(\tau)=\kappa\,e^{-\rho\tau}. \tag{22}

Proposition16

Under Equation (22) a constant trading rate $\dot v_s\equiv c$ drives the impact to the steady state $I_\infty=\kappa c/\rho$ , while a single block of size $v$ followed by no trading leaves a residual impact that halves every $(\ln 2)/\rho$ in time.

Proof

For the constant rate, $I_t=\kappa c\int_0^t e^{-\rho(t-s)}\,ds=\dfrac{\kappa c}{\rho}(1-e^{-\rho t}) \to\dfrac{\kappa c}{\rho}$ . For the block, taking $\dot v_s=v\,\delta(s)$ gives $I_t=\kappa v e^{-\rho t}$ , which falls by half when $e^{-\rho t}=\half$ , that is at $t=(\ln 2)/\rho$ .

The steady state is the standing impact a sustained participation rate maintains, the resilience pulling against the trading pushing, and the model recovers the two extremes as limits, $\rho\to 0$ giving permanent impact and $\rho\to\infty$ giving purely temporary impact. A linear propagator must respect a no-arbitrage constraint, since a kernel that relaxed the wrong way would let a round trip profit, and the Gatheral condition gives a sufficient guarantee, that for linear impact a non-increasing and convex kernel $G$ excludes price manipulation, a test the exponential resilience passes [10]. Under Equation (22) the optimal liquidation is no longer the smooth Almgren-Chriss curve but a discrete block at the start, a constant rate through the middle, and a block at the close, the trader hitting fresh depth hard, coasting on resilience, and clearing the residual at the end. As a numerical case, three unit buys spaced by $\tau=(\ln 2)/\rho$ leave cumulative impact $\kappa(1+\half+\tfrac14)=\tfrac74\kappa$ just after the third trade, against $3\kappa$ with no decay, the resilience having absorbed the rest.

#Order-flow memory and the square-root law

The propagator Equation (22) generalises from a single decay rate to an arbitrary kernel, and that generalisation explains the empirical impact law the linear models miss. Write the price as the sum of past trades filtered through a decaying kernel,

p_t=p_0+\sum_{s<t}G(t-s)\,\varepsilon_s+\text{noise}, \tag{23}

where $\varepsilon_s\in\{-1,+1\}$ is the sign of the trade at $s$ and $G(\tau)\propto\tau^{-\beta}$ decays as a power law. The order-flow signs are not independent, since splitting and herding give them long memory, an autocorrelation $C(\ell)=\E[\varepsilon_t\varepsilon_{t-\ell}]\propto\ell^{-\gamma}$ with $0<\gamma<1$ , so a buy is followed by more buys for a long time.

Persistent flow through a fixed kernel would make the price predictable, which arbitrage forbids, so the kernel decay and the flow memory are locked together. Requiring price increments to be serially uncorrelated, the diffusive efficiency condition, forces

\beta=\frac{1-\gamma}{2}. \tag{24}

The argument is a scaling balance, since the autocovariance of price increments pairs the kernel response $G$ against the flow correlation $C$ , and the two power laws cancel to leading order only at this exponent, the delicate balance that keeps a market made of autocorrelated orders statistically efficient [11]. The same balance produces the square-root law, that the impact of a metaorder of size $Q$ grows as

\Delta P\approx Y\,\sigma\sqrt{Q/\mathcal V}, \tag{25}

concave rather than linear against the daily volume $\mathcal V$ , because the early child orders of the metaorder are partially relaxed by the power-law kernel before the later ones land, so doubling the size less than doubles the impact. The concavity is a consequence of the decaying propagator and the long memory of flow rather than an independent assumption, while the precise one-half exponent is the robust empirical value that the locally linear latent-supply picture reproduces, and the prefactor $Y$ is genuinely empirical and of order one. That is the honest status of the law, a concave shape that follows from order-flow memory and resilience, sitting on a measured exponent and prefactor.

#Summary of results

Each model is a statement about the same object under its own assumptions.

Result	Statement	Key assumption
Efficient price	$S^\ast_t=\E[V\mid\Filt_t]$ is a martingale	$V\in L^1$
Roll spread	$\Cov(\Delta p_t,\Delta p_{t-1})=-s^2/4$	I.i.d. symmetric sides, independent of value
Glosten-Milgrom	Spread $=\pi(V_H-V_L)$	Sequential unit trades, competitive maker
Fill probability	$(\mu/(\mu+\kappa))^q$	Constant rates, execution-only advance
Mid move	$h(n)=(\rho^n-\rho^N)/(1-\rho^N)$	Birth-death queue, embedded jump chain
Microprice	$I A+(1-I)B$ to first order	Symmetric tick moves, imbalance state
Kyle impact	$\lambda=\sqrt{\Sigma_0}/(2\sigma_u)$	Gaussian value and noise, linear conjecture
Reservation price	$r=s-q\gamma\sigma^2(T-t)$	CARA maker, Gaussian mid
Execution	$x(t)=X\sinh(\theta(T-t))/\sinh(\theta T)$	Linear temporary impact, mean-variance
Transient impact	$I_t=\int_0^t\kappa e^{-\rho(t-s)}\dot v_s\,ds$	Exponential resilience, convex kernel
Square-root law	$\Delta P\propto\sqrt{Q}$	Power-law propagator, long-memory flow

#Numerical illustration

Roll's identity Equation (3) is exact in expectation, so a long simulated print series recovers the spread it was given from the first serial covariance alone.

import numpy as np
from numpy.random import Generator


def simulate_prints(
    sigma: float, spread: float, n: int, rng: Generator
) -> np.ndarray:
    """Simulate transaction prints under Roll's bid-ask bounce model.

    Args:
        sigma: Standard deviation of the efficient-price innovations.
        spread: Effective spread; trades print at the efficient price plus or
            minus half the spread.
        n: Number of prints.
        rng: Seeded generator for reproducibility.

    Returns:
        The transaction price series of length n.
    """
    efficient = np.cumsum(rng.normal(0.0, sigma, size=n))
    side = rng.choice([-1.0, 1.0], size=n)
    return efficient + 0.5 * spread * side


def estimate_spread(prints: np.ndarray) -> float:
    """Recover the effective spread from the first serial covariance.

    Args:
        prints: A transaction price series.

    Returns:
        The Roll estimate 2 * sqrt(-cov) of the spread when the lag-one
        covariance of price changes is negative, and zero otherwise.
    """
    changes = np.diff(prints)
    cov = np.cov(changes[1:], changes[:-1])[0, 1]
    return 2.0 * np.sqrt(-cov) if cov < 0.0 else 0.0


rng = np.random.default_rng(0)
prints = simulate_prints(sigma=0.01, spread=0.05, n=500_000, rng=rng)
recovered = estimate_spread(prints)

The efficient price is the martingale the order book tracks. The spread is the cost of the bounce and of adverse selection, the queue sets who is filled and which way the price tips, the microprice corrects the mid for imbalance, Kyle's slope turns information into impact, and the execution trajectory pays that impact back at the least total cost. Each is a theorem about the same object, a priority queue carrying information.

[1]

R. Roll, “A simple implicit measure of the effective bid-ask spread in an efficient market,” The Journal of Finance, vol. 39, no. 4, pp. 1127–1139, 1984.

[2]

L. R. Glosten and P. R. Milgrom, “Bid, ask and transaction prices in a specialist market with heterogeneously informed traders,” Journal of Financial Economics, vol. 14, no. 1, pp. 71–100, 1985.

[3]

R. Cont, S. Stoikov, and R. Talreja, “A stochastic model for order book dynamics,” Operations Research, vol. 58, no. 3, pp. 549–563, 2010.

[4]

S. Stoikov, “The micro-price: a high-frequency estimator of future prices,” Quantitative Finance, vol. 18, no. 12, pp. 1959–1966, 2018.

[5]

A. S. Kyle, “Continuous auctions and insider trading,” Econometrica, vol. 53, no. 6, pp. 1315–1335, 1985.

[6]

T. Ho and H. R. Stoll, “Optimal dealer pricing under transactions and return uncertainty,” Journal of Financial Economics, vol. 9, no. 1, pp. 47–73, 1981.

[7]

M. Avellaneda and S. Stoikov, “High-frequency trading in a limit order book,” Quantitative Finance, vol. 8, no. 3, pp. 217–224, 2008.

[8]

J.-P. Bouchaud, J. Bonart, J. Donier, and M. Gould, Trades, Quotes and Prices: Financial Markets Under the Microscope. Cambridge University Press, 2018.

[9]

A. A. Obizhaeva and J. Wang, “Optimal trading strategy and supply/demand dynamics,” Journal of Financial Markets, vol. 16, no. 1, pp. 1–32, 2013.

[10]

J. Gatheral, “No-dynamic-arbitrage and market impact,” Quantitative Finance, vol. 10, no. 7, pp. 749–759, 2010.

[11]

J.-P. Bouchaud, Y. Gefen, M. Potters, and M. Wyart, “Fluctuations and response in financial markets: the subtle nature of random price changes,” Quantitative Finance, vol. 4, no. 2, pp. 176–190, 2004.

Explore connections

see in the atlas

referenced by (2)

cite

@misc{price-formation-in-the-order-book,
  author = {Zac Kienzle},
  title  = {Price Formation in the Order Book},
  year   = {2026},
  month  = {05},
  url    = {https://zackienzle.com/blog/price-formation-in-the-order-book}
}