The Neyman-Pearson lemma says that, for a simple null against a simple alternative, the likelihood ratio test is the most powerful test at a given level and sample size. But the likelihood ratio is also a nonnegative supermartingale under the null, hence an e-process, so it provides type-I error control at all times and can be used as a sequential test. How does it perform as such?
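
To see where the anytime-valid guarantee comes from, here is a short sketch. It assumes i.i.d. observations with density $p$ under the null, and writes $L_{t}$ for the running likelihood ratio defined below, with $L_{0} = 1$. The filtration notation $\mathcal{F}_{t}$ (the information available at time $t$) is introduced here only for illustration:

$$
\mathbb{E}_{P} [L_{t} \mid \mathcal{F}_{t-1}] = L_{t-1} \int_{\{p > 0\}} \frac{q (x)}{p (x)} \, p (x) \, dx = L_{t-1} \int_{\{p > 0\}} q (x) \, dx \leq L_{t-1},
$$

so $(L_{t})$ is a nonnegative supermartingale under $P$, and Ville's inequality gives

$$
P \big( \exists t \geq 1 : L_{t} \geq 1/α \big) \leq α .
$$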

Let $(X_{t})$ be a stream of data and let $L_{t} = \prod_{i \leq t} q (X_{i}) / p (X_{i})$ be the likelihood ratio at time $t$ for a simple alternative $Q$ with density $q$ against a simple null $P$ with density $p$ . Suppose you want a type-I error (false positive rate) of $α$ and a type-II error (false negative rate) of $β$ . Then there exist thresholds $t_{1} (α, β) > t_{2} (α, β)$ such that the following test is optimal: reject the null when $L_{t} \geq t_{1} (α, β)$ , reject the alternative when $L_{t} \leq t_{2} (α, β)$ , and keep sampling otherwise. This is the sequential probability ratio test (SPRT). Here optimality means that, among all tests with the same (or lower) type-I and type-II error rates, the SPRT requires the fewest samples in expectation, under both $P$ and $Q$ .
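
The stopping rule above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `sprt` and `gaussian_log_lr`, the Gaussian example, and the thresholds 19 and 1/19 are all assumptions made for the example, and the thresholds are passed in by the caller rather than solved for. The statistic is accumulated on the log scale to avoid overflow.

```python
import math

def sprt(stream, log_lr, upper, lower):
    """Sequential probability ratio test on the log scale.

    stream: iterable of observations
    log_lr: function x -> log(q(x) / p(x))
    upper, lower: thresholds on the likelihood ratio, upper > 1 > lower >= 0
    Returns (decision, number of observations used).
    """
    log_upper = math.log(upper)
    # lower = 0 means "never reject the alternative"
    log_lower = math.log(lower) if lower > 0 else -math.inf
    log_L = 0.0  # log of the running likelihood ratio L_t
    n = 0
    for x in stream:
        n += 1
        log_L += log_lr(x)
        if log_L >= log_upper:
            return "reject null", n
        if log_L <= log_lower:
            return "reject alternative", n
    return "undecided", n  # stream ended before either threshold was crossed

# Hypothetical example: P = N(0, 1) against Q = N(1, 1), for which
# log(q(x) / p(x)) = x - 1/2. The thresholds 19 and 1/19 are illustrative only.
def gaussian_log_lr(x):
    return x - 0.5

print(sprt([1.0] * 100, gaussian_log_lr, upper=19.0, lower=1 / 19))
print(sprt([0.0] * 100, gaussian_log_lr, upper=19.0, lower=1 / 19))
```

With a constant stream of ones, each observation adds 0.5 to the log-likelihood ratio, so the null is rejected at the sixth observation. A constant stream of zeros rejects the alternative at the sixth observation by symmetry.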

Wald introduced this test in the 1940s, and it was proved optimal by Wald and Wolfowitz in 1948.

Unfortunately, $t_{1}$ and $t_{2}$ usually cannot be solved for exactly, so approximations are necessary. One usually sets $t_{1} = (1 - β) / α$ and $t_{2} = β / (1 - α)$ . Note that if we set $β = 0$ , meaning we want a power-one test (one that rejects the null with probability 1 in the limit under the alternative), then $t_{2} = 0$ , so the test never rejects the alternative, and we recover the threshold $1/α$ on $L_{t}$ (equivalently $\log (1/α)$ on $\log L_{t}$ ) for rejecting the null, which is what one would usually obtain for sequential testing with Ville’s inequality.
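
As a rough check of these approximate thresholds, the sketch below computes them and estimates the resulting error rates by simulation. The setup is an assumption chosen for illustration: $P = N(0, 1)$ against $Q = N(1, 1)$, with hypothetical helper names `wald_thresholds` and `run_sprt`, 2000 simulated runs, and a cap on the number of steps.

```python
import math
import random

def wald_thresholds(alpha, beta):
    """Wald's approximate SPRT thresholds on the likelihood-ratio scale."""
    return (1 - beta) / alpha, beta / (1 - alpha)

def run_sprt(rng, true_mean, upper, lower, max_steps=10_000):
    """SPRT for P = N(0, 1) vs Q = N(1, 1); returns True if the null is rejected."""
    log_upper = math.log(upper)
    log_lower = math.log(lower) if lower > 0 else -math.inf
    log_L = 0.0
    for _ in range(max_steps):
        x = rng.gauss(true_mean, 1.0)
        log_L += x - 0.5  # log(q(x) / p(x)) for unit-variance Gaussians, means 1 and 0
        if log_L >= log_upper:
            return True
        if log_L <= log_lower:
            return False
    return False  # no decision within max_steps; counted as not rejecting the null

alpha, beta = 0.05, 0.10
upper, lower = wald_thresholds(alpha, beta)

rng = random.Random(0)  # fixed seed so the run is reproducible
n_runs = 2000
# Data drawn from the null: fraction of runs that wrongly reject the null.
type_1 = sum(run_sprt(rng, 0.0, upper, lower) for _ in range(n_runs)) / n_runs
# Data drawn from the alternative: fraction of runs that fail to reject the null.
type_2 = sum(not run_sprt(rng, 1.0, upper, lower) for _ in range(n_runs)) / n_runs

print(f"thresholds: upper={upper:.2f}, lower={lower:.4f}")
print(f"empirical type-I error:  {type_1:.3f} (target {alpha})")
print(f"empirical type-II error: {type_2:.3f} (target {beta})")
```

Because the likelihood ratio typically overshoots a threshold when it crosses it, the empirical error rates should tend to come out somewhat below the targets. Calling `wald_thresholds(alpha, 0.0)` gives the power-one case, with upper threshold $1/α$ and lower threshold 0.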

Reading

Optimum Character of the Sequential Probability Ratio Test, Wald and Wolfowitz (1948).
Improving approximate SPRT: https://arxiv.org/pdf/2410.16076. When the distributions are discrete, the thresholds $t_{1} (α, β)$ and $t_{2} (α, β)$ may not be attained exactly, in which case the test will be overly conservative.
