The Neyman-Pearson lemma says that the likelihood ratio test is the uniformly most powerful test at a given sample size. But the likelihood ratio is a nonnegative supermartingale, hence an e-process, so it provides type-I error control at all times, and is a sequential test. How does it perform as such?

Let be a stream of data and let be the likelihood ratio at time for a simple alternative with density against a simple null with density . Suppose you have desired type-I error of (false positives) and desired type-II error of (false negatives). Then there exist thresholds and such that if you reject the null when and reject the alternative when such that the test is optimal. Here optimality means that, among all tests with the same (or lower) type-I and type-II error rates, the SPRT requires the fewest number of samples in expectation.

Wald introduced this test in the 1930s, and it was proved optimal by Wald and Wolfowitz in 1948.

Unfortunately, and are often impossible to solve for, so approximations are necessary. One usually sets and . Note that if we set , meaning we want a power 1 test (rejects the null with probability 1 in the limit under the alternative), then we recover the threshold for rejecting the null, which is what one would usually obtain for sequential testing with Ville’s inequality.

Reading