User:Zaqrfv/draft
- This page is a new version of Neyman-Pearson lemma, still under construction. Mainly, still needs citations, links etc. Comments specific to this rewrite can be added here, or generally related to NP lemma, here.
The Neyman-Pearson Lemma establishes that the likelihood ratio test is the most powerful procedure for simple-vs-simple statistical hypotheses testing.
The Neyman-Pearson lemma is not directly applicable to composite hypotheses. However, in certain special cases, the lemma provides the same optimal test for each member of a composite alternative, and in this case, the test is Uniformly most powerful.
Statement of the Lemma
[edit]In a simple-vs-simple (or point-vs-point) hypotheses testing problem, the distribution of observed data X is fully specified, with no unknown parameters, by both the null and alternative hypothesis. Here, X may represent a single random observation, a vector of observations, or (using measure-theoretic definitions of densities) any other type of random data.
Mathematically, let be the density function of X, and suppose that one wishes to test the null hypothesis against the alternative hypothesis . The Neyman-Pearson test compares the ratio of the two densities:
Accept if .
Reject if .
The critical value c is chosen so as to obtain a specified significance level .
Note that for discrete data, and are mass functions. In this case one has to be more careful about - see the section on critical functions below.
The Neyman-Pearson Lemma states that, among all possible tests with significance level , the Neyman-Pearson (or Likelihood Ratio) test as defined above is the most powerful test. That is, it has the greatest probability of rejecting the null hypothesis, and accepting the alternative, when the alternative hypothesis is the correct hypothesis.
Neyman-Pearson Lemma in terms of critical functions.
[edit]In statistical hypotheses testing, a critical function[1] represents the probability, as a function of the observation X, that the null hypothesis is rejected. Usually, one considers only non-randomized tests (i.e. tests that depend only on the observations X), and the critical function takes only the values 0 and 1 (corresponding to accepting or rejecting the null hypothesis). In a randomized test, it may take any value between 0 and 1. The advantage of introducing critical functions and randomized tests in the present context is that it allows significance levels to be set exactly for discrete problems, such as Poisson distributions.
In terms of critical functions, the significance level of the test is , while the power is , where and denote expectation with repect to the hypothesized distributions.
The critical function for the Neyman-Peason LRT with specified significance level is
where p and c are determined by the requirement (in most cases with continuous data, p is arbitrary).
The Neyman-Pearson Lemma can be stated as follows: Let be the critical function for the level- likelihood ratio test, and be the critical function for any other level- test (i.e. ). Then
Example: Poisson distribution.
[edit]Suppose that X is a single observation from a Poisson distribution with mean , and one wishes to test the hypothesis against the alternative . Suppose that .
The likelihood ratio test statistic is
This is a monotone increasing function of X, and so the Neyman-Pearson test rejects for large values of X. The critical value is determined by the specified significance level .
To be specific, suppose , , and . Poisson tables show that if the null hypothesis is true, , and . The Neyman-Pearson test rejects whenever X>8, and with probability p when X=8. p is determined by the equation
or p=0.4888. The critical function for the Neyman-Pearson test is
Following this derivation carefully, one notes that the rule depends only on . The value of plays no role, beyond the requirement . It follows that this test is the Neyman-Pearson test for any , and is therefore uniformly most powerful for the composite alternative .
Example: Relation to Investment
[edit]Suppose that one has $20 to invest. There are seven possible investments (denoted by A to G), with costs and return as follows:
Investment | Cost | Return | Ret/Cost |
---|---|---|---|
A | 5 | 8 | 1.6 |
B | 10 | 12 | 1.2 |
C | 10 | 20 | 2.0 |
D | 15 | 27 | 1.8 |
E | 20 | 6 | 0.3 |
F | 20 | 12 | 0.6 |
G | 20 | 15 | 0.75 |
Total | 100 | 100 |
Obviously, investing the money in E, F or G is a bad strategy, since they're money losers. If the $20 is invested in B and C, the return is $12+$20=$32. If the investment is in A and D, the return is $35.
However, the best strategy for a $20 investment, if fractional shares are allowed, is C plus two-thirds of D. The return is then 20+⅔×27=38 dollars. Simply, C is the highest-returning investment (i.e. highest ratio), followed by D. The best strategy is to invest where the rates of return are highest.
This is exactly the Neyman-Pearson Lemma. Interpret A to G as the possible values of the random variable; the "cost/100" as the probability distribution under the null hypothesis, and "return/100" as the probability distribution under the alternative. The $20 to invest is the significance level, while the return of the chosen options is the power. The Neyman-Pearson lemma says to invest your money (or significance level) in the options where the rate-of-return (return/cost; or likelihood ratio) is highest.
Proof of the Neyman-Pearson Lemma
[edit]By definition of expectation,
Now, from the claim (see below) that for all x, , it follows that
which completes the result.
To establish the claim, consider x in three regions:
- For , and . The claim holds trivially.
- For , then both sides of the claim are equal.
- For , and . The claim still holds, since a negative number times `less than' yields a `greater than'.
Note: For discrete distributions, replace integrals by sums. More generally, measure theoreticians can use densities with respect to a measure , and integrate with respect to .
- ^ Lehman, E.L. (1986). Testing Statistical Hypotheses (Second ed.). New York: John Wiley and Sons. p. 71. ISBN 0 471-84083-1.