False coverage rate: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
AnomieBOT (talk | contribs)
m Dating maintenance tags: {{Cn}}
Amyxz (talk | contribs)
latex-ed some of the equations and tried to make the writing more clear for the beginning of the article.
Line 3: Line 3:
The FCR gives a simultaneous coverage at a (1-α)% level for all of the parameters considered in the problem. The FCR has a strong connection to the [[False Discovery Rate]] (FDR). Both methods address the Problem of [[multiple comparisons]], FCR from CI's ([[confidence intervals]])and FDR from P value's point if view.
The FCR gives a simultaneous coverage at a (1-α)% level for all of the parameters considered in the problem. The FCR has a strong connection to the [[False Discovery Rate]] (FDR). Both methods address the Problem of [[multiple comparisons]], FCR from CI's ([[confidence intervals]])and FDR from P value's point if view.


FCR was needed as an answer for dangers caused by selective inference: reporting, attending or highlighting just what we choose , as many researchers and scientists tend to put in their article only what was significant. There are many FCR procedures offered among them: Bonferroni-Selected–Bonferroni-Adjusted {{cn|date=August 2012}}, Adjusted BH-Selected CIs (Benjamini and Yekutieli (2005){{cn|date=August 2012}}), Bayes FCR (Yekutieli (2008){{cn|date=August 2012}}). The incentive of choosing one procedure over another is the length of the CI we will want it to be narrow as possible and to keep the FCR. For [[microarray]] experiments and other modern applications, there are a huge number of parameters, often tens of thousands or more and it is very important to choose the most powerful procedure.
FCR was needed because of dangers caused by selective inference. Researchers and scientists tend to report or highlight only the portion of data that is considered significant. It is therefore necessary to understand how the data is falsely covered. There are many FCR procedures which can be used depending on the length of the CI - Bonferroni-Selected–Bonferroni-Adjusted {{cn|date=August 2012}}, Adjusted BH-Selected CIs (Benjamini and Yekutieli (2005){{cn|date=August 2012}}), Bayes FCR (Yekutieli (2008){{cn|date=August 2012}}). The incentive of choosing one procedure over another is to ensure that the CI is as narrow as possible and to keep the FCR. For [[microarray]] experiments and other modern applications, there are a huge number of [[Statistical parameter|parameters]], often tens of thousands or more and it is very important to choose the most powerful procedure.


==Definitions==
==Definitions==


Not keeping the FCR means FCR>q when q= V/R = α*m0/R, m0=number of true null hypothesis and R=number of rejected hypothesis. Intervals with simultaneous coverage probability 1q can control the FCR to be bounded by q.
Not keeping the FCR means <math>FCR>q</math> when <math>q= \frac{V}{R} = \frac{\alpha*m_0}{R} </math>, where <math>m_0 </math> is the number of true null hypotheses and <math>R </math> is the number of rejected hypothesis. Intervals with simultaneous coverage probability <math>1-q </math> can control the FCR to be bounded by <math>q</math>.


==The problems addressed by FCR==
==The problems addressed by FCR==
===Selection===
1. The problem caused by selection: Selection causes reduced average coverage. Selection can be presented as conditioning on an event defined by the data and may affect the coverage probability of a CI for a single parameter. Equivalently, the problem of selection changes the basic sense of P-values which is solved with FDR procedures. P-values and Confidence Intervals also known to us as equivalent rules for rejection or acceptance of the null hypothesis. Solution: the goal of conditional coverage following any selection rule for any set of (unknown) values for the parameters is impossible to achieve. A weaker property when it comes to selective CIs may be achieved: the obstacle to avoid is that of making a false coverage statement. FCR can be recognized as a measure of interval coverage following selection. Therefore, even though a 1 − α CI does not offer selective (conditional)coverage, the probability of constructing a no covering CI is at most α:
[[Selection bias|Selection]] causes reduced average coverage. Selection can be presented as conditioning on an event defined by the data and may affect the coverage probability of a CI for a single [[Statistical parameter|parameter]]. Equivalently, the problem of selection changes the basic sense of [[P-values]]. FCR procedures consider that the goal of conditional coverage following any selection rule for any set of (unknown) values for the parameters is impossible to achieve. A weaker property when it comes to selective CIs is possible and will avoid false coverage statements. FCR is a measure of interval coverage following selection. Therefore, even though a 1−α CI does not offer selective ([[Conditional probability|conditional]]) coverage, the probability of constructing a no covering CI is at most α, where


Pr {θ /∈ CI, CI constructed} Pr {θ /∈ CI} α.
:<math>Pr[\theta \in CI, CI constructed] \leq Pr[\theta \in CI] \leq \alpha</math>


===Selection and Multiplicity===
2. The problem caused by combination of selection and multiplicity: When facing both multiplicity (inference about multiple parameters) and selection, not only is the expected proportion of coverage over selected parameters at 1−α not equivalent to the expected proportion of no coverage at α, but also the latter no longer can be ensured by constructing marginal CIs for each selected parameter. Solution: taking the expected proportion of parameters not covered by their CIs among the selected parameters, where the proportion is 0 if no parameter is selected. This false coverage-statement rate (FCR) is a property of any procedure that is defined by the way in which parameters are selected and the way in which the multiple intervals are constructed.
When facing both multiplicity (inference about multiple parameters) and [[Selection bias|selection]], not only is the expected proportion of coverage over selected parameters at 1−α not equivalent to the expected proportion of no coverage at α, but also the latter can no longer be ensured by constructing marginal CIs for each selected parameter. FCR procedures solve this by taking the expected proportion of parameters not covered by their CIs among the selected parameters, where the proportion is 0 if no parameter is selected. This false coverage-statement rate (FCR) is a property of any procedure that is defined by the way in which parameters are selected and the way in which the multiple intervals are constructed.


==Procedures==
==Procedures==

Revision as of 15:29, 21 August 2012

The False Coverage Rate(FCR) indicates the average rate of false coverage, namely, not covering the true parameters, among the selected intervals.

The FCR gives a simultaneous coverage at a (1-α)% level for all of the parameters considered in the problem. The FCR has a strong connection to the False Discovery Rate (FDR). Both methods address the Problem of multiple comparisons, FCR from CI's (confidence intervals)and FDR from P value's point if view.

FCR was needed because of dangers caused by selective inference. Researchers and scientists tend to report or highlight only the portion of data that is considered significant. It is therefore necessary to understand how the data is falsely covered. There are many FCR procedures which can be used depending on the length of the CI - Bonferroni-Selected–Bonferroni-Adjusted [citation needed], Adjusted BH-Selected CIs (Benjamini and Yekutieli (2005)[citation needed]), Bayes FCR (Yekutieli (2008)[citation needed]). The incentive of choosing one procedure over another is to ensure that the CI is as narrow as possible and to keep the FCR. For microarray experiments and other modern applications, there are a huge number of parameters, often tens of thousands or more and it is very important to choose the most powerful procedure.

Definitions

Not keeping the FCR means when , where is the number of true null hypotheses and is the number of rejected hypothesis. Intervals with simultaneous coverage probability can control the FCR to be bounded by .

The problems addressed by FCR

Selection

Selection causes reduced average coverage. Selection can be presented as conditioning on an event defined by the data and may affect the coverage probability of a CI for a single parameter. Equivalently, the problem of selection changes the basic sense of P-values. FCR procedures consider that the goal of conditional coverage following any selection rule for any set of (unknown) values for the parameters is impossible to achieve. A weaker property when it comes to selective CIs is possible and will avoid false coverage statements. FCR is a measure of interval coverage following selection. Therefore, even though a 1−α CI does not offer selective (conditional) coverage, the probability of constructing a no covering CI is at most α, where

Selection and Multiplicity

When facing both multiplicity (inference about multiple parameters) and selection, not only is the expected proportion of coverage over selected parameters at 1−α not equivalent to the expected proportion of no coverage at α, but also the latter can no longer be ensured by constructing marginal CIs for each selected parameter. FCR procedures solve this by taking the expected proportion of parameters not covered by their CIs among the selected parameters, where the proportion is 0 if no parameter is selected. This false coverage-statement rate (FCR) is a property of any procedure that is defined by the way in which parameters are selected and the way in which the multiple intervals are constructed.

Procedures

Bonferroni procedure ( Bonferroni-Selected–Bonferroni-Adjusted) for simultaneous CI

Simultaneous CIs with Bonferroni procedure when we have m parameters, each marginal CI constructed at the 1 − α/m level. Without selection, these CIs offer simultaneous coverage, in the sense that the probability that all CIs cover their respective parameters is at least 1 − α. unfortunately, even such a strong property does not ensure the conditional confidence property following selection.


FCR for Bonferroni-Selected–Bonferroni-Adjusted simultaneous CI

The Bonferroni–Bonferroni procedure cannot offer conditional coverage, however it does control the FCR at <α In fact it does so too well, in the sense that the FCR is much too close to 0 for large values of θ Intervals selection is based on Bonferroni testing, and Bonferroni CIs are then constructed. The FCR is estimated as, the proportion of intervals failing to cover their respective parameters among the constructed CIs is calculated (setting the proportion to 0 when none are selected). Where selection is based on unadjusted individual testing and unadjusted CIs are constructed.

FCR-Adjusted BH-Selected CIs

In BH Procedure for FDR after sorting the p values P(1) ≤• • •≤ P(m) and calculating R = max{ j :P( j) ≤ j • q/m}, the R null hypotheses for which P(i) ≤ R • q/m are rejected. If testing is done using the Bonferroni procedure, then the lower bound of the FCR may drop well below the desired level q, implying that the intervals are too long. In contrast, applying the following procedure, which combines the general procedure with the FDR controlling testing in the BH procedure, also yields a lower bound for the FCR, q/2 ≤ FCR. This procedure is sharp in the sense that for some configurations, the FCR approaches q.

1. Sort the p values used for testing the m hypotheses regarding the parameters,P(1) ≤• • •≤P(m).

2. Calculate R = max{i :P(i) ≤ i • q/m}.

3. Select the R parameters for which P(i) ≤ R • q/m, corresponding to the rejected hypotheses.

4. Construct a 1− R • q/m CI for each parameter selected.

See also

Key concepts

References

Footnotes

Sources

  • Benjamini, Yoav; Yekutieli, Daniel (March 2005). "False Discovery Rate–Adjusted Multiple Confidence Intervals for Selected Parameters" (pdf). Journal of the American Statistical Association. 100 (469): 71–93. doi:10.1198/016214504000001907.
  • Zhao, Zhigen; Hwang, J. T. Gene (2012). "Empirical Bayes false coverage rate controlling confidence intervals" (pdf). Journal of the Royal Statistical Society: Series B (Statistical Methodology). doi:10.1111/j.1467-9868.2012.01033.x.