ADVERTISEMENT

If you are seeing this message, you may be experiencing temporary network problems. Please wait a few minutes and refresh the page. If the problem persists, you may wish to report it to your local Network Manager.

It is also possible that your web browser is not configured or not able to display style sheets. In this case, although the visual presentation will be degraded, the site should continue to be functional. We recommend using the latest version of Microsoft or Mozilla web browser to help minimise these problems.

Wiley InterScience

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Volume 69 Issue 3, Pages 347 - 368

Published Online: 22 May 2007

© 2010 The Royal Statistical Society and Blackwell Publishing Ltd



< Previous Abstract  |  Next Abstract >

Save Article to My Profile      Download Citation      Request Permissions

Abstract |  References  |  Full Text: HTML, PDF (Size: 707K)  | Related Articles | Citation Tracking

The optimal discovery procedure: a new approach to simultaneous significance testing
John D. Storey 1
  1 University of Washington, Seattle, USA
Correspondence to John D. Storey, Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.
E-mail: jstorey@u.washington.edu
Copyright 2007 Royal Statistical Society
KEYWORDS
Classification • False discovery rate • Multiple-hypothesis testing • Optimal discovery procedure • q-value • Single-thresholding procedure

ABSTRACT

Summary. The Neyman–Pearson lemma provides a simple procedure for optimally testing a single hypothesis when the null and alternative distributions are known. This result has played a major role in the development of significance testing strategies that are used in practice. Most of the work extending single-testing strategies to multiple tests has focused on formulating and estimating new types of significance measures, such as the false discovery rate. These methods tend to be based on p-values that are calculated from each test individually, ignoring information from the other tests. I show here that one can improve the overall performance of multiple significance tests by borrowing information across all the tests when assessing the relative significance of each one, rather than calculating p-values for each test individually. The 'optimal discovery procedure' is introduced, which shows how to maximize the number of expected true positive results for each fixed number of expected false positive results. The optimality that is achieved by this procedure is shown to be closely related to optimality in terms of the false discovery rate. The optimal discovery procedure motivates a new approach to testing multiple hypotheses, especially when the tests are related. As a simple example, a new simultaneous procedure for testing several normal means is defined; this is surprisingly demonstrated to outperform the optimal single-test procedure, showing that a method which is optimal for single tests may no longer be optimal for multiple tests. Connections to other concepts in statistics are discussed, including Stein's paradox, shrinkage estimation and the Bayesian approach to hypothesis testing.


[Received December 2005. Revised December 2006]

DIGITAL OBJECT IDENTIFIER (DOI)
10.1111/j.1467-9868.2007.005592.x About DOI

Related Articles

  • Find other articles like this in Wiley InterScience
  • Find articles in Wiley InterScience written by any of the authors

Wiley InterScience is a member of CrossRef.

Cross Ref Member


Also of Interest

Statistics

Wiley-Blackwell is the largest publisher of society-based statistics journals and No. 1 in terms of quality and international scope.

Wiley-Blackwell publishes 19 statistics journals and is now the top publisher of Thomson Reuters ranked statistics journals.

Discover more about the statistics portfolio

Hot Papers
RSS

Journal of the Royal Statistical Society

See the Papers attracting early citation:

Series A: Statistics in Society
A re-evaluation of random-effects meta-analysis

Series B: Statistical Methodology
Testing for lack of fit in inverse regression—with applications to biophotonic imaging

Series C: Applied Statistics
A multifaceted sensitivity analysis of the Slovenian public opinion survey data

Announcing
SIGN

Significance

2010 Crystal Ball Competition

Try to forecast the results of 10 different events, some sporting, some cultural, some just odd, that will take place between May and July 2010.
Cash prizes and books for winners.

Take part

Check out the rules

Have Fun!