Avtor/Urednik     Lusa, Lara; McShane, Lisa M; Radmacher, Michael D; Shih, Joanna H; Wright, George W; Simon, Richard
Naslov     Appropriateness of some resampling-based inference procedures for assessing performance of prognostic classifiers derived from microarray data
Tip     članek
Vir     Stat Med
Vol. in št.     Letnik 26, št. 5
Leto izdaje     2007
Obseg     str. 1102-13
Jezik     eng
Abstrakt     The goal of many gene-expression microarray profiling clinical studies is to develop a multivariate classifier to predict patient disease outcome from a gene-expression profile measured on some biological specimen from the patient. Often some preliminary validation of the predictive power of a profile-based classifier is carried out using the same data set that was used to derive the classifier. Techniques such as cross-validation or bootstrapping can be used in this setting to assess predictive power, and if applied correctly, can result in a less biased estimate of predictive accuracy of a classifier. However, some investigators have attempted to apply standard statistical inference procedures to assess the statistical significance of associations between true and cross-validated predicted outcomes. We demonstrate in this paper that naive application of standard statistical inference procedures to these measures of association under null situations can result in greatly inflated testing type I error rates. Under alternatives of small to moderate associations, confidence interval coverage probabilities may be too low, although for very large associations coverage probabilities approach their intended values. Our results suggest that caution should be exercised in interpreting some of the claims of exceptional prognostic classifier performance that have been reported in prominent biomedical journals in the past few years.
Deskriptorji     DATA INTERPRETATION, STATISTICAL
CONFIDENCE INTERVALS
DNA FINGERPRINTING
ODDS RATIO