» Articles » PMID: 18258670

Bias in Sensitivity and Specificity Caused by Data-driven Selection of Optimal Cutoff Values: Mechanisms, Magnitude, and Solutions

Overview
Journal Clin Chem
Specialty Biochemistry
Date 2008 Feb 9
PMID 18258670
Citations 125
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Optimal cutoff values for tests results involving continuous variables are often derived in a data-driven way. This approach, however, may lead to overly optimistic measures of diagnostic accuracy. We evaluated the magnitude of the bias in sensitivity and specificity associated with data-driven selection of cutoff values and examined potential solutions to reduce this bias.

Methods: Different sample sizes, distributions, and prevalences were used in a simulation study. We compared data-driven estimates of accuracy based on the Youden index with the true values and calculated the median bias. Three alternative approaches (assuming a specific distribution, leave-one-out, smoothed ROC curve) were examined for their ability to reduce this bias.

Results: The magnitude of bias caused by data-driven optimization of cutoff values was inversely related to sample size. If the true values for sensitivity and specificity are both 84%, the estimates in studies with a sample size of 40 will be approximately 90%. If the sample size increases to 200, the estimates will be 86%. The distribution of the test results had little impact on the amount of bias when sample size was held constant. More robust methods of optimizing cutoff values were less prone to bias, but the performance deteriorated if the underlying assumptions were not met.

Conclusions: Data-driven selection of the optimal cutoff value can lead to overly optimistic estimates of sensitivity and specificity, especially in small studies. Alternative methods can reduce this bias, but finding robust estimates for cutoff values and accuracy requires considerable sample sizes.

Citing Articles

The use of ultrasonography in the transition period to estimate adipose tissue depots and their association with risk of early postpartum hyperketonemia in Holstein dairy cattle.

Westhoff T, Rodger M, Wieland M, Harper L, Stabell A, Van Althuis M JDS Commun. 2025; 6(1):125-130.

PMID: 39877169 PMC: 11770301. DOI: 10.3168/jdsc.2024-0602.


A unified framework for diagnostic test development and evaluation during outbreaks of emerging infections.

Chaturvedi M, Koster D, Bossuyt P, Gerke O, Jurke A, Kretzschmar M Commun Med (Lond). 2024; 4(1):263.

PMID: 39658579 PMC: 11632097. DOI: 10.1038/s43856-024-00691-9.


Comparison of radial immunodiffusion, turbidimetric immunoassay, and Brix refractometry for determining bovine colostrum quality.

Westhoff T, Behling-Kelly E, Mann S JDS Commun. 2024; 5(6):679-683.

PMID: 39650042 PMC: 11624326. DOI: 10.3168/jdsc.2024-0604.


Data-Driven Cutoff Selection for the Patient Health Questionnaire-9 Depression Screening Tool.

Levis B, Bhandari P, Neupane D, Fan S, Sun Y, He C JAMA Netw Open. 2024; 7(11):e2429630.

PMID: 39576645 PMC: 11584932. DOI: 10.1001/jamanetworkopen.2024.29630.


Hormone receptor mRNA and protein levels as predictors of premenopausal tamoxifen benefit.

Engstrom T, Ekholm M, Ferno M, Lundgren C, Nordenskjold B, Stal O Acta Oncol. 2024; 63:125-136.

PMID: 38587062 PMC: 11332536. DOI: 10.2340/1651-226X.2024.19655.