Bias in Estimating Accuracy of a Binary Screening Test with Differential Disease Verification
Overview
Affiliations
Sensitivity, specificity, positive and negative predictive value are typically used to quantify the accuracy of a binary screening test. In some studies, it may not be ethical or feasible to obtain definitive disease ascertainment for all subjects using a gold standard test. When a gold standard test cannot be used, an imperfect reference test that is less than 100 per cent sensitive and specific may be used instead. In breast cancer screening, for example, follow-up for cancer diagnosis is used as an imperfect reference test for women where it is not possible to obtain gold standard results. This incomplete ascertainment of true disease, or differential disease verification, can result in biased estimates of accuracy. In this paper, we derive the apparent accuracy values for studies subject to differential verification. We determine how the bias is affected by the accuracy of the imperfect reference test, the percent who receive the imperfect reference standard test not receiving the gold standard, the prevalence of the disease, and the correlation between the results for the screening test and the imperfect reference test. It is shown that designs with differential disease verification can yield biased estimates of accuracy. Estimates of sensitivity in cancer screening trials may be substantially biased. However, careful design decisions, including selection of the imperfect reference test, can help to minimize bias. A hypothetical breast cancer screening study is used to illustrate the problem.
Chubak J, Burnett-Hartman A, Barlow W, Corley D, Croswell J, Neslund-Dudas C Cancer Epidemiol Biomarkers Prev. 2022; 31(8):1517-1520.
PMID: 35916602 PMC: 9484579. DOI: 10.1158/1055-9965.EPI-22-0232.
Day E, Eldred-Evans D, Prevost A, Ahmed H, Fiorentino F BMC Med Res Methodol. 2022; 22(1):70.
PMID: 35300611 PMC: 8932251. DOI: 10.1186/s12874-021-01481-w.
Umemneku Chikere C, Wilson K, Graziadio S, Vale L, Allen A PLoS One. 2019; 14(10):e0223832.
PMID: 31603953 PMC: 6788703. DOI: 10.1371/journal.pone.0223832.
Anticipating missing reference standard data when planning diagnostic accuracy studies.
Naaktgeboren C, de Groot J, Rutjes A, Bossuyt P, Reitsma J, Moons K BMJ. 2016; 352:i402.
PMID: 26861453 PMC: 4772780. DOI: 10.1136/bmj.i402.
Exploring the Underdiagnosis and Prevalence of Autism Spectrum Conditions in Beijing.
Sun X, Allison C, Matthews F, Zhang Z, Auyeung B, Baron-Cohen S Autism Res. 2015; 8(3):250-60.
PMID: 25952676 PMC: 4690159. DOI: 10.1002/aur.1441.