» Articles » PMID: 21495059

Bias in Estimating Accuracy of a Binary Screening Test with Differential Disease Verification

Overview
Journal Stat Med
Publisher Wiley
Specialty Public Health
Date 2011 Apr 16
PMID 21495059
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Sensitivity, specificity, positive and negative predictive value are typically used to quantify the accuracy of a binary screening test. In some studies, it may not be ethical or feasible to obtain definitive disease ascertainment for all subjects using a gold standard test. When a gold standard test cannot be used, an imperfect reference test that is less than 100 per cent sensitive and specific may be used instead. In breast cancer screening, for example, follow-up for cancer diagnosis is used as an imperfect reference test for women where it is not possible to obtain gold standard results. This incomplete ascertainment of true disease, or differential disease verification, can result in biased estimates of accuracy. In this paper, we derive the apparent accuracy values for studies subject to differential verification. We determine how the bias is affected by the accuracy of the imperfect reference test, the percent who receive the imperfect reference standard test not receiving the gold standard, the prevalence of the disease, and the correlation between the results for the screening test and the imperfect reference test. It is shown that designs with differential disease verification can yield biased estimates of accuracy. Estimates of sensitivity in cancer screening trials may be substantially biased. However, careful design decisions, including selection of the imperfect reference test, can help to minimize bias. A hypothetical breast cancer screening study is used to illustrate the problem.

Citing Articles

Estimating Cancer Screening Sensitivity and Specificity Using Healthcare Utilization Data: Defining the Accuracy Assessment Interval.

Chubak J, Burnett-Hartman A, Barlow W, Corley D, Croswell J, Neslund-Dudas C Cancer Epidemiol Biomarkers Prev. 2022; 31(8):1517-1520.

PMID: 35916602 PMC: 9484579. DOI: 10.1158/1055-9965.EPI-22-0232.


Adjusting for verification bias in diagnostic accuracy measures when comparing multiple screening tests - an application to the IP1-PROSTAGRAM study.

Day E, Eldred-Evans D, Prevost A, Ahmed H, Fiorentino F BMC Med Res Methodol. 2022; 22(1):70.

PMID: 35300611 PMC: 8932251. DOI: 10.1186/s12874-021-01481-w.


Diagnostic test evaluation methodology: A systematic review of methods employed to evaluate diagnostic tests in the absence of gold standard - An update.

Umemneku Chikere C, Wilson K, Graziadio S, Vale L, Allen A PLoS One. 2019; 14(10):e0223832.

PMID: 31603953 PMC: 6788703. DOI: 10.1371/journal.pone.0223832.


Anticipating missing reference standard data when planning diagnostic accuracy studies.

Naaktgeboren C, de Groot J, Rutjes A, Bossuyt P, Reitsma J, Moons K BMJ. 2016; 352:i402.

PMID: 26861453 PMC: 4772780. DOI: 10.1136/bmj.i402.


Exploring the Underdiagnosis and Prevalence of Autism Spectrum Conditions in Beijing.

Sun X, Allison C, Matthews F, Zhang Z, Auyeung B, Baron-Cohen S Autism Res. 2015; 8(3):250-60.

PMID: 25952676 PMC: 4690159. DOI: 10.1002/aur.1441.


References
1.
Pisano E, Gatsonis C, Hendrick E, Yaffe M, Baum J, Acharyya S . Diagnostic performance of digital versus film mammography for breast-cancer screening. N Engl J Med. 2005; 353(17):1773-83. DOI: 10.1056/NEJMoa052911. View

2.
Reitsma J, Rutjes A, Khan K, Coomarasamy A, Bossuyt P . A review of solutions for diagnostic accuracy studies with an imperfect or missing reference standard. J Clin Epidemiol. 2009; 62(8):797-806. DOI: 10.1016/j.jclinepi.2009.02.005. View

3.
Zhou X . Correcting for verification bias in studies of a diagnostic test's accuracy. Stat Methods Med Res. 1999; 7(4):337-53. DOI: 10.1177/096228029800700403. View

4.
Panzer R, Suchman A, Griner P . Workup bias in prediction research. Med Decis Making. 1987; 7(2):115-9. DOI: 10.1177/0272989X8700700209. View

5.
Skaane P, Hofvind S, Skjennald A . Randomized trial of screen-film versus full-field digital mammography with soft-copy reading in population-based screening program: follow-up and final results of Oslo II study. Radiology. 2007; 244(3):708-17. DOI: 10.1148/radiol.2443061478. View