
Diagnostic Test Evaluation Methodology: A Systematic Review of Methods Employed to Evaluate Diagnostic Tests in the Absence of Gold Standard - An Update

Overview
Journal PLoS One
Date 2019 Oct 12
PMID 31603953
Citations 69
Abstract

Objective: To systematically review methods developed and employed to evaluate the diagnostic accuracy of medical tests when the gold standard is missing for some participants or is not available at all.

Study Design And Settings: Articles that proposed or applied any method to evaluate the diagnostic accuracy of medical test(s) in the absence of a gold standard were reviewed. The protocol for this review was registered in PROSPERO (CRD42018089349).

Results: Identified methods were classified into four main groups: methods employed when the gold standard is missing for some participants; correction methods (which adjust for an imperfect reference standard whose diagnostic accuracy measures are known); methods that evaluate a medical test using multiple imperfect reference standards; and other methods, such as agreement studies and a mixed group of alternative study designs. The review identified fifty-one statistical methods developed to evaluate medical test(s) when the true disease status of some participants is not verified with the gold standard, seven correction methods, and four methods that evaluate medical test(s) using multiple imperfect reference standards. Flow diagrams were developed to guide the selection of appropriate methods.
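To illustrate what a correction method in the second group does, the following is a minimal Python sketch of an algebraic correction for an imperfect reference standard with known sensitivity and specificity. It is a textbook-style derivation that assumes the index test and the reference standard err independently given the true disease status; it is not necessarily one of the seven correction methods identified in the review. The function name, argument names, and example counts are hypothetical.

```python
def correct_for_imperfect_reference(a, b, c, d, se_ref, sp_ref):
    """Correct index-test accuracy for an imperfect reference standard.

    2x2 table of index test (T) against reference standard (R):
        a = T+ R+,  b = T+ R-,  c = T- R+,  d = T- R-
    se_ref, sp_ref: known sensitivity and specificity of the reference.
    Assumes T and R are conditionally independent given true disease status.
    Returns (sensitivity, specificity, prevalence) of the index test setting.
    """
    n = a + b + c + d
    if n <= 0:
        raise ValueError("table must contain at least one participant")
    youden = se_ref + sp_ref - 1.0
    if youden <= 0:
        raise ValueError("reference standard must be better than chance")

    p_ref_pos = (a + c) / n            # observed proportion of R+
    sens_denom = p_ref_pos + sp_ref - 1.0
    spec_denom = se_ref - p_ref_pos
    if sens_denom <= 0 or spec_denom <= 0:
        raise ValueError("observed data are inconsistent with se_ref/sp_ref")

    # Corrected sensitivity and specificity of the index test.
    sens = ((a / n) * sp_ref - (b / n) * (1.0 - sp_ref)) / sens_denom
    spec = ((d / n) * se_ref - (c / n) * (1.0 - se_ref)) / spec_denom
    # Corrected prevalence from the observed proportion of R+.
    prev = sens_denom / youden
    return sens, spec, prev


# Hypothetical counts, constructed so the true values are
# sensitivity 0.90, specificity 0.80, prevalence 0.30.
sens, spec, prev = correct_for_imperfect_reference(
    a=2705, b=1395, c=845, d=5055, se_ref=0.95, sp_ref=0.90
)
print(round(sens, 2), round(spec, 2), round(prev, 2))
```

Treating the reference as perfect in this hypothetical table would give a sensitivity of 2705/3550 (about 0.76) and a specificity of 5055/6450 (about 0.78), which shows the direction of the bias the correction removes. The conditional independence assumption is strong, and the correction can produce estimates outside [0, 1] when it is violated or when samples are small.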

Conclusion: Various methods have been proposed to evaluate medical test(s) in the absence of a gold standard for some or all participants in a diagnostic accuracy study. These methods depend on the availability of the gold standard, its application to the participants in the study, and the availability of alternative reference standard(s). However, the clinical application of some of these methods, especially those developed for a missing gold standard, is limited. This may be due to the complexity of these methods and/or a disconnect between the fields of expertise of those who develop the methods (e.g. mathematicians) and those who employ them (e.g. clinical researchers). This review aims to help close this gap with its classification and guidance tools.

Citing Articles

Validation of the alcohol use disorders identification test in a Danish hospital setting.

Scholer P, Andersen M, Andersen K, Becker U, Thiele M, Nielsen A Subst Abuse Treat Prev Policy. 2025; 20(1):7.

PMID: 39953621 PMC: 11829362. DOI: 10.1186/s13011-025-00638-w.


EM-AUC: A Novel Algorithm for Evaluating Anomaly Based Network Intrusion Detection Systems.

Bai K, Fossaceca J Sensors (Basel). 2025; 25(1.

PMID: 39796869 PMC: 11723195. DOI: 10.3390/s25010078.


Laboratory-based molecular test alternatives to RT-PCR for the diagnosis of SARS-CoV-2 infection.

Arevalo-Rodriguez I, Mateos-Haro M, Dinnes J, Ciapponi A, Davenport C, Buitrago-Garcia D Cochrane Database Syst Rev. 2024; 10:CD015618.

PMID: 39400904 PMC: 11472845. DOI: 10.1002/14651858.CD015618.


RT-qPCR Testing and Performance Metrics in the COVID-19 Era.

Bustin S Int J Mol Sci. 2024; 25(17).

PMID: 39273275 PMC: 11394961. DOI: 10.3390/ijms25179326.


A review of methods for the analysis of diagnostic tests performed in sequence.

Fanshawe T, Nicholson B, Perera R, Oke J Diagn Progn Res. 2024; 8(1):8.

PMID: 39223640 PMC: 11370044. DOI: 10.1186/s41512-024-00175-3.

