» Articles » PMID: 18224558

McNemar Chi2 Test Revisited: Comparing Sensitivity and Specificity of Diagnostic Examinations

Overview
Publisher Informa Healthcare
Specialty Science
Date 2008 Jan 29
PMID 18224558
Citations 84
Authors
Affiliations
Soon will be listed here.
Abstract

When evaluating a novel diagnostic examination for clinical use, it should be compared with a reference standard, defined as the best available examination, which may include clinical and laboratory criteria. The novel examination and reference standard's results are usually presented in the form of a 2 x 2 table, which allows calculation of sensitivity, specificity and accuracy. It has been recommended that the measures of statistical uncertainty should be reported, such as the 95% confidence interval, when evaluating the accuracy of diagnostic examinations. Comparing the difference in sensitivity or specificity of a novel examination with the reference standard is important when evaluating its usefulness. The McNemar chi(2) test, used to compare discordance of two dichotomous responses, can be applied for this purpose. However, applying the McNemar test to a 2 x 2 table for comparing the accuracy of examinations is not recommended, since this test is sensitive to the proportion of positive versus negative subjects. Moreover, if the novel examination has higher sensitivity than the one considered as the reference standard, constructing a classic 2 x 2 table would result in low specificity of the novel examination. Thus, in order to compare sensitivities and specificities between examinations, this table is inappropriate and an independent reference standard is necessary. In this article, we propose the use of the McNemar chi(2) test to compare sensitivities between examinations using a 2 x 2 table exclusively among diseased patients, defined by a set of criteria and follow-up of patients. Likewise, specificities can be compared applying the McNemar test among healthy individuals.

Citing Articles

A Digital Parenting Intervention With Intimate Partner Violence Prevention Content: Quantitative Pre-Post Pilot Study.

Schafer M, Lachman J, Zinser P, Calderon Alfaro F, Han Q, Facciola C JMIR Form Res. 2025; 9():e58611.

PMID: 39753219 PMC: 11748420. DOI: 10.2196/58611.


Evaluating SARC-F, SARC-CalF, and calf circumference as diagnostic tools for sarcopenia in Thai older adults: results from a nationwide study.

Vanitcharoenkul E, Unnanuntana A, Chotiyarnwong P, Adulkasem N, Asavamongkolkul A, Laohaprasitiporn P BMC Geriatr. 2024; 24(1):1043.

PMID: 39732657 PMC: 11681680. DOI: 10.1186/s12877-024-05637-3.


Advancing poliovirus eradication: lessons learned from piloting direct molecular detection of polioviruses in high-risk and priority geographies.

Marcet P, Short B, Deas A, Sun H, Harrington C, Shaukat S Microbiol Spectr. 2024; 13(2):e0227924.

PMID: 39665559 PMC: 11792523. DOI: 10.1128/spectrum.02279-24.


Comparison of AI-integrated pathways with human-AI interaction in population mammographic screening for breast cancer.

Frazer H, Pena-Solorzano C, Kwok C, Elliott M, Chen Y, Wang C Nat Commun. 2024; 15(1):7525.

PMID: 39214982 PMC: 11364867. DOI: 10.1038/s41467-024-51725-8.


Development and Clinical Evaluation of a CRISPR/Cas12a-Based Nucleic Acid Detection Platform for the Diagnosis of Keratomycoses.

Deivarajan H, Elamurugan V, Sivashanmugam P, Pandian J, Sevugamurthi K, Rameshkumar G Ophthalmol Sci. 2024; 4(5):100522.

PMID: 38881611 PMC: 11179415. DOI: 10.1016/j.xops.2024.100522.