» Articles » PMID: 29020131

Data-Adaptive Estimation for Double-Robust Methods in Population-Based Cancer Epidemiology: Risk Differences for Lung Cancer Mortality by Emergency Presentation

Overview
Journal Am J Epidemiol
Specialty Public Health
Date 2017 Oct 12
PMID 29020131
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

In this paper, we propose a structural framework for population-based cancer epidemiology and evaluate the performance of double-robust estimators for a binary exposure in cancer mortality. We conduct numerical analyses to study the bias and efficiency of these estimators. Furthermore, we compare 2 different model selection strategies based on 1) Akaike's Information Criterion and the Bayesian Information Criterion and 2) machine learning algorithms, and we illustrate double-robust estimators' performance in a real-world setting. In simulations with correctly specified models and near-positivity violations, all but the naive estimators had relatively good performance. However, the augmented inverse-probability-of-treatment weighting estimator showed the largest relative bias. Under dual model misspecification and near-positivity violations, all double-robust estimators were biased. Nevertheless, the targeted maximum likelihood estimator showed the best bias-variance trade-off, more precise estimates, and appropriate 95% confidence interval coverage, supporting the use of the data-adaptive model selection strategies based on machine learning algorithms. We applied these methods to estimate adjusted 1-year mortality risk differences in 183,426 lung cancer patients diagnosed after admittance to an emergency department versus persons with a nonemergency cancer diagnosis in England (2006-2013). The adjusted mortality risk (for patients diagnosed with lung cancer after admittance to an emergency department) was 16% higher in men and 18% higher in women, suggesting the importance of interventions targeting early detection of lung cancer signs and symptoms.

Citing Articles

Machine learning in causal inference for epidemiology.

Moccia C, Moirano G, Popovic M, Pizzi C, Fariselli P, Richiardi L Eur J Epidemiol. 2024; 39(10):1097-1108.

PMID: 39535572 PMC: 11599438. DOI: 10.1007/s10654-024-01173-x.


SARS-CoV-2 infection by trimester of pregnancy and adverse perinatal outcomes: a Mexican retrospective cohort study.

Ghosh R, Gutierrez J, Ascencio-Montiel I, Juarez-Flores A, Bertozzi S BMJ Open. 2024; 14(4):e075928.

PMID: 38604636 PMC: 11015228. DOI: 10.1136/bmjopen-2023-075928.


Deep Ensemble Machine Learning Framework for the Estimation of Concentrations.

Yu W, Li S, Ye T, Xu R, Song J, Guo Y Environ Health Perspect. 2022; 130(3):37004.

PMID: 35254864 PMC: 8901043. DOI: 10.1289/EHP9752.


Impact of androgen deprivation therapy on mortality of prostate cancer patients with COVID-19: a propensity score-based analysis.

Duarte M, Leal F, Argenton J, Carvalheira J Infect Agent Cancer. 2021; 16(1):66.

PMID: 34823563 PMC: 8614632. DOI: 10.1186/s13027-021-00406-y.


Introduction to computational causal inference using reproducible Stata, R, and Python code: A tutorial.

Smith M, Mansournia M, Maringe C, Zivich P, Cole S, Leyrat C Stat Med. 2021; 41(2):407-432.

PMID: 34713468 PMC: 11795351. DOI: 10.1002/sim.9234.


References
1.
Tataru D, Jack R, Lind M, Moller H, Luchtenborg M . The effect of emergency presentation on surgery and survival in lung cancer patients in England, 2006-2008. Cancer Epidemiol. 2015; 39(4):612-6. DOI: 10.1016/j.canep.2015.04.008. View

2.
Leon S, Tsiatis A, Davidian M . Semiparametric estimation of treatment effect in a pretest-posttest study. Biometrics. 2004; 59(4):1046-55. DOI: 10.1111/j.0006-341x.2003.00120.x. View

3.
Keil A, Edwards J, Richardson D, Naimi A, Cole S . The parametric g-formula for time-to-event data: intuition and a worked example. Epidemiology. 2014; 25(6):889-97. PMC: 4310506. DOI: 10.1097/EDE.0000000000000160. View

4.
Pirracchio R, Petersen M, Carone M, Resche Rigon M, Chevret S, van der Laan M . Mortality prediction in intensive care units with the Super ICU Learner Algorithm (SICULA): a population-based study. Lancet Respir Med. 2014; 3(1):42-52. PMC: 4321691. DOI: 10.1016/S2213-2600(14)70239-5. View

5.
Siesling S, Louwman W, Kwast A, van den Hurk C, OCallaghan M, Rosso S . Uses of cancer registries for public health and clinical research in Europe: Results of the European Network of Cancer Registries survey among 161 population-based cancer registries during 2010-2012. Eur J Cancer. 2014; 51(9):1039-49. DOI: 10.1016/j.ejca.2014.07.016. View