Calibration and Validation of the Colorectal Cancer and Adenoma Incidence and Mortality (CRC-AIM) Microsimulation Model Using Deep Neural Networks

Overview

Journal Med Decis Making

Publisher Sage Publications

Date 2023 Jul 12

PMID 37434445

Authors

Vahab Vahdat

Oguzhan Alagoz

Jing Voon Chen

Leila Saoud

Bijan J Borah

Paul J Limburg

Affiliations

Soon will be listed here.

Abstract

Objectives: Machine learning (ML)-based emulators improve the calibration of decision-analytical models, but their performance in complex microsimulation models is yet to be determined.

Methods: We demonstrated the use of an ML-based emulator with the Colorectal Cancer (CRC)-Adenoma Incidence and Mortality (CRC-AIM) model, which includes 23 unknown natural history input parameters to replicate the CRC epidemiology in the United States. We first generated 15,000 input combinations and ran the CRC-AIM model to evaluate CRC incidence, adenoma size distribution, and the percentage of small adenoma detected by colonoscopy. We then used this data set to train several ML algorithms, including deep neural network (DNN), random forest, and several gradient boosting variants (i.e., XGBoost, LightGBM, CatBoost) and compared their performance. We evaluated 10 million potential input combinations using the selected emulator and examined input combinations that best estimated observed calibration targets. Furthermore, we cross-validated outcomes generated by the CRC-AIM model with those made by CISNET models. The calibrated CRC-AIM model was externally validated using the United Kingdom Flexible Sigmoidoscopy Screening Trial (UKFSST).

Results: The DNN with proper preprocessing outperformed other tested ML algorithms and successfully predicted all 8 outcomes for different input combinations. It took 473 s for the trained DNN to predict outcomes for 10 million inputs, which would have required 190 CPU-years without our DNN. The overall calibration process took 104 CPU-days, which included building the data set, training, selecting, and hyperparameter tuning of the ML algorithms. While 7 input combinations had acceptable fit to the targets, a combination that best fits all outcomes was selected as the best vector. Almost all of the predictions made by the best vector laid within those from the CISNET models, demonstrating CRC-AIM's cross-model validity. Similarly, CRC-AIM accurately predicted the hazard ratios of CRC incidence and mortality as reported by UKFSST, demonstrating its external validity. Examination of the impact of calibration targets suggested that the selection of the calibration target had a substantial impact on model outcomes in terms of life-year gains with screening.

Conclusions: Emulators such as a DNN that is meticulously selected and trained can substantially reduce the computational burden of calibrating complex microsimulation models.

Highlights: Calibrating a microsimulation model, a process to find unobservable parameters so that the model fits observed data, is computationally complex.We used a deep neural network model, a popular machine learning algorithm, to calibrate the Colorectal Cancer Adenoma Incidence and Mortality (CRC-AIM) model.We demonstrated that our approach provides an efficient and accurate method to significantly speed up calibration in microsimulation models.The calibration process successfully provided cross-model validation of CRC-AIM against 3 established CISNET models and also externally validated against a randomized controlled trial.

Citing Articles

Environmental impact of colorectal cancer screening with colonoscopy and multi-target stool DNA (mt-sDNA) testing.

Alcock R, Shaukat A, Kisiel J, Hernandez L, Delarmente B, Estes C Health Aff Sch. 2025; 3(3):qxaf041.

PMID: 40078452 PMC: 11897791. DOI: 10.1093/haschl/qxaf041.

Impact of racial disparities in follow-up and quality of colonoscopy on colorectal cancer outcomes.

Alagoz O, May F, Doubeni C, Fendrick A, Vahdat V, Estes C J Natl Cancer Inst. 2024; 116(11):1807-1816.

PMID: 39044335 PMC: 11542987. DOI: 10.1093/jnci/djae140.

Optimal timing of a colonoscopy screening schedule depends on adenoma detection, adenoma risk, adherence to screening and the screening objective: A microsimulation study.

Zaika V, Prakash M, Cheng C, Schlander M, Lang B, Beerenwinkel N PLoS One. 2024; 19(5):e0304374.

PMID: 38787836 PMC: 11125540. DOI: 10.1371/journal.pone.0304374.

Modeling Thyroid Cancer Epidemiology in the United States Using Papillary Thyroid Carcinoma Microsimulation Model.

Alagoz O, Zhang Y, Arroyo N, Fernandes-Taylor S, Yang D, Krebsbach C Value Health. 2023; 27(3):367-375.

PMID: 38141816 PMC: 10922958. DOI: 10.1016/j.jval.2023.12.007.

A Call to Action to Increase Uptake of Follow-Up Colonoscopy After Initial Positive Stool-Based Colorectal Cancer Screening.

Fendrick A, Kisiel J, Brooks D, Vahdat V, Estes C, Ebner D Popul Health Manag. 2023; 26(6):448-450.

PMID: 37930304 PMC: 10698770. DOI: 10.1089/pop.2023.0199.

References

M de Carvalho T, van Rosmalen J, Wolff H, Koffijberg H, Coupe V . Choosing a Metamodel of a Simulation Model for Uncertainty Quantification. Med Decis Making. 2021; 42(1):28-42. DOI: 10.1177/0272989X211016307. View

Siegel R, Fedewa S, Anderson W, Miller K, Ma J, Rosenberg P . Colorectal Cancer Incidence Patterns in the United States, 1974-2013. J Natl Cancer Inst. 2017; 109(8). PMC: 6059239. DOI: 10.1093/jnci/djw322. View

Edwards B, Ward E, Kohler B, Eheman C, Zauber A, Anderson R . Annual report to the nation on the status of cancer, 1975-2006, featuring colorectal cancer trends and impact of interventions (risk factors, screening, and treatment) to reduce future rates. Cancer. 2009; 116(3):544-73. PMC: 3619726. DOI: 10.1002/cncr.24760. View

Samowitz W, Albertsen H, Herrick J, Levin T, Sweeney C, Murtaugh M . Evaluation of a large, population-based sample supports a CpG island methylator phenotype in colon cancer. Gastroenterology. 2005; 129(3):837-45. DOI: 10.1053/j.gastro.2005.06.020. View

Greuter M, Xu X, Lew J, Dekker E, Kuipers E, Canfell K . Modeling the Adenoma and Serrated pathway to Colorectal CAncer (ASCCA). Risk Anal. 2013; 34(5):889-910. DOI: 10.1111/risa.12137. View

Reddy K, Bulteel A, Levy D, Torola P, Hyle E, Hou T . Novel microsimulation model of tobacco use behaviours and outcomes: calibration and validation in a US population. BMJ Open. 2020; 10(5):e032579. PMC: 7228509. DOI: 10.1136/bmjopen-2019-032579. View

McCandlish J, Ayer T, Chhatwal J . Cost-Effectiveness and Value-of-Information Analysis Using Machine Learning-Based Metamodeling: A Case of Hepatitis C Treatment. Med Decis Making. 2022; 43(1):68-77. DOI: 10.1177/0272989X221125418. View

Brenner H, Altenhofen L, Katalinic A, Lansdorp-Vogelaar I, Hoffmeister M . Sojourn time of preclinical colorectal cancer by sex and age: estimates from the German national screening colonoscopy database. Am J Epidemiol. 2011; 174(10):1140-6. DOI: 10.1093/aje/kwr188. View

Rutter C, Miglioretti D, Savarino J . Bayesian Calibration of Microsimulation Models. J Am Stat Assoc. 2010; 104(488):1338-1350. PMC: 2805837. DOI: 10.1198/jasa.2009.ap07466. View

10.

Luo W, Katz D, Hamilton D, McKenney J, Jenness S, Goodreau S . Development of an Agent-Based Model to Investigate the Impact of HIV Self-Testing Programs on Men Who Have Sex With Men in Atlanta and Seattle. JMIR Public Health Surveill. 2018; 4(2):e58. PMC: 6045793. DOI: 10.2196/publichealth.9357. View

11.

Sai A, Vivas-Valencia C, Imperiale T, Kong N . Multiobjective Calibration of Disease Simulation Models Using Gaussian Processes. Med Decis Making. 2019; 39(5):540-552. PMC: 6786931. DOI: 10.1177/0272989X19862560. View

12.

Knudsen A, Zauber A, Rutter C, Naber S, Doria-Rose V, Pabiniak C . Estimation of Benefits, Burden, and Harms of Colorectal Cancer Screening Strategies: Modeling Study for the US Preventive Services Task Force. JAMA. 2016; 315(23):2595-609. PMC: 5493310. DOI: 10.1001/jama.2016.6828. View

13.

Alarid-Escudero F, MacLehose R, Peralta Y, M Kuntz K, Enns E . Nonidentifiability in Model Calibration and Implications for Medical Decision Making. Med Decis Making. 2018; 38(7):810-821. PMC: 6156799. DOI: 10.1177/0272989X18792283. View

14.

Corley D, Jensen C, Marks A, Zhao W, de Boer J, Levin T . Variation of adenoma prevalence by age, sex, race, and colon location in a large population: implications for screening and quality programs. Clin Gastroenterol Hepatol. 2012; 11(2):172-80. PMC: 3954741. DOI: 10.1016/j.cgh.2012.09.010. View

15.

Goedel W, King M, Lurie M, Nunn A, Chan P, Marshall B . Effect of Racial Inequities in Pre-exposure Prophylaxis Use on Racial Disparities in HIV Incidence Among Men Who Have Sex With Men: A Modeling Study. J Acquir Immune Defic Syndr. 2018; 79(3):323-329. PMC: 6342014. DOI: 10.1097/QAI.0000000000001817. View

16.

Ryckman T, Luby S, Owens D, Bendavid E, Goldhaber-Fiebert J . Methods for Model Calibration under High Uncertainty: Modeling Cholera in Bangladesh. Med Decis Making. 2020; 40(5):693-709. DOI: 10.1177/0272989X20938683. View

17.

Arias E . United States Life Tables, 2017. Natl Vital Stat Rep. 2020; 68(7):1-66. View

18.

Ladabaum U, CHOPRA C, Huang G, Scheiman J, Chernew M, Fendrick A . Aspirin as an adjunct to screening for prevention of sporadic colorectal cancer. A cost-effectiveness analysis. Ann Intern Med. 2001; 135(9):769-81. DOI: 10.7326/0003-4819-135-9-200111060-00007. View

19.

Myers E, McCrory D, Nanda K, Bastian L, Matchar D . Mathematical model for the natural history of human papillomavirus infection and cervical carcinogenesis. Am J Epidemiol. 2000; 151(12):1158-71. DOI: 10.1093/oxfordjournals.aje.a010166. View

20.

Sweetser S, Smyrk T, Sinicrope F . Serrated colon polyps as precursors to colorectal cancer. Clin Gastroenterol Hepatol. 2012; 11(7):760-7. PMC: 3628288. DOI: 10.1016/j.cgh.2012.12.004. View