
Development and Validation of a Colorectal Cancer Prediction Model: A Nationwide Cohort-Based Study

Overview
Journal Dig Dis Sci
Specialty Gastroenterology
Date 2024 Apr 25
PMID 38662163
Abstract

Background: Early diagnosis of colorectal cancer (CRC) is critical to increasing survival rates. Computerized risk prediction models hold great promise for identifying individuals at high risk for CRC. To be used effectively in population-wide screening, such models should be developed and validated on cohorts that resemble the target population.

Aim: Establish a risk prediction model for CRC diagnosis based on electronic health records (EHR) from subjects eligible for CRC screening.

Methods: A retrospective cohort study using the EHR data of Clalit Health Services (CHS). The study included CHS members aged 50-74 years who were eligible for CRC screening from January 2013 to January 2019. The model was trained to predict a CRC diagnosis within 2 years of the index date. Approximately 20,000 demographic and clinical EHR features were considered.
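
The 2-year outcome definition can be illustrated with a minimal sketch. The abstract does not say how the boundaries of the window were handled, so the function names (`add_years`, `crc_label`) and the choice of an interval that excludes the index date and includes the 2-year anniversary are assumptions made for illustration, not the authors' implementation.

```python
from datetime import date
from typing import Optional


def add_years(d: date, years: int) -> date:
    """Shift a date forward by whole calendar years."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        # Feb 29 shifted into a non-leap year: fall back to Feb 28.
        return d.replace(year=d.year + years, day=28)


def crc_label(index_date: date,
              diagnosis_date: Optional[date],
              horizon_years: int = 2) -> int:
    """Return 1 if a CRC diagnosis falls within the horizon after the index date.

    Assumption (not stated in the abstract): the window is
    index_date < diagnosis_date <= index_date + horizon_years.
    """
    if diagnosis_date is None:
        return 0  # no CRC diagnosis recorded
    window_end = add_years(index_date, horizon_years)
    return int(index_date < diagnosis_date <= window_end)


# Hypothetical subjects, for illustration only.
print(crc_label(date(2013, 1, 1), date(2014, 6, 1)))  # diagnosed inside the window
print(crc_label(date(2013, 1, 1), date(2016, 3, 1)))  # diagnosed after the window
print(crc_label(date(2013, 1, 1), None))              # never diagnosed
```

Under this sketch, a subject diagnosed after the window or never diagnosed is a negative example; how the study treated diagnoses preceding the index date is not described in the abstract.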

Results: The study included 2,935 subjects with a CRC diagnosis and 1,133,457 subjects without one. CRC incidence among subjects with risk scores in the top 1% was higher than baseline (2.3% vs 0.3%; lift 8.38; P < 0.001). Cumulative event probabilities increased with higher model scores. Among subjects with a positive fecal occult blood test (FOBT), model-based risk stratification identified subjects with more than twice the CRC risk indicated by FOBT alone.
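
Lift at the top 1% is the ratio of the CRC incidence among the highest-scoring 1% of subjects to the incidence in the whole cohort. The sketch below shows that calculation on synthetic data; the function name `lift_at_top_fraction`, the tie handling, and the example scores are assumptions for illustration and do not reproduce the study's data or code.

```python
from typing import Sequence, Tuple


def lift_at_top_fraction(scores: Sequence[float],
                         labels: Sequence[int],
                         fraction: float = 0.01) -> Tuple[float, float, float]:
    """Return (top_incidence, baseline_incidence, lift).

    top_incidence is the event rate among the highest-scoring `fraction`
    of subjects; lift is top_incidence divided by the overall event rate.
    """
    if len(scores) != len(labels) or len(scores) == 0:
        raise ValueError("scores and labels must be non-empty and equal in length")
    n_top = max(1, int(len(scores) * fraction))
    # Rank subjects from highest to lowest score (ties keep input order).
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_labels = [label for _, label in ranked[:n_top]]
    top_incidence = sum(top_labels) / n_top
    baseline = sum(labels) / len(labels)
    if baseline == 0:
        raise ValueError("lift is undefined when there are no events")
    return top_incidence, baseline, top_incidence / baseline


# Synthetic cohort: 1,000 subjects, 5 events, 2 of them in the top 1% of scores.
scores = [i / 1000 for i in range(1000)]
events = {999, 998, 500, 100, 50}
labels = [1 if i in events else 0 for i in range(1000)]

top, base, lift = lift_at_top_fraction(scores, labels)
print(f"top 1% incidence={top:.3f}, baseline={base:.3f}, lift={lift:.1f}")
```

Note that the rounded percentages in the abstract (2.3% and 0.3%) give a ratio of about 7.7; the reported lift of 8.38 was presumably computed from the unrounded incidences.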

Conclusions: We developed an individualized risk prediction model for CRC that can be utilized as a complementary decision support tool for healthcare providers to precisely identify subjects at high risk for CRC and refer them for confirmatory testing.
