» Articles » PMID: 37957897

Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods

Overview
Specialty Chemistry
Date 2023 Nov 14
PMID 37957897
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Colorectal cancer (CRC) has a very high incidence and lethality rate and is one of the most dangerous cancer types. Timely diagnosis can effectively reduce the incidence of colorectal cancer. Changes in para-cancerous tissues may serve as an early signal for tumorigenesis. Comparison of the differences in gene expression between para-cancerous and normal mucosa can help in the diagnosis of CRC and understanding the mechanisms of development.

Objectives: This study aimed to identify specific genes at the level of gene expression, which are expressed in normal mucosa and may be predictive of CRC risk.

Methods: A machine learning approach was used to analyze transcriptomic data in 459 samples of normal colonic mucosal tissue from 322 CRC cases and 137 non-CRC, in which each sample contained 28,706 gene expression levels. The genes were ranked using four ranking methods based on importance estimation (LASSO, LightGBM, MCFS, and mRMR) and four classification algorithms (decision tree [DT], K-nearest neighbor [KNN], random forest [RF], and support vector machine [SVM]) were combined with incremental feature selection [IFS] methods to construct a prediction model with excellent performance.

Result: The top-ranked genes, namely, , and , were associated with tumorigenesis based on previous studies.

Conclusion: This study summarized four sets of quantitative classification rules based on the DT algorithm, providing clues for understanding the microenvironmental changes caused by CRC. According to the rules, the effect of CRC on normal mucosa can be determined.

Citing Articles

Herb-disease association prediction model based on network consistency projection.

Chen L, Zhang S, Zhou B Sci Rep. 2025; 15(1):3328.

PMID: 39865145 PMC: 11770172. DOI: 10.1038/s41598-025-87521-7.


CMAGN: circRNA-miRNA association prediction based on graph attention auto-encoder and network consistency projection.

Yin A, Chen L, Zhou B, Cai Y BMC Bioinformatics. 2024; 25(1):336.

PMID: 39449126 PMC: 11515630. DOI: 10.1186/s12859-024-05959-4.


Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.

Ma Q, Zhang Y, Guo W, Feng K, Huang T, Cai Y Life (Basel). 2024; 14(8).

PMID: 39202774 PMC: 11355424. DOI: 10.3390/life14081032.


Machine Learning Reveals Impacts of Smoking on Gene Profiles of Different Cell Types in Lung.

Ma Q, Shen Y, Guo W, Feng K, Huang T, Cai Y Life (Basel). 2024; 14(4).

PMID: 38672772 PMC: 11051039. DOI: 10.3390/life14040502.

References
1.
Brustugun O, Moller B, Helland A . Years of life lost as a measure of cancer burden on a national level. Br J Cancer. 2014; 111(5):1014-20. PMC: 4150272. DOI: 10.1038/bjc.2014.364. View

2.
Sung H, Ferlay J, Siegel R, Laversanne M, Soerjomataram I, Jemal A . Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021; 71(3):209-249. DOI: 10.3322/caac.21660. View

3.
Siegel R, Miller K, Fuchs H, Jemal A . Cancer Statistics, 2021. CA Cancer J Clin. 2021; 71(1):7-33. DOI: 10.3322/caac.21654. View

4.
Fearon E, Vogelstein B . A genetic model for colorectal tumorigenesis. Cell. 1990; 61(5):759-67. DOI: 10.1016/0092-8674(90)90186-i. View

5.
Amaro A, Chiara S, Pfeffer U . Molecular evolution of colorectal cancer: from multistep carcinogenesis to the big bang. Cancer Metastasis Rev. 2016; 35(1):63-74. DOI: 10.1007/s10555-016-9606-4. View