» Articles » PMID: 36860337

Machine Learning-based Prediction of Candidate Gene Biomarkers Correlated with Immune Infiltration in Patients with Idiopathic Pulmonary Fibrosis

Overview
Specialty General Medicine
Date 2023 Mar 2
PMID 36860337
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: This study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms.

Methods: Microarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differentially expressed genes (DEGs). The DEGs were subjected to enrichment analysis, and two machine learning algorithms were used to identify candidate genes associated with IPF. These genes were verified in a validation cohort from the GEO database. Receiver operating characteristic (ROC) curves were plotted to assess the predictive value of the IPF-associated genes. The cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to evaluate the proportion of immune cells in IPF and normal tissues. Additionally, the correlation between the expression of IPF-associated genes and the infiltration levels of immune cells was examined.

Results: A total of 302 upregulated and 192 downregulated genes were identified. Functional annotation, pathway enrichment, Disease Ontology and gene set enrichment analyses revealed that the DEGs were related to the extracellular matrix and immune responses. COL3A1, CDH3, CEBPD, and GPIHBP1 were identified as candidate biomarkers using machine learning algorithms, and their predictive value was verified in a validation cohort. Additionally, ROC analysis revealed that the four genes had high predictive accuracy. The infiltration levels of plasma cells, M0 macrophages and resting dendritic cells were higher and those of resting natural killer (NK) cells, M1 macrophages and eosinophils were lower in the lung tissues of patients with IPF than in those of healthy individuals. The expression of the abovementioned genes was correlated with the infiltration levels of plasma cells, M0 macrophages and eosinophils.

Conclusion: COL3A1, CDH3, CEBPD, and GPIHBP1 are candidate biomarkers of IPF. Plasma cells, M0 macrophages and eosinophils may be involved in the development of IPF and may serve as immunotherapeutic targets in IPF.

Citing Articles

Identification and validation of biomarkers related to ferroptosis in idiopathic pulmonary fibrosis.

Yue M, Luan R, Ding D, Wang Y, Xue Q, Yang J Sci Rep. 2025; 15(1):8622.

PMID: 40075162 PMC: 11904244. DOI: 10.1038/s41598-025-93217-9.


Identifying health risk determinants and molecular targets in patients with idiopathic pulmonary fibrosis via combined differential and weighted gene co-expression analysis.

Moin A, Ullah M, Nipa J, Rahman M, Emran A, Islam M Front Genet. 2025; 15:1496462.

PMID: 39944354 PMC: 11813903. DOI: 10.3389/fgene.2024.1496462.


Identifying cancer prognosis genes through causal learning.

Wu S, Yin C, Wang Y, Sun H Brief Bioinform. 2025; 26(1.

PMID: 39808115 PMC: 11729728. DOI: 10.1093/bib/bbae721.


Investigating Angiogenesis-Related Biomarkers in Osteoarthritis Patients Through Transcriptomic Profiling.

Zheng Y, Fang M, Sanan S, Meng X, Huang J, Qian Y J Inflamm Res. 2024; 17:10681-10697.

PMID: 39677287 PMC: 11638479. DOI: 10.2147/JIR.S493889.


Construction of an artificial neural network diagnostic model and investigation of immune cell infiltration characteristics for idiopathic pulmonary fibrosis.

Zhang H, Hua H, Wang C, Zhu C, Xia Q, Jiang W BMC Pulm Med. 2024; 24(1):458.

PMID: 39289672 PMC: 11409795. DOI: 10.1186/s12890-024-03249-6.


References
1.
Newman A, Steen C, Liu C, Gentles A, Chaudhuri A, Scherer F . Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat Biotechnol. 2019; 37(7):773-782. PMC: 6610714. DOI: 10.1038/s41587-019-0114-2. View

2.
Leek J, Johnson W, Parker H, Jaffe A, Storey J . The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012; 28(6):882-3. PMC: 3307112. DOI: 10.1093/bioinformatics/bts034. View

3.
Fukunaga S, Kakehashi A, Sumida K, Kushida M, Asano H, Gi M . Integrative analyses of miRNA and proteomics identify potential biological pathways associated with onset of pulmonary fibrosis in the bleomycin rat model. Toxicol Appl Pharmacol. 2015; 286(3):188-97. DOI: 10.1016/j.taap.2015.04.014. View

4.
Li C, Wang Z, Zhang J, Zhao X, Xu P, Liu X . Crosstalk of mRNA, miRNA, lncRNA, and circRNA and Their Regulatory Pattern in Pulmonary Fibrosis. Mol Ther Nucleic Acids. 2019; 18:204-218. PMC: 6796619. DOI: 10.1016/j.omtn.2019.08.018. View

5.
Raghu G, Remy-Jardin M, Myers J, Richeldi L, Ryerson C, Lederer D . Diagnosis of Idiopathic Pulmonary Fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med. 2018; 198(5):e44-e68. DOI: 10.1164/rccm.201807-1255ST. View