Foundation Models in Bioinformatics

Overview
Journal Natl Sci Rev
Date 2025 Mar 13
PMID 40078374
Abstract

With the adoption of foundation models (FMs), artificial intelligence (AI) has become increasingly significant in bioinformatics and has helped address long-standing challenges such as pre-training frameworks, model evaluation and interpretability. FMs are particularly proficient at handling large-scale, unlabeled datasets, which matters because the experimental procedures needed to label biological data are costly and labor intensive. Across a variety of downstream tasks, FMs have consistently achieved strong results and represent biological entities with high accuracy. Their application has ushered in a new era in computational biology, addressing both general and specific biological problems. In this review, we introduce recent advances in bioinformatics FMs applied to downstream tasks in genomics, transcriptomics, proteomics, drug discovery and single-cell analysis. Our aim is to help scientists select appropriate FMs in bioinformatics, organized by four model types: language FMs, vision FMs, graph FMs and multimodal FMs. Beyond improving our understanding of molecular landscapes, AI technology can establish the theoretical and practical foundation for continued innovation in molecular biology.
