» Articles » PMID: 39906415

Evaluating the Advancements in Protein Language Models for Encoding Strategies in Protein Function Prediction: a Comprehensive Review

Overview
Date 2025 Feb 5
PMID 39906415
Authors
Affiliations
Soon will be listed here.
Abstract

Protein function prediction is crucial in several key areas such as bioinformatics and drug design. With the rapid progress of deep learning technology, applying protein language models has become a research focus. These models utilize the increasing amount of large-scale protein sequence data to deeply mine its intrinsic semantic information, which can effectively improve the accuracy of protein function prediction. This review comprehensively combines the current status of applying the latest protein language models in protein function prediction. It provides an exhaustive performance comparison with traditional prediction methods. Through the in-depth analysis of experimental results, the significant advantages of protein language models in enhancing the accuracy and depth of protein function prediction tasks are fully demonstrated.

References
1.
Zhou N, Jiang Y, Bergquist T, Lee A, Kacsoh B, Crocker A . The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens. Genome Biol. 2019; 20(1):244. PMC: 6864930. DOI: 10.1186/s13059-019-1835-8. View

2.
Kulmanov M, Hoehndorf R . DeepGOPlus: improved protein function prediction from sequence. Bioinformatics. 2019; 36(2):422-429. PMC: 9883727. DOI: 10.1093/bioinformatics/btz595. View

3.
Zhang X, Guo H, Zhang F, Wang X, Wu K, Qiu S . HNetGO: protein function prediction via heterogeneous network transformer. Brief Bioinform. 2023; 24(6). PMC: 10588005. DOI: 10.1093/bib/bbab556. View

4.
Gligorijevic V, Barot M, Bonneau R . deepNF: deep network fusion for protein function prediction. Bioinformatics. 2018; 34(22):3873-3881. PMC: 6223364. DOI: 10.1093/bioinformatics/bty440. View

5.
Yang X, Liu G, Feng G, Bu D, Wang P, Jiang J . GeneCompass: deciphering universal gene regulatory mechanisms with a knowledge-informed cross-species foundation model. Cell Res. 2024; 34(12):830-845. PMC: 11615217. DOI: 10.1038/s41422-024-01034-y. View