» Articles » PMID: 39766238

The Historical Evolution and Significance of Multiple Sequence Alignment in Molecular Structure and Function Prediction

Overview
Journal Biomolecules
Publisher MDPI
Date 2025 Jan 8
PMID 39766238
Authors
Affiliations
Soon will be listed here.
Abstract

Multiple sequence alignment (MSA) has evolved into a fundamental tool in the biological sciences, playing a pivotal role in predicting molecular structures and functions. With broad applications in protein and nucleic acid modeling, MSAs continue to underpin advancements across a range of disciplines. MSAs are not only foundational for traditional sequence comparison techniques but also increasingly important in the context of artificial intelligence (AI)-driven advancements. Recent breakthroughs in AI, particularly in protein and nucleic acid structure prediction, rely heavily on the accuracy and efficiency of MSAs to enhance remote homology detection and guide spatial restraints. This review traces the historical evolution of MSA, highlighting its significance in molecular structure and function prediction. We cover the methodologies used for protein monomers, protein complexes, and RNA, while also exploring emerging AI-based alternatives, such as protein language models, as complementary or replacement approaches to traditional MSAs in application tasks. By discussing the strengths, limitations, and applications of these methods, this review aims to provide researchers with valuable insights into MSA's evolving role, equipping them to make informed decisions in structural prediction research.

References
1.
Chowdhury R, Bouatta N, Biswas S, Floristean C, Kharkar A, Roy K . Single-sequence protein structure prediction using a language model and deep learning. Nat Biotechnol. 2022; 40(11):1617-1623. PMC: 10440047. DOI: 10.1038/s41587-022-01432-w. View

2.
Guindon S, Gascuel O . A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003; 52(5):696-704. DOI: 10.1080/10635150390235520. View

3.
Chao K, Pearson W, Miller W . Aligning two sequences within a specified diagonal band. Comput Appl Biosci. 1992; 8(5):481-7. DOI: 10.1093/bioinformatics/8.5.481. View

4.
Dowell R, Eddy S . Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints. BMC Bioinformatics. 2006; 7:400. PMC: 1579236. DOI: 10.1186/1471-2105-7-400. View

5.
Henikoff S, Henikoff J . Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992; 89(22):10915-9. PMC: 50453. DOI: 10.1073/pnas.89.22.10915. View