Comparison of Sequence Profiles. Strategies for Structural Predictions Using Sequence Information

Overview
Journal Protein Sci
Specialty Biochemistry
Date 2000 Mar 15
PMID 10716175
Citations 199
Abstract

Distant homologies between proteins are often discovered only after the three-dimensional structures of both proteins have been solved. The sequence divergence between such proteins can be so large that simple comparison of their sequences fails to identify any similarity. A new generation of sensitive alignment tools uses averaged sequences of entire homologous families (profiles) to detect such homologies. Several algorithms, including the newest generation of BLAST algorithms and BASIC, an algorithm used in our group to assign fold predictions for proteins from several genomes, are compared with each other on a large set of structurally similar proteins with little sequence similarity. Proteins in the benchmark are classified according to their level of similarity, which allows us to demonstrate that most of the improvement of the new algorithms is achieved for proteins with strong functional similarities, with almost no progress in recognizing distant fold similarities. It is also shown that the details of profile calculation strongly influence its sensitivity in recognizing distant homologies. The most important choice is how to include information from diverging members of the family: the profile should account for the entire sequence divergence within a family while avoiding false predictions. PSI-BLAST takes a conservative approach, deriving a profile from core members of the family, which provides a solid improvement with almost no false predictions. BASIC strives for better sensitivity by increasing the weight of divergent family members, paying the price in lower reliability. The FFAS algorithm introduced here uses a new procedure for profile generation that takes into account all the relations within the family and matches the sensitivity of BASIC with PSI-BLAST-like reliability.
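
To make the idea of a weighted family profile concrete, the following is a minimal sketch and not the procedure used by PSI-BLAST, BASIC, or FFAS, none of which is specified in the abstract. It assumes a toy ungapped alignment, uses position-based sequence weights in the style of Henikoff and Henikoff (1994) so that divergent family members count for more than near-duplicates, and scores two profile columns with a plain dot product. All function names and the example alignment are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids; anything else (e.g. gaps) is ignored in profiles.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def position_based_weights(alignment):
    """Return one normalized weight per sequence.

    In each column a sequence receives 1 / (k * n), where k is the number of
    distinct residues in the column and n is how many sequences share this
    sequence's residue. Assumes equal-length, ungapped sequences.
    """
    weights = [0.0] * len(alignment)
    for column in zip(*alignment):
        counts = Counter(column)
        k = len(counts)
        for i, residue in enumerate(column):
            weights[i] += 1.0 / (k * counts[residue])
    total = sum(weights)
    return [w / total for w in weights]


def build_profile(alignment, weights):
    """Return a list of columns, each a dict of weighted residue frequencies."""
    profile = []
    for column in zip(*alignment):
        freqs = dict.fromkeys(ALPHABET, 0.0)
        for residue, w in zip(column, weights):
            if residue in freqs:
                freqs[residue] += w
        profile.append(freqs)
    return profile


def column_score(col_a, col_b):
    """Dot product of two profile columns (an illustrative similarity score)."""
    return sum(col_a[r] * col_b[r] for r in ALPHABET)


# Hypothetical three-sequence family used only for illustration.
alignment = ["ACDE", "ACDF", "GCDE"]
weights = position_based_weights(alignment)
profile = build_profile(alignment, weights)
```

In this toy family the second and third sequences each carry a residue that no other member has, so they receive a larger weight than the first. How strongly such divergent members are weighted is the kind of design choice the abstract identifies as the trade-off between sensitivity and reliability.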

Citing Articles

AI-Driven Deep Learning Techniques in Protein Structure Prediction.

Chen L, Li Q, Nasif K, Xie Y, Deng B, Niu S Int J Mol Sci. 2024; 25(15).

PMID: 39125995 PMC: 11313475. DOI: 10.3390/ijms25158426.


Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

Huang B, Kong L, Wang C, Ju F, Zhang Q, Zhu J Genomics Proteomics Bioinformatics. 2023; 21(5):913-925.

PMID: 37001856 PMC: 10928435. DOI: 10.1016/j.gpb.2022.11.014.


Contact-Assisted Threading in Low-Homology Protein Modeling.

Bhattacharya S, Roche R, Shuvo M, Moussad B, Bhattacharya D Methods Mol Biol. 2023; 2627:41-59.

PMID: 36959441 PMC: 10340115. DOI: 10.1007/978-1-0716-2974-1_3.


CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields.

Lee S, Joo K, Sim S, Lee J, Lee I, Lee J Molecules. 2022; 27(12).

PMID: 35744836 PMC: 9231382. DOI: 10.3390/molecules27123711.


Methods for discovering catalytic activities for pseudokinases.

Black M, Gradowski M, Pawlowski K, Tagliabracci V Methods Enzymol. 2022; 667:575-610.

PMID: 35525554 PMC: 9554938. DOI: 10.1016/bs.mie.2022.03.047.

