» Articles » PMID: 16103904

Predicting Functional Gene Links from Phylogenetic-statistical Analyses of Whole Genomes

Overview
Specialty Biology
Date 2005 Aug 17
PMID 16103904
Citations 85
Authors
Affiliations
Soon will be listed here.
Abstract

An important element of the developing field of proteomics is to understand protein-protein interactions and other functional links amongst genes. Across-species correlation methods for detecting functional links work on the premise that functionally linked proteins will tend to show a common pattern of presence and absence across a range of genomes. We describe a maximum likelihood statistical model for predicting functional gene linkages. The method detects independent instances of the correlated gain or loss of pairs of proteins on phylogenetic trees, reducing the high rates of false positives observed in conventional across-species methods that do not explicitly incorporate a phylogeny. We show, in a dataset of 10,551 protein pairs, that the phylogenetic method improves by up to 35% on across-species analyses at identifying known functionally linked proteins. The method shows that protein pairs with at least two to three correlated events of gain or loss are almost certainly functionally linked. Contingent evolution, in which one gene's presence or absence depends upon the presence of another, can also be detected phylogenetically, and may identify genes whose functional significance depends upon its interaction with other genes. Incorporating phylogenetic information improves the prediction of functional linkages. The improvement derives from having a lower rate of false positives and from detecting trends that across-species analyses miss. Phylogenetic methods can easily be incorporated into the screening of large-scale bioinformatics datasets to identify sets of protein links and to characterise gene networks.

Citing Articles

Comprehensive identification of GASA genes in sunflower and expression profiling in response to drought.

Asad Ullah M, Ahmed M, AlHusnain L, Zia M, AlKahtani M, Attia K BMC Genomics. 2024; 25(1):954.

PMID: 39402437 PMC: 11472593. DOI: 10.1186/s12864-024-10860-8.


Accurate Detection of Convergent Mutations in Large Protein Alignments With ConDor.

Morel M, Zhukova A, Lemoine F, Gascuel O Genome Biol Evol. 2024; 16(4).

PMID: 38451738 PMC: 10986858. DOI: 10.1093/gbe/evae040.


Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions.

Dembech E, Malatesta M, De Rito C, Mori G, Cavazzini D, Secchi A Proc Natl Acad Sci U S A. 2023; 120(16):e2218329120.

PMID: 37043529 PMC: 10120013. DOI: 10.1073/pnas.2218329120.


Molecular Phylogeny of Cimicoidea (Heteroptera: Cimicomorpha) Revisited: Increased Taxon Sampling Reveals Evolution of Traumatic Insemination and Paragenitalia.

Jung S, Kim J, Balvin O, Yamada K Insects. 2023; 14(3).

PMID: 36975952 PMC: 10051671. DOI: 10.3390/insects14030267.


Computational models for prediction of protein-protein interaction in rice and .

Karan B, Mahapatra S, Sahu S, Pandey D, Chakravarty S Front Plant Sci. 2023; 13:1046209.

PMID: 36816487 PMC: 9929577. DOI: 10.3389/fpls.2022.1046209.


References
1.
Rokas A, Williams B, King N, Carroll S . Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature. 2003; 425(6960):798-804. DOI: 10.1038/nature02053. View

2.
von Mering C, Krause R, Snel B, Cornell M, Oliver S, Fields S . Comparative assessment of large-scale data sets of protein-protein interactions. Nature. 2002; 417(6887):399-403. DOI: 10.1038/nature750. View

3.
Pazos F, Valencia A . Similarity of phylogenetic trees as indicator of protein-protein interaction. Protein Eng. 2001; 14(9):609-14. DOI: 10.1093/protein/14.9.609. View

4.
Date S, Marcotte E . Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages. Nat Biotechnol. 2003; 21(9):1055-62. DOI: 10.1038/nbt861. View

5.
Mao H, White S, Williamson J . A novel loop-loop recognition motif in the yeast ribosomal protein L30 autoregulatory RNA complex. Nat Struct Biol. 1999; 6(12):1139-47. DOI: 10.1038/70081. View