» Articles » PMID: 15236962

Clustering of Protein Domains in the Human Genome

Overview
Journal J Mol Biol
Publisher Elsevier
Date 2004 Jul 9
PMID 15236962
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

We present a systematic study of the clustering of genes within the human genome based on homology inferred from both sequence and structural similarity. The 3D-Genomics automated proteome annotation pipeline () was utilised to infer homology for each protein domain in the genome, for the 26 superfamilies most highly represented in the Structural Classification Of Proteins (SCOP) database. This approach enabled us to identify homologues that could not be detected by sequence-based methods alone. For each superfamily, we investigated the distribution, both within and among chromosomes, of genes encoding at least one domain within the superfamily. The results indicate a diversity of clustering behaviours: some superfamilies showed no evidence of any clustering, and others displayed significant clustering either within or among chromosomes, or both. Removal of tandem repeats reduced the levels of clustering observed, but some superfamilies still displayed highly significant clustering. Thus, our study suggests that either the process of gene duplication, or the evolution of the resulting clusters, differs between structural superfamilies.

Citing Articles

Bismuth Complexes Inhibit the SARS Coronavirus.

Yang N, Tanner J, Zheng B, Watt R, He M, Lu L Angew Chem Weinheim Bergstr Ger. 2020; 119(34):6584-6588.

PMID: 32313314 PMC: 7159568. DOI: 10.1002/ange.200701021.


Using HHsearch to tackle proteins of unknown function: A pilot study with PH domains.

Fidler D, Murphy S, Courtis K, Antonoudiou P, El-Tohamy R, Ient J Traffic. 2016; 17(11):1214-1226.

PMID: 27601190 PMC: 5091641. DOI: 10.1111/tra.12432.


Clusters of ancestrally related genes that show paralogy in whole or in part are a major feature of the genomes of humans and other species.

Walker M, King B, Paigen K PLoS One. 2012; 7(4):e35274.

PMID: 22563380 PMC: 3338513. DOI: 10.1371/journal.pone.0035274.


Classifying genes to the correct Gene Ontology Slim term in Saccharomyces cerevisiae using neighbouring genes with classification learning.

Amthauer H, Tsatsoulis C BMC Genomics. 2010; 11:340.

PMID: 20509921 PMC: 2890565. DOI: 10.1186/1471-2164-11-340.


Bacterial pleckstrin homology domains: a prokaryotic origin for the PH domain.

Xu Q, Bateman A, Finn R, Abdubek P, Astakhova T, Axelrod H J Mol Biol. 2009; 396(1):31-46.

PMID: 19913036 PMC: 2817789. DOI: 10.1016/j.jmb.2009.11.006.