Extent of Gene Duplication in the Genomes of Drosophila, Nematode, and Yeast

Overview
Journal Mol Biol Evol
Specialty Biology
Date 2002 Feb 28
PMID 11861885
Citations 242
Abstract

We conducted a detailed analysis of duplicate genes in three complete genomes: yeast, Drosophila, and Caenorhabditis elegans. We assigned two proteins to the same family if they met two criteria: (1) their similarity is ≥ I, where I = 30% if L ≥ 150 a.a. and I = 0.01n + 4.8L^(-0.32(1 + exp(-L/1000))) if L < 150 a.a., with n = 6 and L the length of the alignable region; and (2) the length of the alignable region between the two sequences is ≥ 80% of the longer protein. We found it very important to delete isoforms (caused by alternative splicing), identical genes listed under different names, and proteins derived from repetitive elements. We estimated that there were 530, 674, and 1,219 protein families in yeast, Drosophila, and C. elegans, respectively, so, as expected, yeast has the smallest number of duplicate genes. However, among duplicate pairs with fewer than 0.01 substitutions per synonymous site (K_S < 0.01), Drosophila has only seven pairs, whereas yeast has 58 pairs and nematode has 153 pairs. After considering the possible effects of codon usage bias and gene conversion, these numbers became 6, 55, and 147, respectively. Thus, Drosophila appears to have far fewer young duplicate genes than yeast and nematode. The larger numbers of duplicate pairs with K_S < 0.01 in yeast and C. elegans were probably caused largely by block duplications. At any rate, it is clear that the genome of Drosophila melanogaster has undergone few gene duplications in the recent past and has far fewer gene families than C. elegans.
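
The two family-membership criteria in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names `similarity_threshold` and `same_family` are hypothetical, similarity is assumed to be expressed as a fraction between 0 and 1 (so that 30% is 0.30), and the exponent in the short-alignment formula is read as L raised to the power -0.32(1 + exp(-L/1000)).

```python
import math


def similarity_threshold(aligned_len: int, n: int = 6) -> float:
    """Minimum similarity I (as a fraction) for an alignable region of length L.

    Assumed reading of the abstract: I = 0.30 when L >= 150 a.a.; otherwise
    I = 0.01*n + 4.8 * L**(-0.32 * (1 + exp(-L/1000))), with n = 6.
    """
    if aligned_len >= 150:
        return 0.30
    exponent = -0.32 * (1 + math.exp(-aligned_len / 1000))
    return 0.01 * n + 4.8 * aligned_len ** exponent


def same_family(similarity: float, aligned_len: int,
                len_a: int, len_b: int, n: int = 6) -> bool:
    """Apply both criteria: similarity >= I, and the alignable region
    covers at least 80% of the longer of the two proteins."""
    longer = max(len_a, len_b)
    meets_similarity = similarity >= similarity_threshold(aligned_len, n)
    meets_coverage = aligned_len >= 0.8 * longer
    return meets_similarity and meets_coverage


if __name__ == "__main__":
    # Hypothetical pair: 35% similarity over 200 aligned residues,
    # protein lengths 220 and 240 a.a.
    print(same_family(0.35, 200, 220, 240))
```

Under this reading the short-alignment threshold rises as L shrinks and approaches 0.30 as L nears 150, so the two branches of criterion (1) join almost continuously.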