» Articles » PMID: 18073194

PlantTribes: a Gene and Gene Family Resource for Comparative Genomics in Plants

Overview
Specialty Biochemistry
Date 2007 Dec 13
PMID 18073194
Citations 47
Authors
Affiliations
Soon will be listed here.
Abstract

The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575-1584)] to classify all of these species' protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting approximately 4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study.

Citing Articles

A combination of conserved and diverged responses underlies Theobroma cacao's defense response to Phytophthora palmivora.

Winters N, Wafula E, Knollenberg B, Hamala T, Timilsena P, Perryman M BMC Biol. 2024; 22(1):38.

PMID: 38360697 PMC: 10870529. DOI: 10.1186/s12915-024-01831-2.


PlantTribes2: Tools for comparative gene family analysis in plant genomics.

Wafula E, Zhang H, Von Kuster G, Leebens-Mack J, Honaas L, dePamphilis C Front Plant Sci. 2023; 13:1011199.

PMID: 36798801 PMC: 9928214. DOI: 10.3389/fpls.2022.1011199.


Alkaloid production and response to natural adverse conditions in : transcriptome analyses.

Jazayeri S, Pooralinaghi M, Torres-Navarrete Y, Oviedo-Bayas B, Guerra I, Jacome D BioTechnologia (Pozn). 2023; 103(4):355-384.

PMID: 36685700 PMC: 9837557. DOI: 10.5114/bta.2022.120706.


Phylotranscriptomic Analyses of Mycoheterotrophic Monocots Show a Continuum of Convergent Evolutionary Changes in Expressed Nuclear Genes From Three Independent Nonphotosynthetic Lineages.

Timilsena P, Barrett C, Pineyro-Nelson A, Wafula E, Ayyampalayam S, McNeal J Genome Biol Evol. 2022; 15(1).

PMID: 36582124 PMC: 9887272. DOI: 10.1093/gbe/evac183.


Phylogenomic resolution of order- and family-level monocot relationships using 602 single-copy nuclear genes and 1375 BUSCO genes.

Timilsena P, Wafula E, Barrett C, Ayyampalayam S, McNeal J, Rentsch J Front Plant Sci. 2022; 13:876779.

PMID: 36483967 PMC: 9723157. DOI: 10.3389/fpls.2022.876779.


References
1.
Craigon D, James N, Okyere J, Higgins J, Jotham J, May S . NASCArrays: a repository for microarray data generated by NASC's transcriptomics service. Nucleic Acids Res. 2003; 32(Database issue):D575-7. PMC: 308867. DOI: 10.1093/nar/gkh133. View

2.
Martinez-Castilla L, Alvarez-Buylla E . Adaptive evolution in the Arabidopsis MADS-box gene family inferred from its complete resolved phylogeny. Proc Natl Acad Sci U S A. 2003; 100(23):13407-12. PMC: 263827. DOI: 10.1073/pnas.1835864100. View

3.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View

4.
Carlson J, Leebens-Mack J, Wall P, Zahn L, Mueller L, Landherr L . EST database for early flower development in California poppy (Eschscholzia californica Cham., Papaveraceae) tags over 6,000 genes from a basal eudicot. Plant Mol Biol. 2006; 62(3):351-69. DOI: 10.1007/s11103-006-9025-y. View

5.
Cui L, Wall P, Leebens-Mack J, Lindsay B, Soltis D, Doyle J . Widespread genome duplications throughout the history of flowering plants. Genome Res. 2006; 16(6):738-49. PMC: 1479859. DOI: 10.1101/gr.4825606. View