Using Intron Position Conservation for Homology-based Gene Prediction

Overview
Specialty Biochemistry
Date 2016 Feb 20
PMID 26893356
Citations 323
Abstract

Annotation of protein-coding genes is central to bioinformatics and biology and has a decisive influence on many downstream analyses. Homology-based gene prediction programs transfer knowledge about protein-coding genes from an annotated organism to an organism of interest. Here, we present a homology-based gene prediction program called GeMoMa. GeMoMa utilizes the conservation of intron positions within genes to predict related genes in other organisms. We assess the performance of GeMoMa and compare it with state-of-the-art competitors on plant and animal genomes using an extended best reciprocal hit approach. We find that GeMoMa often makes more precise predictions than its competitors, yielding a substantially increased number of correct transcripts. Subsequently, we validate selected GeMoMa predictions by Sanger sequencing as examples. Finally, we use RNA-seq data to compare the predictions of homology-based gene prediction programs, and again find that GeMoMa performs well. Hence, we conclude that exploiting intron position conservation improves homology-based gene prediction, and we make GeMoMa freely available as a command-line tool and as a Galaxy integration.
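
To make the idea of intron position conservation concrete, the following is a minimal illustrative sketch, not GeMoMa's actual algorithm. It assumes each gene is given as a list of coding exon lengths in nucleotides and that the two proteins align without gaps, so positions can be compared directly; a real comparison would first align the sequences. The function names `intron_positions` and `conserved_fraction` are hypothetical and chosen only for this example.

```python
from itertools import accumulate


def intron_positions(exon_lengths):
    """Return intron positions as (codon index, phase) pairs.

    Hypothetical helper: an intron lies at the boundary between two
    consecutive coding exons, so its position in the coding sequence is
    the cumulative coding length before it. The phase is that length
    modulo 3.
    """
    # Drop the final cumulative sum, which is the end of the last exon.
    boundaries = list(accumulate(exon_lengths))[:-1]
    return [(pos // 3, pos % 3) for pos in boundaries]


def conserved_fraction(reference, candidate):
    """Fraction of reference intron positions also present in the candidate.

    Assumes an ungapped correspondence between the two coding sequences.
    """
    if not reference:
        # A single-exon reference is fully matched only by a single-exon candidate.
        return 1.0 if not candidate else 0.0
    shared = set(reference) & set(candidate)
    return len(shared) / len(set(reference))


# Made-up exon lengths for a reference gene and a candidate gene model.
ref = intron_positions([120, 87, 201, 90])
cand = intron_positions([120, 87, 198, 93])
print(ref)
print(cand)
print(conserved_fraction(ref, cand))
```

In this toy input the first two introns fall at identical coding positions and the third is shifted by one codon, so two of the three reference intron positions count as conserved.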

Citing Articles

A near-complete genome assembly of Fragaria iinumae.

Du H, He Y, Chen M, Zheng X, Gui D, Tang J. BMC Genomics. 2025; 26(1):253.

PMID: 40087556 DOI: 10.1186/s12864-025-11440-0.


A comparative genomic analysis at the chromosomal-level reveals evolutionary patterns of aphid chromosomes.

Huang C, Ji B, Shi Z, Wang J, Yuan J, Yang P. Commun Biol. 2025; 8(1):427.

PMID: 40082663 PMC: 11906883. DOI: 10.1038/s42003-025-07851-0.


Experimental Neurogenesis in the Embryos of the Gecko Paroedura picta.

Jimenez S, Senovilla-Ganzo R, Gallego-Flores T, Perez-Pascual E, Ordenana-Manso A, Rayo-Morales R. Methods Mol Biol. 2025; 2899:127-145.

PMID: 40067621 DOI: 10.1007/978-1-0716-4386-0_9.


Combined genomic, transcriptomic, and metabolomic analyses provide insights into the fruit development of bottle gourd ().

He X, Zheng Y, Yang S, Wang Y, Lin Y, Jiang B. Hortic Res. 2025; 12(3):uhae335.

PMID: 40051576 PMC: 11883228. DOI: 10.1093/hr/uhae335.


A telomere-to-telomere genome assembly of the protandrous hermaphrodite blackhead seabream, Acanthopagrus schlegelii.

Zhang K, Guo S, Yang S, Zhou W, Wu J, Zhang X. Sci Data. 2025; 12(1):350.

PMID: 40016269 PMC: 11868651. DOI: 10.1038/s41597-025-04602-y.

