» Articles » PMID: 24174438

Improving Genome Assemblies and Annotations for Nonhuman Primates

Overview
Journal ILAR J
Date 2013 Nov 1
PMID 24174438
Citations 17
Authors
Affiliations
Soon will be listed here.
Abstract

The study of nonhuman primates (NHP) is key to understanding human evolution, in addition to being an important model for biomedical research. NHPs are especially important for translational medicine. There are now exciting opportunities to greatly increase the utility of these models by incorporating Next Generation (NextGen) sequencing into study design. Unfortunately, the draft status of nonhuman genomes greatly constrains what can currently be accomplished with available technology. Although all genomes contain errors, draft assemblies and annotations contain so many mistakes that they make currently available nonhuman primate genomes misleading to investigators conducting evolutionary studies; and these genomes are of insufficient quality to serve as references for NextGen studies. Fortunately, NextGen sequencing can be used in the production of greatly improved genomes. Existing Sanger sequences can be supplemented with NextGen whole genome, and exomic genomic sequences to create new, more complete and correct assemblies. Additional physical mapping, and an incorporation of information about gene structure, can be used to improve assignment of scaffolds to chromosomes. In addition, mRNA-sequence data can be used to economically acquire transcriptome information, which can be used for annotation. Some highly polymorphic and complex regions, for example MHC class I and immunoglobulin loci, will require extra effort to properly assemble and annotate. However, for the vast majority of genes, a modest investment in money, and a somewhat greater investment in time, can greatly improve assemblies and annotations sufficient to produce true, reference grade nonhuman primate genomes. Such resources can reasonably be expected to transform nonhuman primate research.

Citing Articles

Novel Insights into the Landscape of Crossover and Noncrossover Events in Rhesus Macaques (Macaca mulatta).

Versoza C, Weiss S, Johal R, La Rosa B, Jensen J, Pfeifer S Genome Biol Evol. 2023; 16(1).

PMID: 38051960 PMC: 10773715. DOI: 10.1093/gbe/evad223.


Revised eutherian gene collections.

Premzl M BMC Genom Data. 2022; 23(1):56.

PMID: 35870891 PMC: 9308196. DOI: 10.1186/s12863-022-01071-9.


MicroRNA Signatures of the Developing Primate Fovea.

Fishman E, Louie M, Miltner A, Cheema S, Wong J, Schlaeger N Front Cell Dev Biol. 2021; 9:654385.

PMID: 33898453 PMC: 8060505. DOI: 10.3389/fcell.2021.654385.


Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes.

Meyer C, Scalzitti N, Jeannin-Girardon A, Collet P, Poch O, Thompson J BMC Bioinformatics. 2020; 21(1):513.

PMID: 33172385 PMC: 7656754. DOI: 10.1186/s12859-020-03855-1.


Comparative genome-wide survey of single nucleotide variation uncovers the genetic diversity and potential biomedical applications among six Macaca species.

Fan Z, Sun T, Peng C, Yue B, Li J Int J Mol Sci. 2018; 19(10).

PMID: 30314376 PMC: 6212917. DOI: 10.3390/ijms19103123.


References
1.
Nagy A, Hegyi H, Farkas K, Tordai H, Kozma E, Banyai L . Identification and correction of abnormal, incomplete and mispredicted proteins in public databases. BMC Bioinformatics. 2008; 9:353. PMC: 2542381. DOI: 10.1186/1471-2105-9-353. View

2.
Marques-Bonet T, Ryder O, Eichler E . Sequencing primate genomes: what have we learned?. Annu Rev Genomics Hum Genet. 2009; 10:355-86. PMC: 6662594. DOI: 10.1146/annurev.genom.9.081307.164420. View

3.
Scally A, Dutheil J, Hillier L, Jordan G, Goodhead I, Herrero J . Insights into hominid evolution from the gorilla genome sequence. Nature. 2012; 483(7388):169-75. PMC: 3303130. DOI: 10.1038/nature10842. View

4.
Kehrer-Sawatzki H, Cooper D . Molecular mechanisms of chromosomal rearrangement during primate evolution. Chromosome Res. 2008; 16(1):41-56. DOI: 10.1007/s10577-007-1207-1. View

5.
Zerbino D, Birney E . Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008; 18(5):821-9. PMC: 2336801. DOI: 10.1101/gr.074492.107. View