» Articles » PMID: 30379572

Whole-Genome Alignment and Comparative Annotation

Overview
Publisher Annual Reviews
Date 2018 Nov 1
PMID 30379572
Citations 45
Authors
Affiliations
Soon will be listed here.
Abstract

Rapidly improving sequencing technology coupled with computational developments in sequence assembly are making reference-quality genome assembly economical. Hundreds of vertebrate genome assemblies are now publicly available, and projects are being proposed to sequence thousands of additional species in the next few years. Such dense sampling of the tree of life should give an unprecedented new understanding of evolution and allow a detailed determination of the events that led to the wealth of biodiversity around us. To gain this knowledge, these new genomes must be compared through genome alignment (at the sequence level) and comparative annotation (at the gene level). However, different alignment and annotation methods have different characteristics; before starting a comparative genomics analysis, it is important to understand the nature of, and biases and limitations inherent in, the chosen methods. This review is intended to act as a technical but high-level overview of the field that should provide this understanding. We briefly survey the state of the genome alignment and comparative annotation fields and potential future directions for these fields in a new, large-scale era of comparative genomics.

Citing Articles

A DNA language model based on multispecies alignment predicts the effects of genome-wide variants.

Benegas G, Albors C, Aw A, Ye C, Song Y Nat Biotechnol. 2025; .

PMID: 39747647 DOI: 10.1038/s41587-024-02511-w.


Unraveling the organellar genomic landscape of the therapeutic and entheogenic plant Mimosa tenuiflora: insights into genetic, structural, and evolutionary dynamics.

Trinca V, Silva S, Almeida J, Miranda V, Costa-Macedo J, Carnaval T Funct Integr Genomics. 2024; 25(1):3.

PMID: 39738702 DOI: 10.1007/s10142-024-01511-y.


KegAlign: Optimizing pairwise alignments with diagonal partitioning.

Gulhan A, Burhans R, Harris R, Kandemir M, Haeussler M, Nekrutenko A bioRxiv. 2024; .

PMID: 39282333 PMC: 11398343. DOI: 10.1101/2024.09.02.610839.


Unraveling genomic features and phylogenomics through the analysis of three Mexican endemic genomes.

Gutierrez E, Maldonado J, Castellanos-Morales G, Eguiarte L, Martinez-Mendez N, Ortega J PeerJ. 2024; 12:e17651.

PMID: 38993980 PMC: 11238727. DOI: 10.7717/peerj.17651.


ACMGA: a reference-free multiple-genome alignment pipeline for plant species.

Zhou H, Su X, Song B BMC Genomics. 2024; 25(1):515.

PMID: 38796435 PMC: 11127342. DOI: 10.1186/s12864-024-10430-y.


References
1.
Robinson G, Hackett K, Purcell-Miramontes M, Brown S, Evans J, Goldsmith M . Creating a buzz about insect genomes. Science. 2011; 331(6023):1386. DOI: 10.1126/science.331.6023.1386. View

2.
Flicek P, Keibler E, Hu P, Korf I, Brent M . Leveraging the mouse genome for gene prediction in human: from whole-genome shotgun reads to a global synteny map. Genome Res. 2003; 13(1):46-54. PMC: 430948. DOI: 10.1101/gr.830003. View

3.
Rivas E, Eddy S . Noncoding RNA gene detection using comparative sequence analysis. BMC Bioinformatics. 2002; 2:8. PMC: 64605. DOI: 10.1186/1471-2105-2-8. View

4.
Mirarab S, Bayzid M, Boussau B, Warnow T . Statistical binning enables an accurate coalescent-based estimation of the avian tree. Science. 2014; 346(6215):1250463. DOI: 10.1126/science.1250463. View

5.
Li H, Durbin R . Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14):1754-60. PMC: 2705234. DOI: 10.1093/bioinformatics/btp324. View