» Articles » PMID: 24861618

RAMICS: Trainable, High-speed and Biologically Relevant Alignment of High-throughput Sequencing Reads to Coding DNA

Overview
Specialty Biochemistry
Date 2014 May 28
PMID 24861618
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding regions is critical. To facilitate such analyses, we have developed a novel tool, RAMICS, that is tailored to mapping large numbers of sequence reads to short lengths (<10 000 bp) of coding DNA. RAMICS utilizes profile hidden Markov models to discover the open reading frame of each sequence and aligns to the reference sequence in a biologically relevant manner, distinguishing between genuine codon-sized indels and frameshift mutations. This approach facilitates the generation of highly accurate alignments, accounting for the error biases of the sequencing machine used to generate reads, particularly at homopolymer regions. Performance improvements are gained through the use of graphics processing units, which increase the speed of mapping through parallelization. RAMICS substantially outperforms all other mapping approaches tested in terms of alignment quality while maintaining highly competitive speed performance.

Citing Articles

Identification of genetic risk variants for Type-2 Diabetes mellitus in Pakistani Pashtun population: A case-control association study.

Jan A, Mothana R, Kaimori J, Muhammad T, Khan M, Ali S Pak J Med Sci. 2024; 40(10):2336-2343.

PMID: 39554687 PMC: 11568737. DOI: 10.12669/pjms.40.10.10292.


Decoding type 2 diabetes mellitus genetic risk variants in Pakistani Pashtun ethnic population using the nascent whole exome sequencing and MassARRAY genotyping: A case-control association study.

Jan A, Zakiullah , Ali S, Muhammad B, Arshad A, Shah Y PLoS One. 2023; 18(1):e0281070.

PMID: 36730981 PMC: 9882913. DOI: 10.1371/journal.pone.0281070.


Dry Panels Supporting External Quality Assessment Programs for Next Generation Sequencing-Based HIV Drug Resistance Testing.

Noguera-Julian M, Lee E, Shafer R, Kantor R, Ji H Viruses. 2020; 12(6).

PMID: 32575676 PMC: 7354622. DOI: 10.3390/v12060666.


Cooperation between Strain-Specific and Broadly Neutralizing Responses Limited Viral Escape and Prolonged the Exposure of the Broadly Neutralizing Epitope.

Anthony C, York T, Bekker V, Matten D, Selhorst P, Ferreria R J Virol. 2017; 91(18).

PMID: 28679760 PMC: 5571269. DOI: 10.1128/JVI.00828-17.


Virome Assembly and Annotation: A Surprise in the Namib Desert.

Hesse U, van Heusden P, Kirby B, Olonade I, van Zyl L, Trindade M Front Microbiol. 2017; 8:13.

PMID: 28167933 PMC: 5253355. DOI: 10.3389/fmicb.2017.00013.


References
1.
Choi M, Scholl U, Ji W, Liu T, Tikhonova I, Zumbo P . Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc Natl Acad Sci U S A. 2009; 106(45):19096-101. PMC: 2768590. DOI: 10.1073/pnas.0910672106. View

2.
Guan X, Uberbacher E . Alignments of DNA and protein sequences containing frameshift errors. Comput Appl Biosci. 1996; 12(1):31-40. DOI: 10.1093/bioinformatics/12.1.31. View

3.
Bennett S . Solexa Ltd. Pharmacogenomics. 2004; 5(4):433-8. DOI: 10.1517/14622416.5.4.433. View

4.
Birney E, Clamp M, Durbin R . GeneWise and Genomewise. Genome Res. 2004; 14(5):988-95. PMC: 479130. DOI: 10.1101/gr.1865504. View

5.
Jiang T, Yang L, Jiang H, Tian G, Zhang X . High-performance single-chip exon capture allows accurate whole exome sequencing using the Illumina Genome Analyzer. Sci China Life Sci. 2011; 54(10):945-52. DOI: 10.1007/s11427-011-4232-4. View