» Articles » PMID: 25288881

Review of Current Methods, Applications, and Data Management for the Bioinformatics Analysis of Whole Exome Sequencing

Overview
Journal Cancer Inform
Publisher Sage Publications
Date 2014 Oct 8
PMID 25288881
Citations 76
Authors
Affiliations
Soon will be listed here.
Abstract

The advent of next-generation sequencing technologies has greatly promoted advances in the study of human diseases at the genomic, transcriptomic, and epigenetic levels. Exome sequencing, where the coding region of the genome is captured and sequenced at a deep level, has proven to be a cost-effective method to detect disease-causing variants and discover gene targets. In this review, we outline the general framework of whole exome sequence data analysis. We focus on established bioinformatics tools and applications that support five analytical steps: raw data quality assessment, pre-processing, alignment, post-processing, and variant analysis (detection, annotation, and prioritization). We evaluate the performance of open-source alignment programs and variant calling tools using simulated and benchmark datasets, and highlight the challenges posed by the lack of concordance among variant detection tools. Based on these results, we recommend adopting multiple tools and resources to reduce false positives and increase the sensitivity of variant calling. In addition, we briefly discuss the current status and solutions for big data management, analysis, and summarization in the field of bioinformatics.

Citing Articles

Omics Biology in Diagnosis of Diseases: Towards Empowering Genomic Medicine from an Evolutionary Perspective.

Maldonado E, Khan I Life (Basel). 2025; 14(12.

PMID: 39768344 PMC: 11679243. DOI: 10.3390/life14121637.


From Tradition to Innovation: Diverse Molecular Techniques in the Fight Against Infectious Diseases.

Alsharksi A, Sirekbasan S, Gurkok-Tan T, Mustapha A Diagnostics (Basel). 2025; 14(24.

PMID: 39767237 PMC: 11674978. DOI: 10.3390/diagnostics14242876.


PathVar: A Customisable NGS Variant Calling Algorithm Implicates Novel Candidate Genes and Pathways in Hemiplegic Migraine.

Alfayyadh M, Maksemous N, Sutherland H, Lea R, Griffiths L Clin Genet. 2024; 107(2):157-168.

PMID: 39394929 PMC: 11725560. DOI: 10.1111/cge.14625.


NGS-Based Identification of Two Novel Mutations in Female Patients with Early-Onset Epilepsy.

Szalai R, Hadzsiev K, Till A, Fogarasi A, Bodo T, Buki G Int J Mol Sci. 2024; 25(11).

PMID: 38891919 PMC: 11171991. DOI: 10.3390/ijms25115732.


In silico bioprospecting of receptors associated with the mechanism of action of Rondonin, an antifungal peptide from spider haemolymph.

Muniz Seif E, Icimoto M, Junior P In Silico Pharmacol. 2024; 12(1):55.

PMID: 38863478 PMC: 11162988. DOI: 10.1007/s40203-024-00224-1.


References
1.
Liu X, Han S, Wang Z, Gelernter J, Yang B . Variant callers for next-generation sequencing data: a comparison study. PLoS One. 2013; 8(9):e75619. PMC: 3785481. DOI: 10.1371/journal.pone.0075619. View

2.
Rhodes D, Kalyana-Sundaram S, Mahavisno V, Varambally R, Yu J, Briggs B . Oncomine 3.0: genes, pathways, and networks in a collection of 18,000 cancer gene expression profiles. Neoplasia. 2007; 9(2):166-80. PMC: 1813932. DOI: 10.1593/neo.07112. View

3.
Ionita-Laza I, Buxbaum J, Laird N, Lange C . A new testing strategy to identify rare variants with either risk or protective effect on disease. PLoS Genet. 2011; 7(2):e1001289. PMC: 3033379. DOI: 10.1371/journal.pgen.1001289. View

4.
Kim S, Jeong K, Bhutani K, Lee J, Patel A, Scott E . Virmid: accurate detection of somatic mutations with sample impurity inference. Genome Biol. 2013; 14(8):R90. PMC: 4054681. DOI: 10.1186/gb-2013-14-8-r90. View

5.
Goh V, Helbling D, Biank V, Jarzembowski J, Dimmock D . Next-generation sequencing facilitates the diagnosis in a child with twinkle mutations causing cholestatic liver failure. J Pediatr Gastroenterol Nutr. 2011; 54(2):291-4. DOI: 10.1097/MPG.0b013e318227e53c. View