» Articles » PMID: 23776689

Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies

Overview
Journal PLoS One
Date 2013 Jun 19
PMID 23776689
Citations 42
Authors
Affiliations
Soon will be listed here.
Abstract

The emergence of high-throughput, next-generation sequencing technologies has dramatically altered the way we assess genomes in population genetics and in cancer genomics. Currently, there are four commonly used whole-genome sequencing platforms on the market: Illumina's HiSeq2000, Life Technologies' SOLiD 4 and its completely redesigned 5500xl SOLiD, and Complete Genomics' technology. A number of earlier studies have compared a subset of those sequencing platforms or compared those platforms with Sanger sequencing, which is prohibitively expensive for whole genome studies. Here we present a detailed comparison of the performance of all currently available whole genome sequencing platforms, especially regarding their ability to call SNVs and to evenly cover the genome and specific genomic regions. Unlike earlier studies, we base our comparison on four different samples, allowing us to assess the between-sample variation of the platforms. We find a pronounced GC bias in GC-rich regions for Life Technologies' platforms, with Complete Genomics performing best here, while we see the least bias in GC-poor regions for HiSeq2000 and 5500xl. HiSeq2000 gives the most uniform coverage and displays the least sample-to-sample variation. In contrast, Complete Genomics exhibits by far the smallest fraction of bases not covered, while the SOLiD platforms reveal remarkable shortcomings, especially in covering CpG islands. When comparing the performance of the four platforms for calling SNPs, HiSeq2000 and Complete Genomics achieve the highest sensitivity, while the SOLiD platforms show the lowest false positive rate. Finally, we find that integrating sequencing data from different platforms offers the potential to combine the strengths of different technologies. In summary, our results detail the strengths and weaknesses of all four whole-genome sequencing platforms. It indicates application areas that call for a specific sequencing platform and disallow other platforms. This helps to identify the proper sequencing platform for whole genome studies with different application scopes.

Citing Articles

Whole-Genome Sequence and Pathogenicity Analysis of Providencia Heimbachae Causing Diarrhea in Weaned Piglets.

Xiang K, Zhang Z, Li N, Zhang P, Liu F, Li H Curr Microbiol. 2023; 80(11):364.

PMID: 37812274 DOI: 10.1007/s00284-023-03478-8.


Genetic identification of avian samples recovered from solar energy installations.

Gruppi C, Sanzenbacher P, Balekjian K, Hagar R, Hagen S, Rayne C PLoS One. 2023; 18(9):e0289949.

PMID: 37672506 PMC: 10482291. DOI: 10.1371/journal.pone.0289949.


Methods to improve the accuracy of next-generation sequencing.

Cheng C, Fei Z, Xiao P Front Bioeng Biotechnol. 2023; 11:982111.

PMID: 36741756 PMC: 9895957. DOI: 10.3389/fbioe.2023.982111.


Evaluation of the correctable decoding sequencing as a new powerful strategy for DNA sequencing.

Cheng C, Xiao P Life Sci Alliance. 2022; 5(8).

PMID: 35422436 PMC: 9012935. DOI: 10.26508/lsa.202101294.


Identification of Copy Number Alterations from Next-Generation Sequencing Data.

Nabavi S, Zare F Adv Exp Med Biol. 2022; 1361:55-74.

PMID: 35230683 DOI: 10.1007/978-3-030-91836-1_4.


References
1.
Luo C, Tsementzi D, Kyrpides N, Read T, Konstantinidis K . Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample. PLoS One. 2012; 7(2):e30087. PMC: 3277595. DOI: 10.1371/journal.pone.0030087. View

2.
Li H, Durbin R . Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14):1754-60. PMC: 2705234. DOI: 10.1093/bioinformatics/btp324. View

3.
Abecasis G, Altshuler D, Auton A, Brooks L, Durbin R, Gibbs R . A map of human genome variation from population-scale sequencing. Nature. 2010; 467(7319):1061-73. PMC: 3042601. DOI: 10.1038/nature09534. View

4.
Irizarry R, Ladd-Acosta C, Wen B, Wu Z, Montano C, Onyango P . The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat Genet. 2009; 41(2):178-186. PMC: 2729128. DOI: 10.1038/ng.298. View

5.
Suzuki S, Ono N, Furusawa C, Ying B, Yomo T . Comparison of sequence reads obtained from three next-generation sequencing platforms. PLoS One. 2011; 6(5):e19534. PMC: 3096631. DOI: 10.1371/journal.pone.0019534. View