» Articles » PMID: 23471300

FreeIbis: an Efficient Basecaller with Calibrated Quality Scores for Illumina Sequencers

Overview
Journal Bioinformatics
Specialty Biology
Date 2013 Mar 9
PMID 23471300
Citations 33
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: The conversion of the raw intensities obtained from next-generation sequencing platforms into nucleotide sequences with well-calibrated quality scores is a critical step in the generation of good sequence data. While recent model-based approaches can yield highly accurate calls, they require a substantial amount of processing time and/or computational resources. We previously introduced Ibis, a fast and accurate basecaller for the Illumina platform. We have continued active development of Ibis to take into account developments in the Illumina technology, as well as to make Ibis fully open source.

Results: We introduce here freeIbis, which offers significant improvements in sequence accuracy owing to the use of a novel multiclass support vector machine (SVM) algorithm. Sequence quality scores are now calibrated based on empirically observed scores, thus providing a high correlation to their respective error rates. These improvements result in downstream advantages including improved genotyping accuracy.

Availability And Implementation: FreeIbis is freely available for use under the GPL (http://bioinf.eva.mpg.de/freeibis/). It requires a Python interpreter and a C++ compiler. Tailored versions of LIBOCAS and LIBLINEAR are distributed along with the package.

Citing Articles

Draft genome sequence of a endosymbiont from (Fritsch, 1958) (Acari, Syringophilidae).

Glowska E, Gerth M Microbiol Resour Announc. 2023; 12(11):e0060523.

PMID: 37882523 PMC: 10652925. DOI: 10.1128/MRA.00605-23.


Melanoma Single-Cell Biology in Experimental and Clinical Settings.

Binder H, Schmidt M, Loeffler-Wirth H, Mortensen L, Kunz M J Clin Med. 2021; 10(3).

PMID: 33535416 PMC: 7867095. DOI: 10.3390/jcm10030506.


Infection Patterns and Fitness Effects of and Symbionts in the Green Lacewing .

Sontowski R, Gerth M, Richter S, Gruppe A, Schlegel M, van Dam N Insects. 2020; 11(12).

PMID: 33297293 PMC: 7762206. DOI: 10.3390/insects11120867.


New perspectives on Neanderthal dispersal and turnover from Stajnia Cave (Poland).

Picin A, Hajdinjak M, Nowaczewska W, Benazzi S, Urbanowski M, Marciszak A Sci Rep. 2020; 10(1):14778.

PMID: 32901061 PMC: 7479612. DOI: 10.1038/s41598-020-71504-x.


Bayesian localization of CNV candidates in WGS data within minutes.

Wiedenhoeft J, Cagan A, Kozhemyakina R, Gulevich R, Schliep A Algorithms Mol Biol. 2019; 14:20.

PMID: 31572486 PMC: 6757390. DOI: 10.1186/s13015-019-0154-7.


References
1.
Whiteford N, Skelly T, Curtis C, Ritchie M, Lohr A, Zaranek A . Swift: primary data analysis for the Illumina Solexa sequencing platform. Bioinformatics. 2009; 25(17):2194-9. PMC: 2734321. DOI: 10.1093/bioinformatics/btp383. View

2.
Li H, Durbin R . Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14):1754-60. PMC: 2705234. DOI: 10.1093/bioinformatics/btp324. View

3.
Kircher M, Stenzel U, Kelso J . Improved base calling for the Illumina Genome Analyzer using machine learning strategies. Genome Biol. 2009; 10(8):R83. PMC: 2745764. DOI: 10.1186/gb-2009-10-8-r83. View

4.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A . The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20(9):1297-303. PMC: 2928508. DOI: 10.1101/gr.107524.110. View

5.
Erlich Y, Mitra P, DelaBastide M, McCombie W, Hannon G . Alta-Cyclic: a self-optimizing base caller for next-generation sequencing. Nat Methods. 2008; 5(8):679-82. PMC: 2978646. DOI: 10.1038/nmeth.1230. View