» Articles » PMID: 21310087

Identification of Errors Introduced During High Throughput Sequencing of the T Cell Receptor Repertoire

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2011 Feb 12
PMID 21310087
Citations 45
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Recent advances in massively parallel sequencing have increased the depth at which T cell receptor (TCR) repertoires can be probed by >3log10, allowing for saturation sequencing of immune repertoires. The resolution of this sequencing is dependent on its accuracy, and direct assessments of the errors formed during high throughput repertoire analyses are limited.

Results: We analyzed 3 monoclonal TCR from TCR transgenic, Rag-/- mice using Illumina® sequencing. A total of 27 sequencing reactions were performed for each TCR using a trifurcating design in which samples were divided into 3 at significant processing junctures. More than 20 million complementarity determining region (CDR) 3 sequences were analyzed. Filtering for lower quality sequences diminished but did not eliminate sequence errors, which occurred within 1-6% of sequences. Erroneous sequences were pre-dominantly of correct length and contained single nucleotide substitutions. Rates of specific substitutions varied dramatically in a position-dependent manner. Four substitutions, all purine-pyrimidine transversions, predominated. Solid phase amplification and sequencing rather than liquid sample amplification and preparation appeared to be the primary sources of error. Analysis of polyclonal repertoires demonstrated the impact of error accumulation on data parameters.

Conclusions: Caution is needed in interpreting repertoire data due to potential contamination with mis-sequence reads. However, a high association of errors with phred score, high relatedness of erroneous sequences with the parental sequence, dominance of specific nt substitutions, and skewed ratio of forward to reverse reads among erroneous sequences indicate approaches to filter erroneous sequences from repertoire data sets.

Citing Articles

Machine Learning Approaches to TCR Repertoire Analysis.

Katayama Y, Yokota R, Akiyama T, Kobayashi T Front Immunol. 2022; 13:858057.

PMID: 35911778 PMC: 9334875. DOI: 10.3389/fimmu.2022.858057.


Analysis of T-Cell Receptor Repertoire in Transplantation: Fingerprint of T Cell-mediated Alloresponse.

Tian G, Li M, Lv G Front Immunol. 2022; 12:778559.

PMID: 35095851 PMC: 8790170. DOI: 10.3389/fimmu.2021.778559.


High-throughput and single-cell T cell receptor sequencing technologies.

Pai J, Satpathy A Nat Methods. 2021; 18(8):881-892.

PMID: 34282327 PMC: 9345561. DOI: 10.1038/s41592-021-01201-8.


Dynamics of thymus function and T cell receptor repertoire breadth in health and disease.

Granadier D, Iovino L, Kinsella S, Dudakov J Semin Immunopathol. 2021; 43(1):119-134.

PMID: 33608819 PMC: 7894242. DOI: 10.1007/s00281-021-00840-5.


Rational "Error Elimination" Approach to Evaluating Molecular Barcoded Next-Generation Sequencing Data Identifies Low-Frequency Mutations in Hematologic Malignancies.

Mallampati S, Duose D, Harmon M, Mehrotra M, Kanagal-Shamanna R, Zalles S J Mol Diagn. 2019; 21(3):471-482.

PMID: 30794984 PMC: 6521894. DOI: 10.1016/j.jmoldx.2019.01.008.


References
1.
Day E, Carmichael A, Ten Berge I, Waller E, Sissons J, Wills M . Rapid CD8+ T cell repertoire focusing and selection of high-affinity clones into memory following primary infection with a persistent human virus: human cytomegalovirus. J Immunol. 2007; 179(5):3203-13. DOI: 10.4049/jimmunol.179.5.3203. View

2.
Liu X, Nguyen P, Liu W, Cheng C, Steeves M, Obenauer J . T cell receptor CDR3 sequence but not recognition characteristics distinguish autoreactive effector and Foxp3(+) regulatory T cells. Immunity. 2009; 31(6):909-20. PMC: 2878844. DOI: 10.1016/j.immuni.2009.09.023. View

3.
Venturi V, Price D, Douek D, Davenport M . The molecular basis for public T-cell responses?. Nat Rev Immunol. 2008; 8(3):231-8. DOI: 10.1038/nri2260. View

4.
Kedzierska K, La Gruta N, Stambas J, Turner S, Doherty P . Tracking phenotypically and functionally distinct T cell subsets via T cell repertoire diversity. Mol Immunol. 2007; 45(3):607-18. PMC: 2237887. DOI: 10.1016/j.molimm.2006.05.017. View

5.
Blank A, Gallant J, Burgess R, Loeb L . An RNA polymerase mutant with reduced accuracy of chain elongation. Biochemistry. 1986; 25(20):5920-8. DOI: 10.1021/bi00368a013. View