» Articles » PMID: 24501098

Visualization and Probability-based Scoring of Structural Variants Within Repetitive Sequences

Overview
Journal Bioinformatics
Specialty Biology
Date 2014 Feb 7
PMID 24501098
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Repetitive sequences account for approximately half of the human genome. Accurately ascertaining sequences in these regions with next generation sequencers is challenging, and requires a different set of analytical techniques than for reads originating from unique sequences. Complicating the matter are repetitive regions subject to programmed rearrangements, as is the case with the antigen-binding domains in the Immunoglobulin (Ig) and T-cell receptor (TCR) loci.

Results: We developed a probability-based score and visualization method to aid in distinguishing true structural variants from alignment artifacts. We demonstrate the usefulness of this method in its ability to separate real structural variants from false positives generated with existing upstream analysis tools. We validated our approach using both target-capture and whole-genome experiments. Capture sequencing reads were generated from primary lymphoid tumors, cancer cell lines and an EBV-transformed lymphoblast cell line over the Ig and TCR loci. Whole-genome sequencing reads were from a lymphoblastoid cell-line.

Availability: We implement our method as an R package available at https://github.com/Eitan177/targetSeqView. Code to reproduce the figures and results are also available.

Citing Articles

Visualization tools for human structural variations identified by whole-genome sequencing.

Yokoyama T, Kasahara M J Hum Genet. 2019; 65(1):49-60.

PMID: 31666648 PMC: 8075883. DOI: 10.1038/s10038-019-0687-0.

References
1.
Yu X, Guda K, Willis J, Veigl M, Wang Z, Markowitz S . How do alignment programs perform on sequencing data with varying qualities and from repetitive regions?. BioData Min. 2012; 5(1):6. PMC: 3414812. DOI: 10.1186/1756-0381-5-6. View

2.
Halper-Stromberg E, Steranka J, Giraldo-Castillo N, Fuller T, Desiderio S, Burns K . Fine mapping of V(D)J recombinase mediated rearrangements in human lymphoid malignancies. BMC Genomics. 2013; 14:565. PMC: 3846541. DOI: 10.1186/1471-2164-14-565. View

3.
Schinzel A . Microdeletion syndromes, balanced translocations, and gene mapping. J Med Genet. 1988; 25(7):454-62. PMC: 1050522. DOI: 10.1136/jmg.25.7.454. View

4.
Zhu M, Need A, Han Y, Ge D, Maia J, Zhu Q . Using ERDS to infer copy-number variants in high-coverage genomes. Am J Hum Genet. 2012; 91(3):408-21. PMC: 3511991. DOI: 10.1016/j.ajhg.2012.07.004. View

5.
Halper-Stromberg E, Frelin L, Ruczinski I, Scharpf R, Jie C, Carvalho B . Performance assessment of copy number microarray platforms using a spike-in experiment. Bioinformatics. 2011; 27(8):1052-60. PMC: 3072561. DOI: 10.1093/bioinformatics/btr106. View