Nucleotide-resolution Analysis of Structural Variants Using BreakSeq and a Breakpoint Library
Structural variants (SVs) are a major source of human genomic variation; however, characterizing them at nucleotide resolution remains challenging. Here we assemble a library of breakpoints at nucleotide resolution by collating and standardizing ~2,000 published SVs. For each breakpoint, we infer its ancestral state (through comparison to primate genomes) and its mechanism of formation (e.g., nonallelic homologous recombination, NAHR). We characterize breakpoint sequences with respect to genomic landmarks, chromosomal location, sequence motifs and physical properties, finding that the occurrence of insertions and deletions is more balanced than previously reported and that NAHR-formed breakpoints are associated with relatively rigid, stable DNA helices. Finally, we demonstrate an approach, BreakSeq, for scanning the reads from short-read sequenced genomes against our breakpoint library to accurately identify previously overlooked SVs, which we then validate by PCR. As new data become available, we expect that our BreakSeq approach will become more sensitive and will facilitate rapid SV genotyping of personal genomes.
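The idea of scanning short reads against a breakpoint library can be illustrated with a minimal sketch. It is not the BreakSeq implementation: it assumes exact substring matching in place of read alignment, and the function names, the `flank_len` and `min_overlap` parameters, and the library layout (an SV identifier mapped to a junction sequence and breakpoint offset) are all hypothetical choices made for illustration.

```python
def build_junction(left_flank, right_flank, flank_len=30):
    """Join the bases flanking a breakpoint into one junction sequence.

    Returns the junction and the offset of the breakpoint within it.
    flank_len is an illustrative default, not a value from the paper.
    """
    left = left_flank[-flank_len:]
    right = right_flank[:flank_len]
    return left + right, len(left)


def spans_junction(read, junction, junction_pos, min_overlap=10):
    """True if the read matches the junction exactly and covers at least
    min_overlap bases on each side of the breakpoint."""
    start = junction.find(read)
    while start != -1:
        end = start + len(read)
        left_cover = junction_pos - start
        right_cover = end - junction_pos
        if left_cover >= min_overlap and right_cover >= min_overlap:
            return True
        # Check later occurrences in case the read matches more than once.
        start = junction.find(read, start + 1)
    return False


def count_supporting_reads(reads, library, min_overlap=10):
    """Count, for each library entry, the reads that span its breakpoint.

    library maps an SV identifier to (junction_sequence, breakpoint_offset).
    """
    counts = {sv_id: 0 for sv_id in library}
    for read in reads:
        for sv_id, (junction, pos) in library.items():
            if spans_junction(read, junction, pos, min_overlap):
                counts[sv_id] += 1
    return counts
```

Requiring a minimum overlap on both sides of the breakpoint is the key design choice in this sketch: a read that lies entirely within one flank matches the reference genome equally well and so carries no evidence for the variant. A realistic pipeline would replace the exact-match step with an aligner that tolerates mismatches and would also handle reverse-complemented reads.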