A Massively Parallel Strategy for STR Marker Development, Capture, and Genotyping
Overview
Authors
Affiliations
Short tandem repeat (STR) variants are highly polymorphic markers that facilitate powerful population genetic analyses. STRs are especially valuable in conservation and ecological genetic research, yielding detailed information on population structure and short-term demographic fluctuations. Massively parallel sequencing has not previously been leveraged for scalable, efficient STR recovery. Here, we present a pipeline for developing STR markers directly from high-throughput shotgun sequencing data without a reference genome, and an approach for highly parallel target STR recovery. We employed our approach to capture a panel of 5000 STRs from a test group of diademed sifakas (Propithecus diadema, n = 3), endangered Malagasy rainforest lemurs, and we report extremely efficient recovery of targeted loci-97.3-99.6% of STRs characterized with ≥10x non-redundant sequence coverage. We then tested our STR capture strategy on P. diadema fecal DNA, and report robust initial results and suggestions for future implementations. In addition to STR targets, this approach also generates large, genome-wide single nucleotide polymorphism (SNP) panels from flanking regions. Our method provides a cost-effective and scalable solution for rapid recovery of large STR and SNP datasets in any species without needing a reference genome, and can be used even with suboptimal DNA more easily acquired in conservation and ecological studies.
Wang X, Budowle B, Ge J BMC Bioinformatics. 2022; 23(1):497.
PMID: 36402991 PMC: 9675219. DOI: 10.1186/s12859-022-05021-1.
Chen J, Li F, Wang M, Li J, Marquez-Lago T, Leier A Front Big Data. 2022; 4:727216.
PMID: 35118375 PMC: 8805145. DOI: 10.3389/fdata.2021.727216.
Population-level inferences from environmental DNA-Current status and future perspectives.
Sigsgaard E, Jensen M, Winkelmann I, Rask Moller P, Hansen M, Thomsen P Evol Appl. 2020; 13(2):245-262.
PMID: 31993074 PMC: 6976968. DOI: 10.1111/eva.12882.
Genetic and genomic monitoring with minimally invasive sampling methods.
Carroll E, Bruford M, DeWoody J, Leroy G, Strand A, Waits L Evol Appl. 2018; 11(7):1094-1119.
PMID: 30026800 PMC: 6050181. DOI: 10.1111/eva.12600.
SONiCS: PCR stutter noise correction in genome-scale microsatellites.
Kedzierska K, Gerber L, Cagnazzi D, Krutzen M, Ratan A, Kistler L Bioinformatics. 2018; 34(23):4115-4117.
PMID: 29931218 PMC: 6454461. DOI: 10.1093/bioinformatics/bty485.