In-solution Y-chromosome Capture-enrichment on Ancient DNA Libraries
Overview
Authors
Affiliations
Background: As most ancient biological samples have low levels of endogenous DNA, it is advantageous to enrich for specific genomic regions prior to sequencing. One approach-in-solution capture-enrichment-retrieves sequences of interest and reduces the fraction of microbial DNA. In this work, we implement a capture-enrichment approach targeting informative regions of the Y chromosome in six human archaeological remains excavated in the Caribbean and dated between 200 and 3000 years BP. We compare the recovery rate of Y-chromosome capture (YCC) alone, whole-genome capture followed by YCC (WGC + YCC) versus non-enriched (pre-capture) libraries.
Results: The six samples show different levels of initial endogenous content, with very low (< 0.05%, 4 samples) or low (0.1-1.54%, 2 samples) percentages of sequenced reads mapping to the human genome. We recover 12-9549 times more targeted unique Y-chromosome sequences after capture, where 0.0-6.2% (WGC + YCC) and 0.0-23.5% (YCC) of the sequence reads were on-target, compared to 0.0-0.00003% pre-capture. In samples with endogenous DNA content greater than 0.1%, we found that WGC followed by YCC (WGC + YCC) yields lower enrichment due to the loss of complexity in consecutive capture experiments, whereas in samples with lower endogenous content, the libraries' initial low complexity leads to minor proportions of Y-chromosome reads. Finally, increasing recovery of informative sites enabled us to assign Y-chromosome haplogroups to some of the archeological remains and gain insights about their paternal lineages and origins.
Conclusions: We present to our knowledge the first in-solution capture-enrichment method targeting the human Y-chromosome in aDNA sequencing libraries. YCC and WGC + YCC enrichments lead to an increase in the amount of Y-DNA sequences, as compared to libraries not enriched for the Y-chromosome. Our probe design effectively recovers regions of the Y-chromosome bearing phylogenetically informative sites, allowing us to identify paternal lineages with less sequencing than needed for pre-capture libraries. Finally, we recommend considering the endogenous content in the experimental design and avoiding consecutive rounds of capture, as clonality increases considerably with each round.
Garcia-Fernandez C, Lizano E, Telford M, Olalde I, de Cid R, Larmuseau M Sci Rep. 2022; 12(1):20708.
PMID: 36456614 PMC: 9715704. DOI: 10.1038/s41598-022-25200-7.
Rohrlach A, Papac L, Childebayeva A, Rivollat M, Villalba-Mouco V, Neumann G Sci Rep. 2021; 11(1):15005.
PMID: 34294811 PMC: 8298398. DOI: 10.1038/s41598-021-94491-z.
Bravo-Lopez M, Villa-Islas V, Rocha Arriaga C, Villasenor-Altamirano A, Guzman-Solis A, Sandoval-Velasco M Philos Trans R Soc Lond B Biol Sci. 2020; 375(1812):20190580.
PMID: 33012233 PMC: 7702795. DOI: 10.1098/rstb.2019.0580.
Pierini F, Nutsua M, Bohme L, Ozer O, Bonczarowska J, Susat J Sci Rep. 2020; 10(1):7339.
PMID: 32355290 PMC: 7193575. DOI: 10.1038/s41598-020-64312-w.
The efficacy of whole human genome capture on ancient dental calculus and dentin.
Ziesemer K, Ramos-Madrigal J, Mann A, Brandt B, Sankaranarayanan K, Ozga A Am J Phys Anthropol. 2018; 168(3):496-509.
PMID: 30586168 PMC: 6519167. DOI: 10.1002/ajpa.23763.