» Articles » PMID: 27142414

DAMe: a Toolkit for the Initial Processing of Datasets with PCR Replicates of Double-tagged Amplicons for DNA Metabarcoding Analyses

Overview
Journal BMC Res Notes
Publisher Biomed Central
Date 2016 May 5
PMID 27142414
Citations 15
Authors
Affiliations
Soon will be listed here.
Abstract

Background: DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way.

Results: We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe.

Conclusions: DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.

Citing Articles

Experimental evaluation of genetic variability based on DNA metabarcoding from the aquatic environment: Insights from the Leray fragment.

Turanov S, Koltsova M, Rutenko O Ecol Evol. 2024; 14(7):e11631.

PMID: 38966247 PMC: 11222756. DOI: 10.1002/ece3.11631.


Persisting roadblocks in arthropod monitoring using non-destructive metabarcoding from collection media of passive traps.

Sire L, Schmidt Yanez P, Bezier A, Courtial B, Mbedi S, Sparmann S PeerJ. 2023; 11:e16022.

PMID: 37842065 PMC: 10573316. DOI: 10.7717/peerj.16022.


Extracting abundance information from DNA-based data.

Luo M, Ji Y, Warton D, Yu D Mol Ecol Resour. 2022; 23(1):174-189.

PMID: 35986714 PMC: 10087802. DOI: 10.1111/1755-0998.13703.


Measuring protected-area effectiveness using vertebrate distributions from leech iDNA.

Ji Y, Baker C, Popescu V, Wang J, Wu C, Wang Z Nat Commun. 2022; 13(1):1555.

PMID: 35322033 PMC: 8943135. DOI: 10.1038/s41467-022-28778-8.


Climate-induced forest dieback drives compositional changes in insect communities that are more pronounced for rare species.

Sire L, Schmidt Yanez P, Wang C, Bezier A, Courtial B, Cours J Commun Biol. 2022; 5(1):57.

PMID: 35042989 PMC: 8766456. DOI: 10.1038/s42003-021-02968-4.


References
1.
Schnell I, Bohmann K, Gilbert M . Tag jumps illuminated--reducing sequence-to-sample misidentifications in metabarcoding studies. Mol Ecol Resour. 2015; 15(6):1289-303. DOI: 10.1111/1755-0998.12402. View

2.
Lindgreen S . AdapterRemoval: easy cleaning of next-generation sequencing reads. BMC Res Notes. 2012; 5:337. PMC: 3532080. DOI: 10.1186/1756-0500-5-337. View

3.
Thomas R, Nickerson E, Simons J, Janne P, Tengs T, Yuza Y . Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing. Nat Med. 2006; 12(7):852-5. DOI: 10.1038/nm1437. View

4.
Quince C, Lanzen A, Curtis T, Davenport R, Hall N, Head I . Accurate determination of microbial diversity from 454 pyrosequencing data. Nat Methods. 2009; 6(9):639-41. DOI: 10.1038/nmeth.1361. View

5.
Deagle B, Thomas A, Shaffer A, Trites A, Jarman S . Quantifying sequence proportions in a DNA-based diet study using Ion Torrent amplicon sequencing: which counts count?. Mol Ecol Resour. 2013; 13(4):620-33. DOI: 10.1111/1755-0998.12103. View