
An Easy-to-use Pipeline to Analyze Amplicon-based Next Generation Sequencing Results of Human Mitochondrial DNA from Degraded Samples

Overview
Journal PLoS One
Date 2024 Nov 21
PMID 39570888
Abstract

Genome and transcriptome examinations have become more common due to Next-Generation Sequencing (NGS), which significantly increases throughput and depth of coverage while reducing costs and time. Mitochondrial DNA (mtDNA) is often the marker of choice in degraded samples from archaeological and forensic contexts, as its higher copy number can improve the success of the experiment. Among other sequencing strategies, amplicon-based NGS techniques are currently being used to obtain enough data for analysis. Some pipelines are designed for the analysis of ancient mtDNA samples and others for the analysis of amplicon data. However, these pipelines pose a challenge for non-expert users and often cannot address the particularities of ancient and forensic DNA and of amplicon-based sequencing simultaneously. To overcome these challenges, a user-friendly bioinformatic tool was developed to analyze the non-coding region of human mtDNA from degraded samples recovered in archaeological and forensic contexts. The tool can be easily modified to fit the specifications of other amplicon-based NGS experiments. A comparative analysis was conducted between two duplicate-removal tools: MarkDuplicates from Picard and the dedup parameter of fastp. Additionally, various thresholds of PMDtools, a specialized tool designed for extracting reads affected by post-mortem damage, were tested. Finally, the depth of coverage of each amplicon was correlated with its level of damage. The results indicated that dedup is the better tool for removing duplicates, since it retains more non-repeated reads that MarkDuplicates removes. In PMDtools, a threshold of PMDS = 1 allowed better differentiation between present-day and ancient samples in terms of damage, without losing too many reads in the process. These two bioinformatic tools were added to a pipeline designed to obtain both the haplotype and the haplogroup of mtDNA.
Furthermore, the pipeline presented in this study generates information about the quality and possible contamination of the sample. The pipeline is designed to automate mtDNA analysis. However, particularly for ancient samples, some manual analyses may be required to fully validate results: the amplicons that tended to be more easily recovered were the ones that had fewer reads with damage, indicating that special care must be taken with poorly recovered samples.
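
The correlation between per-amplicon depth of coverage and damage mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which correlation statistic or damage measure the authors used, so the choice of Spearman's rank correlation, the "fraction of reads with damage" measure, the amplicon names, and all numbers below are assumptions made purely for illustration; none of them are data from the study.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-amplicon summaries (illustrative values only):
# amplicon name -> (mean depth of coverage, fraction of reads with damage)
amplicons = {
    "amp01": (1200, 0.02),
    "amp02": (800, 0.05),
    "amp03": (300, 0.11),
    "amp04": (150, 0.18),
}

depths = [depth for depth, _ in amplicons.values()]
damage = [fraction for _, fraction in amplicons.values()]

rho = spearman(depths, damage)
print(f"Spearman rho = {rho:.2f}")
```

A rank-based statistic is used here because read depth across amplicons is often skewed. A strongly negative value would correspond to the pattern reported in the abstract, where better-recovered amplicons carry fewer damaged reads.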
