» Articles » PMID: 38704592

A New Long-read Mitochondrial-genome Protocol (PacBio HiFi) for Haemosporidian Parasites: a Tool for Population and Biodiversity Studies

Overview
Journal Malar J
Publisher Biomed Central
Specialty Tropical Medicine
Date 2024 May 4
PMID 38704592
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Studies on haemosporidian diversity, including origin of human malaria parasites, malaria's zoonotic dynamic, and regional biodiversity patterns, have used target gene approaches. However, current methods have a trade-off between scalability and data quality. Here, a long-read Next-Generation Sequencing protocol using PacBio HiFi is presented. The data processing is supported by a pipeline that uses machine-learning for analysing the reads.

Methods: A set of primers was designed to target approximately 6 kb, almost the entire length of the haemosporidian mitochondrial genome. Amplicons from different samples were multiplexed in an SMRTbell® library preparation. A pipeline (HmtG-PacBio Pipeline) to process the reads is also provided; it integrates multiple sequence alignments, a machine-learning algorithm that uses modified variational autoencoders, and a clustering method to identify the mitochondrial haplotypes/species in a sample. Although 192 specimens could be studied simultaneously, a pilot experiment with 15 specimens is presented, including in silico experiments where multiple data combinations were tested.

Results: The primers amplified various haemosporidian parasite genomes and yielded high-quality mt genome sequences. This new protocol allowed the detection and characterization of mixed infections and co-infections in the samples. The machine-learning approach converged into reproducible haplotypes with a low error rate, averaging 0.2% per read (minimum of 0.03% and maximum of 0.46%). The minimum recommended coverage per haplotype is 30X based on the detected error rates. The pipeline facilitates inspecting the data, including a local blast against a file of provided mitochondrial sequences that the researcher can customize.

Conclusions: This is not a diagnostic approach but a high-throughput method to study haemosporidian sequence assemblages and perform genotyping by targeting the mitochondrial genome. Accordingly, the methodology allowed for examining specimens with multiple infections and co-infections of different haemosporidian parasites. The pipeline enables data quality assessment and comparison of the haplotypes obtained to those from previous studies. Although a single locus approach, whole mitochondrial data provide high-quality information to characterize species pools of haemosporidian parasites.

References
1.
Bernotiene R, Palinauskas V, Iezhova T, Murauskaite D, Valkiunas G . Avian haemosporidian parasites (Haemosporida): A comparative analysis of different polymerase chain reaction assays in detection of mixed infections. Exp Parasitol. 2016; 163:31-7. DOI: 10.1016/j.exppara.2016.01.009. View

2.
Perez-Tris J, Bensch S . Diagnosing genetically diverse avian malarial infections using mixed-sequence analysis and TA-cloning. Parasitology. 2005; 131(Pt 1):15-23. DOI: 10.1017/s003118200500733x. View

3.
Katoh K, Standley D . MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013; 30(4):772-80. PMC: 3603318. DOI: 10.1093/molbev/mst010. View

4.
Cheng Q, Cunningham J, Gatton M . Systematic review of sub-microscopic P. vivax infections: prevalence and determining factors. PLoS Negl Trop Dis. 2015; 9(1):e3413. PMC: 4288718. DOI: 10.1371/journal.pntd.0003413. View

5.
Lee K, Divis P, Zakaria S, Matusop A, Julin R, Conway D . Plasmodium knowlesi: reservoir hosts and tracking the emergence in humans and macaques. PLoS Pathog. 2011; 7(4):e1002015. PMC: 3072369. DOI: 10.1371/journal.ppat.1002015. View