Haplotype-aware Variant Calling with PEPPER-Margin-DeepVariant Enables High Accuracy in Nanopore Long-reads
Long-read sequencing has the potential to transform variant detection by reaching currently difficult-to-map regions and routinely linking together adjacent variations to enable read-based phasing. Third-generation nanopore sequence data have demonstrated a long read length, but current interpretation methods for their novel pore-based signal have unique error profiles, making accurate analysis challenging. Here, we introduce a haplotype-aware variant calling pipeline, PEPPER-Margin-DeepVariant, that produces state-of-the-art variant calling results with nanopore data. We show that our nanopore-based method outperforms the short-read-based single-nucleotide-variant identification method at the whole-genome scale and produces high-quality single-nucleotide variants in segmental duplications and low-mappability regions where short-read-based genotyping fails. We show that our pipeline can provide highly contiguous phase blocks across the genome with nanopore reads, contiguously spanning between 85% and 92% of annotated genes across six samples. We also extend PEPPER-Margin-DeepVariant to PacBio HiFi data, providing an efficient solution with superior performance over the current WhatsHap-DeepVariant standard. Finally, we demonstrate de novo assembly polishing methods that use nanopore and PacBio HiFi reads to produce diploid assemblies with high accuracy (Q35+ nanopore-polished and Q40+ PacBio HiFi-polished).
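The assembly accuracies quoted above (Q35+ and Q40+) are easier to compare once converted to per-base error rates. The sketch below assumes these Q values follow the standard Phred convention, Q = -10 · log10(p), where p is the probability that a base is wrong; the abstract does not spell out the definition, and the function names are illustrative rather than part of PEPPER-Margin-DeepVariant.

```python
import math


def phred_to_error_rate(q: float) -> float:
    """Convert a Phred-scaled quality value to a per-base error probability."""
    return 10 ** (-q / 10)


def error_rate_to_phred(p: float) -> float:
    """Convert a per-base error probability to a Phred-scaled quality value."""
    if not 0 < p <= 1:
        raise ValueError("error probability must be in (0, 1]")
    return -10 * math.log10(p)


if __name__ == "__main__":
    # Assumption: Q35 and Q40 are read as Phred-scaled consensus accuracies.
    for q in (35, 40):
        p = phred_to_error_rate(q)
        print(f"Q{q}: roughly 1 error per {round(1 / p):,} bases")
```

Under this reading, Q35 corresponds to about one error in every 3,162 bases and Q40 to one error in every 10,000 bases, so the HiFi-polished assemblies carry roughly a third as many errors as the nanopore-polished ones.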