» Articles » PMID: 36743210

NanoSTR: A Method for Detection of Target Short Tandem Repeats Based on Nanopore Sequencing Data

Overview
Specialty Biology
Date 2023 Feb 6
PMID 36743210
Authors
Affiliations
Soon will be listed here.
Abstract

Short tandem repeats (STRs) are widely present in the human genome. Studies have confirmed that STRs are associated with more than 30 diseases, and they have also been used in forensic identification and paternity testing. However, there are few methods for STR detection based on nanopore sequencing due to the challenges posed by the sequencing principles and the data characteristics of nanopore sequencing. We developed NanoSTR for detection of target STR loci based on the length-number-rank (LNR) information of reads. NanoSTR can be used for STR detection and genotyping based on long-read data from nanopore sequencing with improved accuracy and efficiency compared with other existing methods, such as Tandem-Genotypes and TRiCoLOR. NanoSTR showed 100% concordance with the expected genotypes using error-free simulated data, and also achieved >85% concordance using the standard samples (containing autosomal and Y-chromosomal loci) with MinION sequencing platform, respectively. NanoSTR showed high performance for detection of target STR markers. Although NanoSTR needs further optimization and development, it is useful as an analytical method for the detection of STR loci by nanopore sequencing. This method adds to the toolbox for nanopore-based STR analysis and expands the applications of nanopore sequencing in scientific research and clinical scenarios. The main code and the data are available at https://github.com/langjidong/NanoSTR.

Citing Articles

Navigating triplet repeats sequencing: concepts, methodological challenges and perspective for Huntington's disease.

Maestri S, Scalzo D, Damaggio G, Zobel M, Besusso D, Cattaneo E Nucleic Acids Res. 2024; 53(1.

PMID: 39676657 PMC: 11724279. DOI: 10.1093/nar/gkae1155.


NASTRA: accurate analysis of short tandem repeat markers by nanopore sequencing with repeat-structure-aware algorithm.

Ren Z, Zhang J, Zhang Y, Yang T, Sun P, Xue J Brief Bioinform. 2024; 25(6).

PMID: 39322627 PMC: 11424183. DOI: 10.1093/bib/bbae472.


A comparison of Oxford nanopore library strategies for bacterial genomics.

Sauvage T, Cormier A, Delphine P BMC Genomics. 2023; 24(1):627.

PMID: 37864145 PMC: 10589936. DOI: 10.1186/s12864-023-09729-z.

References
1.
Mitsuhashi S, Frith M, Mizuguchi T, Miyatake S, Toyota T, Adachi H . Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads. Genome Biol. 2019; 20(1):58. PMC: 6425644. DOI: 10.1186/s13059-019-1667-6. View

2.
Paulson H . Repeat expansion diseases. Handb Clin Neurol. 2018; 147:105-123. PMC: 6485936. DOI: 10.1016/B978-0-444-63233-3.00009-9. View

3.
Rang F, Kloosterman W, de Ridder J . From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol. 2018; 19(1):90. PMC: 6045860. DOI: 10.1186/s13059-018-1462-9. View

4.
Collins J, Stephens R, Gold B, Long B, Dean M, Burt S . An exhaustive DNA micro-satellite map of the human genome using high performance computing. Genomics. 2003; 82(1):10-9. DOI: 10.1016/s0888-7543(03)00076-4. View

5.
Harris R, Cechova M, Makova K . Noise-cancelling repeat finder: uncovering tandem repeats in error-prone long-read sequencing data. Bioinformatics. 2019; 35(22):4809-4811. PMC: 6853708. DOI: 10.1093/bioinformatics/btz484. View