» Articles » PMID: 39110522

ModDotPlot-rapid and Interactive Visualization of Tandem Repeats

Overview
Journal Bioinformatics
Specialty Biology
Date 2024 Aug 7
PMID 39110522
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy.

Results: In this work, we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mb genome of Arabidopsis thaliana in under 5 min on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes.

Availability And Implementation: ModDotPlot is available at https://github.com/marbl/ModDotPlot.

Citing Articles

EvANI benchmarking workflow for evolutionary distance estimation.

Majidian S, Hwang S, Zakeri M, Langmead B bioRxiv. 2025; .

PMID: 40027788 PMC: 11870633. DOI: 10.1101/2025.02.23.639716.


Repeat-based holocentromeres of the woodrush Luzula sylvatica reveal insights into the evolutionary transition to holocentricity.

Mata-Sucre Y, Kratka M, Oliveira L, Neumann P, Macas J, Schubert V Nat Commun. 2024; 15(1):9565.

PMID: 39500889 PMC: 11538461. DOI: 10.1038/s41467-024-53944-5.


Pangenome graph analysis reveals extensive effector copy-number variation in spinach downy mildew.

Skiadas P, Riera Vidal S, Dommisse J, Mendel M, Elberse J, Van den Ackerveken G PLoS Genet. 2024; 20(10):e1011452.

PMID: 39453979 PMC: 11540230. DOI: 10.1371/journal.pgen.1011452.


Complex genetic variation in nearly complete human genomes.

Logsdon G, Ebert P, Audano P, Loftus M, Porubsky D, Ebler J bioRxiv. 2024; .

PMID: 39372794 PMC: 11451754. DOI: 10.1101/2024.09.24.614721.


Genome assemblies for (Teleostei: Cichlidae) identify a novel candidate gene for vertebrate sex determination, RIN3.

Behrens K, Koblmuller S, Kocher T Front Genet. 2024; 15:1447628.

PMID: 39221227 PMC: 11361979. DOI: 10.3389/fgene.2024.1447628.


References
1.
Rhie A, Nurk S, Cechova M, Hoyt S, Taylor D, Altemose N . The complete sequence of a human Y chromosome. Nature. 2023; 621(7978):344-354. PMC: 10752217. DOI: 10.1038/s41586-023-06457-y. View

2.
Kille B, Garrison E, Treangen T, Phillippy A . Minmers are a generalization of minimizers that enable unbiased local Jaccard estimation. Bioinformatics. 2023; 39(9). PMC: 10505501. DOI: 10.1093/bioinformatics/btad512. View

3.
Vollger M, Kerpedjiev P, Phillippy A, Eichler E . StainedGlass: interactive visualization of massive tandem repeat structures with identity heatmaps. Bioinformatics. 2022; 38(7):2049-2051. PMC: 8963321. DOI: 10.1093/bioinformatics/btac018. View

4.
Wlodzimierz P, Rabanal F, Burns R, Naish M, Primetis E, Scott A . Cycles of satellite and transposon evolution in Arabidopsis centromeres. Nature. 2023; 618(7965):557-565. DOI: 10.1038/s41586-023-06062-z. View

5.
Sahlin K, Baudeau T, Cazaux B, Marchet C . A survey of mapping algorithms in the long-reads era. Genome Biol. 2023; 24(1):133. PMC: 10236595. DOI: 10.1186/s13059-023-02972-3. View