» Articles » PMID: 35545449

Automated Annotation of Human Centromeres with HORmon

Overview
Journal Genome Res
Specialty Genetics
Date 2022 May 11
PMID 35545449
Authors
Affiliations
Soon will be listed here.
Abstract

Recent advances in long-read sequencing opened a possibility to address the long-standing questions about the architecture and evolution of human centromeres. They also emphasized the need for centromere annotation (partitioning human centromeres into monomers and higher-order repeats [HORs]). Although there was a half-century-long series of semi-manual studies of centromere architecture, a rigorous centromere annotation algorithm is still lacking. Moreover, an automated centromere annotation is a prerequisite for studies of genetic diseases associated with centromeres and evolutionary studies of centromeres across multiple species. Although the monomer decomposition (transforming a centromere into a monocentromere written in the monomer alphabet) and the HOR decomposition (representing a monocentromere in the alphabet of HORs) are currently viewed as two separate problems, we show that they should be integrated into a single framework in such a way that HOR (monomer) inference affects monomer (HOR) inference. We thus developed the HORmon algorithm that integrates the monomer/HOR inference and automatically generates the human monomers/HORs that are largely consistent with the previous semi-manual inference.

Citing Articles

Efficient genome monomer higher-order structure annotation and identification using the GRMhor algorithm.

Gluncic M, Baric D, Paar V Bioinform Adv. 2024; 4(1):vbae191.

PMID: 39659587 PMC: 11630843. DOI: 10.1093/bioadv/vbae191.


Novel Cascade Alpha Satellite HORs in Orangutan Chromosome 13 Assembly: Discovery of the 59mer HOR-The largest Unit in Primates-And the Missing Triplet 45/27/18 HOR in Human T2T-CHM13v2.0 Assembly.

Gluncic M, Vlahovic I, Rosandic M, Paar V Int J Mol Sci. 2024; 25(14).

PMID: 39062839 PMC: 11276891. DOI: 10.3390/ijms25147596.


Precise identification of cascading alpha satellite higher order repeats in T2T-CHM13 assembly of human chromosome 3.

Gluncic M, Vlahovic I, Rosandic M, Paar V Croat Med J. 2024; 65(3):209-219.

PMID: 38868967 PMC: 11157248.


The Satellite DNA PcH-Sat, Isolated and Characterized in the Limpet (Mollusca, Gastropoda), Suggests the Origin from a Nin-SINE Transposable Element.

Petraccioli A, Maio N, Carotenuto R, Odierna G, Guarino F Genes (Basel). 2024; 15(5).

PMID: 38790169 PMC: 11121367. DOI: 10.3390/genes15050541.


Novel Concept of Alpha Satellite Cascading Higher-Order Repeats (HORs) and Precise Identification of 15mer and 20mer Cascading HORs in Complete T2T-CHM13 Assembly of Human Chromosome 15.

Gluncic M, Vlahovic I, Rosandic M, Paar V Int J Mol Sci. 2024; 25(8).

PMID: 38673983 PMC: 11050224. DOI: 10.3390/ijms25084395.


References
1.
Bzikadze A, Pevzner P . Automated assembly of centromeres from ultra-long error-prone reads. Nat Biotechnol. 2020; 38(11):1309-1316. PMC: 10718184. DOI: 10.1038/s41587-020-0582-4. View

2.
Smith G . Evolution of repeated DNA sequences by unequal crossover. Science. 1976; 191(4227):528-35. DOI: 10.1126/science.1251186. View

3.
Waye J, Willard H . Chromosome-specific alpha satellite DNA: nucleotide sequence analysis of the 2.0 kilobasepair repeat from the human X chromosome. Nucleic Acids Res. 1985; 13(8):2731-43. PMC: 341190. DOI: 10.1093/nar/13.8.2731. View

4.
Paar V, Vlahovic I, Rosandic M, Gluncic M . Global Repeat Map (GRM): Advantageous Method for Discovery of Largest Higher-Order Repeats (HORs) in Neuroblastoma Breakpoint Family (NBPF) Genes, in Hornerin Exon and in Chromosome 21 Centromere. Prog Mol Subcell Biol. 2021; 60:203-234. DOI: 10.1007/978-3-030-74889-0_8. View

5.
Xue L, Gao Y, Wu M, Tian T, Fan H, Huang Y . Telomere-to-telomere assembly of a fish Y chromosome reveals the origin of a young sex chromosome pair. Genome Biol. 2021; 22(1):203. PMC: 8273981. DOI: 10.1186/s13059-021-02430-y. View