Automated Annotation of Human Centromeres with HORmon
Overview
Affiliations
Recent advances in long-read sequencing opened a possibility to address the long-standing questions about the architecture and evolution of human centromeres. They also emphasized the need for centromere annotation (partitioning human centromeres into monomers and higher-order repeats [HORs]). Although there was a half-century-long series of semi-manual studies of centromere architecture, a rigorous centromere annotation algorithm is still lacking. Moreover, an automated centromere annotation is a prerequisite for studies of genetic diseases associated with centromeres and evolutionary studies of centromeres across multiple species. Although the monomer decomposition (transforming a centromere into a monocentromere written in the monomer alphabet) and the HOR decomposition (representing a monocentromere in the alphabet of HORs) are currently viewed as two separate problems, we show that they should be integrated into a single framework in such a way that HOR (monomer) inference affects monomer (HOR) inference. We thus developed the HORmon algorithm that integrates the monomer/HOR inference and automatically generates the human monomers/HORs that are largely consistent with the previous semi-manual inference.
Gluncic M, Baric D, Paar V Bioinform Adv. 2024; 4(1):vbae191.
PMID: 39659587 PMC: 11630843. DOI: 10.1093/bioadv/vbae191.
Gluncic M, Vlahovic I, Rosandic M, Paar V Int J Mol Sci. 2024; 25(14).
PMID: 39062839 PMC: 11276891. DOI: 10.3390/ijms25147596.
Gluncic M, Vlahovic I, Rosandic M, Paar V Croat Med J. 2024; 65(3):209-219.
PMID: 38868967 PMC: 11157248.
Petraccioli A, Maio N, Carotenuto R, Odierna G, Guarino F Genes (Basel). 2024; 15(5).
PMID: 38790169 PMC: 11121367. DOI: 10.3390/genes15050541.
Gluncic M, Vlahovic I, Rosandic M, Paar V Int J Mol Sci. 2024; 25(8).
PMID: 38673983 PMC: 11050224. DOI: 10.3390/ijms25084395.