» Articles » PMID: 33828295

The Structure, Function and Evolution of a Complete Human Chromosome 8

Abstract

The complete assembly of each human chromosome is essential for understanding human biology and evolution. Here we use complementary long-read sequencing technologies to complete the linear assembly of human chromosome 8. Our assembly resolves the sequence of five previously long-standing gaps, including a 2.08-Mb centromeric α-satellite array, a 644-kb copy number polymorphism in the β-defensin gene cluster that is important for disease risk, and an 863-kb variable number tandem repeat at chromosome 8q21.2 that can function as a neocentromere. We show that the centromeric α-satellite array is generally methylated except for a 73-kb hypomethylated region of diverse higher-order α-satellites enriched with CENP-A nucleosomes, consistent with the location of the kinetochore. In addition, we confirm the overall organization and methylation pattern of the centromere in a diploid human genome. Using a dual long-read sequencing approach, we complete high-quality draft assemblies of the orthologous centromere from chromosome 8 in chimpanzee, orangutan and macaque to reconstruct its evolutionary history. Comparative and phylogenetic analyses show that the higher-order α-satellite structure evolved in the great ape ancestor with a layered symmetry, in which more ancient higher-order repeats locate peripherally to monomeric α-satellites. We estimate that the mutation rate of centromeric satellite DNA is accelerated by more than 2.2-fold compared to the unique portions of the genome, and this acceleration extends into the flanking sequence.

Citing Articles

Integrated analysis of the complete sequence of a macaque genome.

Zhang S, Xu N, Fu L, Yang X, Ma K, Li Y Nature. 2025; .

PMID: 40011769 DOI: 10.1038/s41586-025-08596-w.


The homologous recombination factors BRCA2 and PALB2 interplay with mismatch repair pathways to maintain centromere stability and cell viability.

Graham E, Rampazzo L, Leung C, Wall J, Gerocz E, Liskovykh M Cell Rep. 2025; 44(2):115259.

PMID: 39893637 PMC: 11860765. DOI: 10.1016/j.celrep.2025.115259.


Centromeric chromatin clearings demarcate the site of kinetochore formation.

Kixmoeller K, Tarasovetc E, Mer E, Chang Y, Black B Cell. 2025; 188(5):1280-1296.e19.

PMID: 39855195 PMC: 11890969. DOI: 10.1016/j.cell.2024.12.025.


SUMMER: an integrated nanopore sequencing pipeline for variants detection and clinical annotation on the human genome.

Li R, Chu H, Gao K, Luo H, Jiang Y Funct Integr Genomics. 2025; 25(1):21.

PMID: 39836277 PMC: 11750885. DOI: 10.1007/s10142-025-01534-z.


Mumemto: efficient maximal matching across pangenomes.

Shivakumar V, Langmead B bioRxiv. 2025; .

PMID: 39803467 PMC: 11722392. DOI: 10.1101/2025.01.05.631388.


References
1.
Venter J, Adams M, Myers E, Li P, Mural R, Sutton G . The sequence of the human genome. Science. 2001; 291(5507):1304-51. DOI: 10.1126/science.1058040. View

2.
Alkan C, Cardone M, Catacchio C, Antonacci F, OBrien S, Ryder O . Genome-wide characterization of centromeric satellites from multiple mammalian genomes. Genome Res. 2010; 21(1):137-45. PMC: 3012921. DOI: 10.1101/gr.111278.110. View

3.
Cheng H, Concepcion G, Feng X, Zhang H, Li H . Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021; 18(2):170-175. PMC: 7961889. DOI: 10.1038/s41592-020-01056-5. View

4.
Logsdon G, Vollger M, Eichler E . Long-read human genome sequencing and its applications. Nat Rev Genet. 2020; 21(10):597-614. PMC: 7877196. DOI: 10.1038/s41576-020-0236-x. View

5.
McNulty S, Sullivan B . Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res. 2018; 26(3):115-138. PMC: 6121732. DOI: 10.1007/s10577-018-9582-3. View