» Articles » PMID: 36932816

Complete Sequences of Six Major Histocompatibility Complex Haplotypes, Including All the Major MHC Class II Structures

Abstract

Accurate and comprehensive immunogenetic reference panels are key to the successful implementation of population-scale immunogenomics. The 5Mbp Major Histocompatibility Complex (MHC) is the most polymorphic region of the human genome and associated with multiple immune-mediated diseases, transplant matching and therapy responses. Analysis of MHC genetic variation is severely complicated by complex patterns of sequence variation, linkage disequilibrium and a lack of fully resolved MHC reference haplotypes, increasing the risk of spurious findings on analyzing this medically important region. Integrating Illumina, ultra-long Nanopore, and PacBio HiFi sequencing as well as bespoke bioinformatics, we completed five of the alternative MHC reference haplotypes of the current (GRCh38/hg38) build of the human reference genome and added one other. The six assembled MHC haplotypes encompass the DR1 and DR4 haplotype structures in addition to the previously completed DR2 and DR3, as well as six distinct classes of the structurally variable C4 region. Analysis of the assembled haplotypes showed that MHC class II sequence structures, including repeat element positions, are generally conserved within the DR haplotype supergroups, and that sequence diversity peaks in three regions around HLA-A, HLA-B+C, and the HLA class II genes. Demonstrating the potential for improved short-read analysis, the number of proper read pairs recruited to the MHC was found to be increased by 0.06%-0.49% in a 1000 Genomes Project read remapping experiment with seven diverse samples. Furthermore, the assembled haplotypes can serve as references for the community and provide the basis of a structurally accurate genotyping graph of the complete MHC region.

Citing Articles

Unraveling the architecture of major histocompatibility complex class II haplotypes in rhesus macaques.

de Groot N, van der Wiel M, Le N, de Groot N, Bruijnesteijn J, Bontrop R Genome Res. 2024; 34(11):1811-1824.

PMID: 39443153 PMC: 11610599. DOI: 10.1101/gr.278968.124.


MHConstructor: a high-throughput, haplotype-informed solution to the MHC assembly challenge.

Wade K, Suseno R, Kizer K, Williams J, Boquett J, Caillier S Genome Biol. 2024; 25(1):274.

PMID: 39420419 PMC: 11484429. DOI: 10.1186/s13059-024-03412-6.


Complex genetic variation in nearly complete human genomes.

Logsdon G, Ebert P, Audano P, Loftus M, Porubsky D, Ebler J bioRxiv. 2024; .

PMID: 39372794 PMC: 11451754. DOI: 10.1101/2024.09.24.614721.


Targeted and complete genomic sequencing of the major histocompatibility complex in haplotypic form of individual heterozygous samples.

Hu T, Mosbruger T, Tairis N, Dinou A, Jayaraman P, Sarmady M Genome Res. 2024; 34(10):1500-1513.

PMID: 39327030 PMC: 11534196. DOI: 10.1101/gr.278588.123.


DNA structural features and variability of complete MHC locus sequences.

Wassenaar T, Harville T, Chastain J, Wanchai V, Ussery D Front Bioinform. 2024; 4:1392613.

PMID: 39022183 PMC: 11251971. DOI: 10.3389/fbinf.2024.1392613.


References
1.
Byrska-Bishop M, Evani U, Zhao X, Basile A, Abel H, Regier A . High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell. 2022; 185(18):3426-3440.e19. PMC: 9439720. DOI: 10.1016/j.cell.2022.08.004. View

2.
Poplin R, Chang P, Alexander D, Schwartz S, Colthurst T, Ku A . A universal SNP and small-indel variant caller using deep neural networks. Nat Biotechnol. 2018; 36(10):983-987. DOI: 10.1038/nbt.4235. View

3.
Eggertsson H, Jonsson H, Kristmundsdottir S, Hjartarson E, Kehr B, Masson G . Graphtyper enables population-scale genotyping using pangenome graphs. Nat Genet. 2017; 49(11):1654-1660. DOI: 10.1038/ng.3964. View

4.
Wu Y, Savelli S, Yang Y, Zhou B, Rovin B, Birmingham D . Sensitive and specific real-time polymerase chain reaction assays to accurately determine copy number variations (CNVs) of human complement C4A, C4B, C4-long, C4-short, and RCCX modules: elucidation of C4 CNVs in 50 consanguineous subjects with.... J Immunol. 2007; 179(5):3012-25. DOI: 10.4049/jimmunol.179.5.3012. View

5.
Schneider V, Graves-Lindsay T, Howe K, Bouk N, Chen H, Kitts P . Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 2017; 27(5):849-864. PMC: 5411779. DOI: 10.1101/gr.213611.116. View