» Articles » PMID: 39821835

Long-read Sequencing Reveals Novel Genetic Polymorphisms in the Major Histocompatibility Complex Region and Their Impacts on the Han Chinese Population

Overview
Date 2025 Jan 17
PMID 39821835
Authors
Affiliations
Soon will be listed here.
Abstract

Human leukocyte antigen (HLA) genes in the major histocompatibility complex (MHC) region are crucial for immunity and are associated with numerous diseases and phenotypes. The MHC region's complexity and high genetic diversity make it challenging to analyze using short-read sequencing (SRS) technology. We sequence the MHC region of 100 Han Chinese individuals using both long-read sequencing (LRS) and SRS platforms at approximately 30X coverage to study genetic alterations and their potential functional impacts. LRS provides significantly greater coverage of the MHC region and eight classical HLA genes, particularly at the HLA-DRB1 locus, compared with SRS. We detect 78,249 single nucleotide polymorphisms (SNPs) using LRS, with 26.0% undetectable by SRS. Based on SNP and inferred HLA allele types, we construct an LRS-based MHC reference panel for the Han Chinese, containing approximately 2.6 times more genetic variants than the SRS-based Han-MHC reference panel. A phenome-wide association study assessing 26,024 phenotypes across 15 categories identifies significant associations for 7,879 independent variants (including 809 LRS-specific SNPs) with 409 phenotypes in nine categories. This analysis reveals 24 unreported HLA allele associations in the bioelectric and cellular categories. The conditional analysis identifies 530 independent signals across the 409 phenotypes, including 28 previously unreported signals of eight classical HLA genes associated with 33 phenotypes. Of the top-associated SNPs, 191 are detected by LRS only. Fine-mapping identifies 126 independent candidate causal SNPs for three immune-related cellular phenotypes, with 17 detected exclusively by LRS. Our study reveals previously unreported variants and their functional impacts in the MHC region, enhancing our understanding of genetic diversity and its potential biological implications in the Han Chinese population.

References
1.
Bailey J, Gu Z, Clark R, Reinert K, Samonte R, Schwartz S . Recent segmental duplications in the human genome. Science. 2002; 297(5583):1003-7. DOI: 10.1126/science.1072047. View

2.
Bailey J, Yavor A, Massa H, Trask B, Eichler E . Segmental duplications: organization and impact within the current human genome project assembly. Genome Res. 2001; 11(6):1005-17. PMC: 311093. DOI: 10.1101/gr.gr-1871r. View

3.
Benner C, Spencer C, Havulinna A, Salomaa V, Ripatti S, Pirinen M . FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics. 2016; 32(10):1493-501. PMC: 4866522. DOI: 10.1093/bioinformatics/btw018. View

4.
Cao H, Wu J, Wang Y, Jiang H, Zhang T, Liu X . An integrated tool to study MHC region: accurate SNV detection and HLA genes typing in human MHC region using targeted high-throughput sequencing. PLoS One. 2013; 8(7):e69388. PMC: 3722289. DOI: 10.1371/journal.pone.0069388. View

5.
Chin C, Wagner J, Zeng Q, Garrison E, Garg S, Fungtammasan A . A diploid assembly-based benchmark for variants in the major histocompatibility complex. Nat Commun. 2020; 11(1):4794. PMC: 7508831. DOI: 10.1038/s41467-020-18564-9. View