» Articles » PMID: 28360230

Sequences of 95 Human Haplotypes Reveal Extreme Coding Variation in Genes Other Than Highly Polymorphic and

Abstract

The most polymorphic part of the human genome, the encodes over 160 proteins of diverse function. Half of them, including the and genes, are directly involved in immune responses. Consequently, the region strongly associates with numerous diseases and clinical therapies. Notoriously, the region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp region from genomic DNA. For 95 homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the region shows the approach accurately determines the sequences of the highly polymorphic and genes and the complex structural diversity of complement factor It has also uncovered extensive and unexpected diversity in other genes; an example is , which encodes a lung mucin and exhibits more coding sequence alleles than any or gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome.

Citing Articles

MHConstructor: a high-throughput, haplotype-informed solution to the MHC assembly challenge.

Wade K, Suseno R, Kizer K, Williams J, Boquett J, Caillier S Genome Biol. 2024; 25(1):274.

PMID: 39420419 PMC: 11484429. DOI: 10.1186/s13059-024-03412-6.


Complex genetic variation in nearly complete human genomes.

Logsdon G, Ebert P, Audano P, Loftus M, Porubsky D, Ebler J bioRxiv. 2024; .

PMID: 39372794 PMC: 11451754. DOI: 10.1101/2024.09.24.614721.


HLA variants and inhibitor development in hemophilia A: results from the HEMFIL study group.

Santana M, Chaves D, Souza F, Rezende S Hematol Transfus Cell Ther. 2024; 46 Suppl 6:S431-S434.

PMID: 39358091 PMC: 11726080. DOI: 10.1016/j.htct.2024.05.015.


Targeted and complete genomic sequencing of the major histocompatibility complex in haplotypic form of individual heterozygous samples.

Hu T, Mosbruger T, Tairis N, Dinou A, Jayaraman P, Sarmady M Genome Res. 2024; 34(10):1500-1513.

PMID: 39327030 PMC: 11534196. DOI: 10.1101/gr.278588.123.


The 18th International HLA & Immunogenetics workshop project report: Creating fully representative MHC reference haplotypes.

Pollock N, Farias T, Kichula K, Sauter J, Scholz S, Nii-Trebi N HLA. 2024; 103(6):e15568.

PMID: 38923286 PMC: 11210686. DOI: 10.1111/tan.15568.


References
1.
de Bakker P, Raychaudhuri S . Interrogating the major histocompatibility complex with high-throughput genomics. Hum Mol Genet. 2012; 21(R1):R29-36. PMC: 3459647. DOI: 10.1093/hmg/dds384. View

2.
Wu Y, Savelli S, Yang Y, Zhou B, Rovin B, Birmingham D . Sensitive and specific real-time polymerase chain reaction assays to accurately determine copy number variations (CNVs) of human complement C4A, C4B, C4-long, C4-short, and RCCX modules: elucidation of C4 CNVs in 50 consanguineous subjects with.... J Immunol. 2007; 179(5):3012-25. DOI: 10.4049/jimmunol.179.5.3012. View

3.
MAYER W, Jonker M, Klein D, Ivanyi P, van Seventer G, Klein J . Nucleotide sequences of chimpanzee MHC class I alleles: evidence for trans-species mode of evolution. EMBO J. 1988; 7(9):2765-74. PMC: 457067. DOI: 10.1002/j.1460-2075.1988.tb03131.x. View

4.
Stewart C, Horton R, Allcock R, Ashurst J, Atrazhev A, Coggill P . Complete MHC haplotype sequencing for common disease gene mapping. Genome Res. 2004; 14(6):1176-87. PMC: 419796. DOI: 10.1101/gr.2188104. View

5.
Hijikata M, Matsushita I, Tanaka G, Tsuchiya T, Ito H, Tokunaga K . Molecular cloning of two novel mucin-like genes in the disease-susceptibility locus for diffuse panbronchiolitis. Hum Genet. 2010; 129(2):117-28. DOI: 10.1007/s00439-010-0906-4. View