A Comprehensively Molecular Haplotype-resolved Genome of a European Individual
Overview
Authors
Affiliations
Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, "Max Planck One" (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ~1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying "haploid landscapes," which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for "phase-sensitive" personal genomics. MP1's annotated haploid genomes are available as a public resource.
Duitama J Methods Mol Biol. 2022; 2590:273-286.
PMID: 36335504 DOI: 10.1007/978-1-0716-2819-5_16.
Hoehe M, Herwig R Methods Mol Biol. 2022; 2590:237-272.
PMID: 36335503 DOI: 10.1007/978-1-0716-2819-5_15.
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.
Hu Y, Yang C, Zhang L, Zhou X Methods Mol Biol. 2022; 2590:161-182.
PMID: 36335499 DOI: 10.1007/978-1-0716-2819-5_11.
A Simple Cost-Effective Method for Whole-Genome Sequencing, Haplotyping, and Assembly.
Wang O, Cheng X, Drmanac R, Peters B Methods Mol Biol. 2022; 2590:101-125.
PMID: 36335495 DOI: 10.1007/978-1-0716-2819-5_7.
Masset H, Ding J, Dimitriadou E, Debrock S, Tsuiko O, Smits K Nucleic Acids Res. 2022; 50(11):e63.
PMID: 35212381 PMC: 9226495. DOI: 10.1093/nar/gkac134.