A Comprehensively Molecular Haplotype-resolved Genome of a European Individual

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2011 Aug 5
PMID: 21813624
Citations: 48
Abstract

Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, "Max Planck One" (MP1), by fosmid pool-based next-generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ~1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying "haploid landscapes," which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for "phase-sensitive" personal genomics. MP1's annotated haploid genomes are available as a public resource.
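
Two quantities in the abstract lend themselves to a small illustration: the N50 of phased block lengths and the cis/trans configuration of mutations within a gene. The Python sketch below is only an illustration under stated assumptions and does not reproduce the authors' pipeline. The function names `n50` and `configuration`, and the encoding of a phased genotype as a pair `(allele on haplotype 1, allele on haplotype 2)` with 0 for reference and 1 for mutant, are hypothetical choices made here.

```python
from typing import Iterable, List, Tuple


def n50(block_lengths: Iterable[int]) -> int:
    """Return the N50 of a set of block lengths.

    N50 is the length L such that blocks of length >= L together
    cover at least half of the summed length of all blocks.
    """
    lengths = sorted(block_lengths, reverse=True)
    if not lengths:
        raise ValueError("no block lengths given")
    half = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half:
            return length
    return lengths[-1]  # not reached for positive lengths


def configuration(phased_genotypes: List[Tuple[int, int]]) -> str:
    """Classify the mutations of one gene by phase.

    Assumed (hypothetical) encoding: each site is a pair
    (allele on haplotype 1, allele on haplotype 2), 0 = reference, 1 = mutant.
    "cis"   : every mutant allele lies on the same haplotype,
              so one copy of the gene is left unaffected.
    "trans" : both haplotypes carry at least one mutant allele.
    "none"  : no mutant allele at any site.
    """
    hap1_hit = any(a == 1 for a, _ in phased_genotypes)
    hap2_hit = any(b == 1 for _, b in phased_genotypes)
    if hap1_hit and hap2_hit:
        return "trans"
    if hap1_hit or hap2_hit:
        return "cis"
    return "none"
```

With unphased genotypes the two heterozygous sites in `[(1, 0), (1, 0)]` and `[(1, 0), (0, 1)]` look identical. Only the phase distinguishes the cis case, where one gene copy remains intact, from the trans case, where both copies are affected.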

Citing Articles

Phased Genome Assemblies.

Duitama J. Methods Mol Biol. 2022; 2590:273-286.

PMID: 36335504 DOI: 10.1007/978-1-0716-2819-5_16.


Analysis of 1276 Haplotype-Resolved Genomes Allows Characterization of Cis- and Trans-Abundant Genes.

Hoehe M, Herwig R. Methods Mol Biol. 2022; 2590:237-272.

PMID: 36335503 DOI: 10.1007/978-1-0716-2819-5_15.


Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.

Hu Y, Yang C, Zhang L, Zhou X. Methods Mol Biol. 2022; 2590:161-182.

PMID: 36335499 DOI: 10.1007/978-1-0716-2819-5_11.


A Simple Cost-Effective Method for Whole-Genome Sequencing, Haplotyping, and Assembly.

Wang O, Cheng X, Drmanac R, Peters B. Methods Mol Biol. 2022; 2590:101-125.

PMID: 36335495 DOI: 10.1007/978-1-0716-2819-5_7.


Single-cell genome-wide concurrent haplotyping and copy-number profiling through genotyping-by-sequencing.

Masset H, Ding J, Dimitriadou E, Debrock S, Tsuiko O, Smits K. Nucleic Acids Res. 2022; 50(11):e63.

PMID: 35212381 PMC: 9226495. DOI: 10.1093/nar/gkac134.

