Haplotype Sorting Using Human Fosmid Clone End-sequence Pairs
Overview
Affiliations
An important goal of human genetics and genomics is to understand the complete spectrum of genetic variation across a specific human haplotype. By combining information from a dense SNP map with fosmid end-sequence pairs (ESPs) aligned to the human genome reference sequence, we have developed a simple method to resolve human haplotypes using a previously developed clone resource. By partitioning ESPs into either haplotype, we have generated a haplotype-specific clone map for eight diploid genomes (four Yoruba African and four non-African samples). On average, 59% of each haploid genome is covered by haplotype-assigned clones with an N50 length of 110 kbp. By comparing this clone-based haplotype map against HapMap phased data sets, we estimate an error rate of 0.71% when trio information is available and 6.6% in its absence. We present these data in the form of an interactive browser that allows clones corresponding to specific haplotypes to be recovered and sequenced within these eight human genomes. As an example, we sequenced 165 fosmid clone inserts to generate 6.8 Mbp of sequenced haplotypes, and demonstrate its utility in uncovering phase-switching errors and for the discovery of novel SNPs especially in Asian and African samples. We discuss the potential application of this resource in understanding the pattern of genetic variation in complex regions of the genome that may not be adequately resolved by next-generation sequencing technology or SNP haplotype imputation.
A New Fast Phasing Method Based On Haplotype Subtraction.
Mocci E, Debeljak M, Klein A, Eshleman J J Mol Diagn. 2019; 21(3):427-436.
PMID: 30872187 PMC: 6504677. DOI: 10.1016/j.jmoldx.2018.12.004.
Dhillon T, Morohashi K, Stockinger E Plant Mol Biol. 2017; 94(3):333-347.
PMID: 28434151 DOI: 10.1007/s11103-017-0610-z.
Modeling Human Population Separation History Using Physically Phased Genomes.
Song S, Sliwerska E, Emery S, Kidd J Genetics. 2017; 205(1):385-395.
PMID: 28049708 PMC: 5223516. DOI: 10.1534/genetics.116.192963.
Genomic pathology of SLE-associated copy-number variation at the FCGR2C/FCGR3B/FCGR2B locus.
Mueller M, Barros P, Witherden A, Roberts A, Zhang Z, Schaschl H Am J Hum Genet. 2012; 92(1):28-40.
PMID: 23261299 PMC: 3542466. DOI: 10.1016/j.ajhg.2012.11.013.
Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.
Tyson J, Armour J BMC Genomics. 2012; 13:693.
PMID: 23231411 PMC: 3543183. DOI: 10.1186/1471-2164-13-693.